Skip to main content Link Menu Expand (external link) Document Search Copy Copied

Weekly Schedule

Week 1 8/19-8/23Lecture TopicAssignment 
MonCourse IntroductionWelcome!
B. Dally et. al Domain-Specific Hardware Accelerators Comm of the ACM 2020
 
WedsTechnology Trends ReviewReview from Chapter 1 Computer Architecture: A Quantitative Approach, Hennessy and Patterson 
FriMoore’s Law, Dennard Scaling ReviewReading: Cramming more components onto integrated circuits 
Week 2 8/26-8/30Lecture TopicAssignment 
MonFinish review on Parallelism, Flynn’s TaxonomyReading: 
WedsGeneral Purpose Processors and the Virtuous CycleReading: N. Thompson, S. Spanuth The decline of computers as a general purpose technology
A. Fuchs, D. Wentzlaff The Accelerator Wall: Limits of Chip Specialization,
 
FriGeneral Purpose Processors and the Virtuous CycleReading: N. Thompson, S. Spanuth The decline of computers as a general purpose technology
A. Fuchs, D. Wentzlaff The Accelerator Wall: Limits of Chip Specialization,
 
Week 3 9/2-9/6Lecture TopicAssignment 
MonLabor Day  
WedsProcessor In Memory ArchitecturesReading: K. Asi Fuzzaman et. al. A Survey on processing-in-memory techniques: Advances and challenges 
FriProcessor In Memory ArchitecturesReading: K. Asi Fuzzaman et. al. A Survey on processing-in-memory techniques: Advances and challenges 
Week 4 9/9-9/13Lecture TopicAssignment 
MonTutorial on Memory-Centric ComputingRead Computational Power and AI 
WedsMachine Learning Boot CampReading: H&P Comp Arch: a Quant. Approach Ch 7.3-4
Optional: Implications of Makimoto’s Wave T. Makimoto
 
FriMachine Learning Boot CampReading: H&P Comp Arch: a Quant. Approach Ch 7.3-4
Optional: Implications of Makimoto’s Wave T. Makimoto
 
Week 5 9/16-9/20Lecture TopicAssignment 
MonIntroduction to the Roofline ModelRoofline: an insightful visual performance model for multicore architectures](https://dl.acm.org/doi/10.1145/1498765.1498785) S. Williams et. al. 
WedsIn-Datacenter Performance Analysis of a Tensor Processing UnitIn-Datacenter Performance Analysis of a Tensor Processing Unit N. Jouppi et. al
Roofline: an insightful visual performance model for multicore architecturesS. Williams et. al.
 
FriIn-Datacenter Performance Analysis of a Tensor Processing UnitIn-Datacenter Performance Analysis of a Tensor Processing Unit N. Jouppi et. al. 
Week 6 9/23-9/27Lecture TopicAssignment 
MonIn-Datacenter Performance Analysis of a Tensor Processing UnitIn-Datacenter Performance Analysis of a Tensor Processing Unit N. Jouppi et. al. 
WedsPaper Selection/Team Formations  
FriPresentation Outline  
Week 7 9/30-10/4Lecture TopicAssignment 
MonPresentation Style/Team Formations  
Weds   
FriPIM TechnologiesAccelerating Neural Network Inference with Processing-in-DRAM:From the Edge to the CloudG. Oliveira et. al.,extended and updated version of a paper published in IEEE Micro, pp. 1-14, 29 Aug. 2022. 
Week 8 10/7-10/11Lecture TopicAssignment 
MonBit Serial ArithmeticAccelerating Neural Network Inference with Processing-in-DRAM:From the Edge to the CloudG. Oliveira et. al.,extended and updated version of a paper published in IEEE Micro, pp. 1-14, 29 Aug. 2022. 
WedsProcessing in DRAMAccelerating Neural Network Inference with Processing-in-DRAM:From the Edge to the CloudG. Oliveira et. al.,extended and updated version of a paper published in IEEE Micro, pp. 1-14, 29 Aug. 2022. 
FriUPMEMAccelerating Neural Network Inference with Processing-in-DRAM:From the Edge to the CloudG. Oliveira et. al.,extended and updated version of a paper published in IEEE Micro, pp. 1-14, 29 Aug. 2022. 
Week 9 10/14-10/18Lecture TopicAssignment 
MonFall Break  
WedsStephanie,Abe,NathanielY. Chen and M. S. Abdelfattah, “BRAMAC: Compute-in-BRAM Architectures for Multiply-Accumulate on FPGAs,” 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), Marina Del Rey, CA, USA, 2023, pp. 52-62, doi: 10.1109/FCCM57271.2023.00015. 
FriNick,Karis,Kyle,CaseJ. Choquette, W. Gandhi, O. Giroux, N. Stam, and R. Krashinsky,NVIDIA A100 Tensor Core GPU: Performance and Innovation IEEE Micro Volume: 41, Issue: 2, 01 March-April 2021 
Week 10 10/21-10/25Lecture TopicAssignment 
MonUPMEMThe true Processor in Memory AcceleartorF. Devaux, IEEE Hot Chips 31 Symposium (HCS) 2019 
WedsMarvin,Jeff,DhruvJ.D. Kendall, S. Kumar The building blocks of a brain-inspired computer Appl. Phys. Rev. 1 March 2020; 7 (1): 011305. 
FriThomas,Matthew,StephenJ. Vasiljevic, D. Capalija Blackhole & TT-Mealium: The Standalone AI Computer and its Programming Model Proceedings of the 2024 IEEE Hot Chips 36 Symposium (HCS) 
Week 11 10/28-11/1Lecture TopicAssignment 
MonEmerging NVMs  
WedsNicholas,Donna,OgdenSean Lie Cerebras Architecture Deep Dive: First Look Inside the Hardware/Software Co-Design for Deep Learning IEEE Micro Volume: 43, Issue: 3, May-June 2023 
FriGrant,Gideon,HenryD. Brooks et. al RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing Proceedings of the 47th ACM/IEEE International Symposium on Computer Architecture (ISCA) Valencia, Spain, 2020, pp. 790-803, doi: 10.1109/ISCA45697.2020.00070. 
Week 12 11/4-11/8Lecture TopicAssignment 
MonEmerging NVMs  
WedsDRAM PIMMemory-Centric Computing with SK hynix’s Domain-Specific Memory 2023 Y. Kwon et. al.,IEEE Hot Chips 35 Symposium (HCS) 2023 
FriPUMIn Memory Intelligence Tim Finkbeiner et. al., IEEE Micro Volume: 37, Issue: 4, 2017 
Week 13 11/11-11/15Lecture TopicAssignment 
Mon   
WedsStephanie,Abe,NathanielAmam Arora et. al. CoMeFa: Compute-in-Memory Blocks for FPGAsProceedings of the IEEE 30th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) May 2022 
FriNick,Karis,Kyle,CaseT. Finkbeiner et. al. In-Memory Intelligence IEEE Micro Volume: 37, Issue: 4, 2017 
Week 14 11/18-11/22Lecture TopicAssignment 
Mon   
WedsMarvin,Jeff,DhruvMurali Emani et. al. Accelerating Scientific Applications With SambaNova Reconfigurable Dataflow Architecture Computing in Science & Engineering ( Volume: 23, Issue: 2, 01 March-April 2021) 
FriThomas,Matthew,StephenNorman Jouppi et. al. Ten Lessons From Three Generations Shaped Google’s TPUv4i : Industrial Product Proceedings of the 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA) 
Week 15 11/25-11/29Lecture TopicAssignment 
Mon   
WedsThanksgiving Break!  
FriThanksgiving Break!  
Week 16 12/2-12/6Lecture TopicAssignment 
MonNicholas,Donna,OgdenMingyi Rao et. al. Thousands of conductance levels in memristors integrated on CMOS Nature, Vol 615, 30 March 2023 
WedsGrant,Gideon,HenryEmil Talpes et. al. Compute Solution for Tesla’s Full Self-Driving Computer IEEE Micro Volume 40, Issue 2, 01 March-April 2020 
FriReading Day  
Final 12/11**10:15pm - 12:15pm **Final is not comprehensive