ECE 721 Spring 2020 Advanced Microarchitectures
Schedule
date | topic | notes | quiz |
---|---|---|---|
Tues. Jan. 7 | Overview of class topics | ppt, pptx | |
TOPIC 1: Modern Superscalar Processors | |||
Thurs. Jan. 9 | Physical Register File management: phys. RF, RMT, freelist | pptx, pptx | |
Tues. Jan. 14 | Physical Register File management: committing and freeing registers, exception recovery, branch misprediction recovery | pptx | D. Wall. Limits of Instruction Level Parallelism. ASPLOS IV, April 1991. ACM Digital Library and IEEE Xplore paper links will work as-is if you are working from a computer on the NCSU network. If not accessing from an NCSU network, log in to lib.ncsu.edu and then use this proxy link. I obtained this link by logging in to lib.ncsu.edu and searching journals for ACM Digital Library (for IEEE papers use IEEE Xplore), and then from the ACM DL I searched on the paper title. |
Thurs. Jan. 16 | overflow lecture | ||
Tues. Jan. 21 | overflow lecture | ||
Thurs. Jan. 23 | Dynamic Scheduling Algorithm: Phys. RF ready bits, Issue Queue Sizing structures | pptx pptx | K. C. Yeager. The Mips R10000 superscalar microprocessor. IEEE Micro, 16(2):28-41, April 1996. |
Tues. Jan. 28 | Handling loads and stores: terminology (memory disambiguation and store-load forwarding), LQ/SQ operation, load speculation and memory dependence predictors | pptx | |
Thurs. Jan. 30 | overflow lecture | ||
Tues. Feb. 4 | overflow lecture | G. Chrysos and J. S. Emer. Memory Dependence Prediction Using Store Sets. ISCA-25, 1998. | |
Thurs. Feb. 6 | Canonical Superscalar Pipeline Pipeline stages: fetch, decode, rename, dispatch | pptx pptx | |
Tues. Feb. 11 | Pipeline stages: schedule, register read, execute, writeback, retire | pptx | |
Thurs. Feb. 13 | overflow lecture | ||
TOPIC 2: High-ILP Processors | |||
Tues. Feb. 18 | Trace cache | pptx | E. Rotenberg, S. Bennett, and J. E. Smith. Trace Cache: a Low Latency Approach to High Bandwidth Instruction Fetching. MICRO-29, Dec. 1996. |
Thurs. Feb. 20 | overflow lecture | ||
Tues. Feb. 25 | Value prediction | pptx | Y. Sazeides and J. E. Smith. The Predictability of Data Values. MICRO-30, Dec. 1997. |
Thurs. Feb. 27 | Midterm Exam | ||
Tues. Mar. 3 | Predication | pptx (aux. pptx) | A. Klauser, T. Austin, D. Grunwald, and B. Calder. Dynamic Hammock Predication for Non-predicated Instruction Set Architectures. PACT, Oct. 1998. |
Thurs. Mar. 5 | overflow lecture | ||
Tues. Mar. 10 | Spring Break | ||
Thurs. Mar. 12 | Spring Break | ||
Tues. Mar. 17 | NCSU-extended spring break Advice on research projects: Project guidelines Report formatPresentation guidelines | zoom recording | |
Thurs. Mar. 19 | NCSU-extended spring break | NO class this day | |
Tues. Mar. 24 | Trace processors | pre-recorded lecture pptx take the quiz zoom (discuss quiz, and Q&A on pre-recorded lecture) zoom recording | E. Rotenberg, Q. Jacobson, Y. Sazeides, and J. E. Smith. Trace processors. MICRO-30, Dec. 1997. |
Thurs. Mar. 26 | Simultaneous multithreading (SMT) | pre-recorded lecture: Part 1 Part 2 pptx take the quiz zoom (discuss quiz, and Q&A on pre-recorded lecture) zoom recording | D. M. Tullsen et al. Exploiting Choice: Instruction Fetch and Issue on an Implementable Simultaneous Multithreading Processor. ISCA-23, May 1996. |
TOPIC 3: Large-WindowProcessors | |||
Tues. Mar. 31 | Checkpoint Processing and Recovery (CPR): fine-grain (ROB) vs. coarse-grain (Checkpoint) recovery, aggressive register reclamation operation, example simulation | pre-recorded lecture: Part 1 Part 2 pptx pptx take the quiz zoom (discuss quiz, and Q&A on pre-recorded lecture) zoom recording | H. Akkary, R. Rajwar, and S. Srinivasan. Checkpoint Processing and Recovery: Towards Scalable Large Instruction Window Processors. MICRO-36, 2003. |
Thurs. Apr. 2 | Continual Flow Pipelines (CFP) | zoom recording pptx pptx (enhanced ROB+CFP animation) Project presentation guidelines | NOW OPTIONAL READING: S. Srinivasan, R. Rajwar, H. Akkary, A. Gandhi, and M. Upton. Continual Flow Pipelines. ASPLOS’04, 2004. |
Tues. Apr. 7 | Runahead Execution | zoom recording (see prev. pptx) | NOW OPTIONAL READING: O. Mutlu, J. Stark, C. Wilkerson, and Y. Patt. Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-Order Processors. HPCA-9, 2003. |
Thurs. Apr. 9 | project presentations | zoom | |
Tues. Apr. 14 | project presentations | zoom | |
Thurs. Apr. 16 | project presentations | zoom | |
Tues. Apr. 21 | project presentations | zoom | |
Thurs. Apr. 23 | project presentations | zoom |