Visitors Since 25-Aug-2014

Exam & Presentations
No Class: Monday December 1
I’ll be in my office (9/3469) to answer questions
or go over your presentation drafts.
Exam: Wednesday, December 3
1:00PM in 9/3400.
(CE Senior Projects Lab)
Presentations 1: Monday, December 8
1:00-2:50PM in 9/2139.
(normal lecture time/room)
Email talks by 2:00PM Friday December 5
Presentations 2: Wednesday, December 10
1:00-2:50PM in 9/2139.
(normal lecture time/room)
Email talks by 2:00PM Tuesday December 9
Presentations 3: Thursday, December 11
1:00-2:50PM in 9/2139.
(Reading Day)
Email talks by 2:00PM Wednesday December 10
Project papers due 2:00pm Monday December 15.
Email the paper and leave a hard copy under my office door (9/3469).
|


For the following lecture notes you can download or view a lecture as an Acrobat PDF file, or as a Microsoft Powerpoint file:
8-25-2014
|
Basics of Computer Design. Performance Measures.
Instruction Set Architecture (ISA) Characteristics and Classifications. CISC vs. RISC,
The MIPS64 ISA.
(Fourth Edition: Chapter 1, Appendix B, Third Edition: Chapters 1, 2)
|
|
|
8-27-2014
|
Pipelining Review: Definitions, Performance, Classic 5-stage Integer Pipeline, Pipeline Hazard Classification, Data Forwarding.
Compiler Pipeline Scheduling. Static Branch Prediction, Branch Delay Slot. MIPS R4000. Instruction Pipelining and Exception Handling.
Floating Point/Multicycle Pipelining
(Both Editions: Appendix A)
|
|
|
9-10-2014
|
Exploiting Instruction-Level Parallelism (ILP): Basic Instruction Block, Loop Unrolling. Further Classification of
Instruction Dependencies: Dependency Analysis and Graphs.
(Fourth Edition: Chapter 2.1, 2.2, Third Edition: Chapter 3.1, 4.1)
|
|
|
9-17-2014
|
Dynamic Hardware-Based Instruction Pipeline Scheduling: The Scoreboard, The Tomasulo Approach.
(Fourth Edition: Appendix A.7, Chapter 2.4, 2.5, Third Edition: Appendix A.8, Chapter 3.2, 3.3)
|
|
|
10-1-2014
|
Fundamental Dynamic Hardware-Based Branch Prediction Techniques: Branch-Target Buffer (BTB), Single-level,
Correlating Two-Level, Gshare, and Hybrid Dynamic Branch Predictors.
(Fourth Edition: Chapter 2.3, 2.9, Third Edition: Chapter 3.4, 3.5, 4.2)
|
|
|
10-6-2014
|
Multiple Instruction Issue, CPI <1 Approaches: Superscalar, VLIW. Hardware-Based Speculation: Speculative Tomasulo.
(Fourth Edition: Chapter 2.6-2.8, Third Edition: Chapter 3.6, 3.7, 4.3)
|
|
|
10-15-2014
|
Data Parallelism & Loop-Level Parallelism (LLP) Analysis. GCD Test. Software Pipelining.
FYI: Brief Introduction to Vector Processing.
Fourth Edition: Appendix G.1-3,
Third Edition: Chapter 4.4)
|
|
|
10-27-2014
|
Review of Memory Hierarchy & Cache Basics (from 350). 3Cs of Cache Misses, Cache Write Strategies & Performance.
Multi-Level Cache.
(Fourth Edition: Chapter 5.1, Appendix C.1-C.3 Third Edition: Chapter 5.1-5.4)
|
|
|
11-10-2014
|
Input/Output & System Performance Issues.
(Fourth Edition: Chapter 6.1, 6.2, 6.4, 6.5 Third Edition: Chapter 7.1-7.3, 7.7, 7.8)
|
|
|
11-17-2014
|
The Memory Hierarchy: Main Memory Issues. Performance Metrics: Latency & Bandwidth. DRAM System Memory Generations.
Basic Memory Bandwidth Improvement/Miss Penalty Reduction Techniques.
(Fourth Edition: Chapter 5.3 Third Edition: Chapter 5.8, 5.9)
|
|
|
11-24-2014
|
Exam Review.
|
|
|

1:00-2:15 PM Monday and Wednesday in (GLE) 9/2139
|

Quizzes/Homework: 40%
Exam: 35%
Special Topic paper and presentation: 25%
Quizzes:
Quizzes are announced one class in advance, and are given only during first 20-30 minutes of the specified class.
Quizzes are closed references (e.g. no books, notes, handouts, etc...). Calculators will be helpful.
There are no makeup quizzes and your lowest quiz grade is automatically excluded from the average calculation.
2-hour Exam:
A single all-inclusive 2-hour examination is given in week ~ 15.
Books, notes, quizzes and handouts are allowed in the exam. A calculator will be very helpful.
Special Topic paper and presentation:
Students will select one partner from the class to research a significant topic
in the field of Computer Architecture, write a report, and give a presentation. Each group’s topic must be presented
and approved by Dr. Shaaban. Duplicate topics are not permitted and proposals are accepted on a first come first serve basis.
The Paper: Each group will write a joint report (~ 6-8 pages) on their research using
the IEEE journal format/guidelines/template.
DO NOT CHANGE THE TEMPLATE!
Take great care in following the guidelines, especially properly citing illustrations, graphs and quoting from their respective sources.
The paper is due (hardcopy and electronic) at the beginning of the last day of the presentations.
Late submissions will be significantly penalized.
Plagiarism will result in a Zero
(see page 14 of the KGCOE 2014-2015 Undergraduate Student Handbook).
The Presentation: Each group will give a 20-minute PowerPoint presentation of their research to the entire class.
This is a joint presentation and the group must be thoroughly prepared to answer questions.
A signup sheet for a time slot will be available towards the end of the quarter.
Attendance is mandatory for all presentation sessions.
Missing your presentation slot and/or electronic submission time will result in a zero.
You must submit your Microsoft Power Point presentation electronically to Dr. Shaaban 24 hrs prior to your presentation time slot.
Samples of prior presentations are available on the course website.
|

he course covers various aspects of advanced
uniprocessor computer architecture design.
Instruction set architecture design alternatives are discussed.
Techniques to enhance performance such as pipelined execution optimizations, branch-handling, exploiting
instruction-level parallelism, multiple-instruction issue and dynamic scheduling are studied.
Cache, and memory hierarchy design and performance issues are also presented.
Finally, the design of efficient and reliable input/output systems are covered.
|
Computer Organization CMPE350.
|

Computer Architecture: A Quantitative Approach, Fourth Edition, John Hennessy, and David Patterson,
Morgan Kaufmann Publishers, 2006.
Reference Papers:
- Virtual Memory:
- Virtual memory: Issues of implementation,
PDF,
B. Jacob, and T. Mudge,
Computer, vol. 31, no. 6, pp. 33-43. June 1998.
- Virtual memory in contemporary microprocessors,
PDF,
B. Jacob, and T. Mudge,
Micro, vol. 18, no. 4, pp. 60-75. July/Aug. 1998.
- I/O Performance, RAID, Unix I/O Performance:
- Maximizing Performance in a Striped Disk Array,
PDF,
P. Chen and D.A. Patterson,
Proc. 17th Annual IEEE Symposium on Computer Architecture, 1990, pp. 322-331.
- Storage Performance--Metrics and Benchmarks,
PDF,
P. Chen and D. Patterson,
Proceedings of the IEEE 81(8):1151-1165, Aug., 1993.
- RAID: HighPerformance, Reliable Secondary Storage,
PDF,
P. M. Chen, E. K. Lee, G. A. Gibson, R. H. Katz and D. A. Patterson,
ACM Computing Surveys, Vol.26, No.2, June 1994, pp.145-185.
- Unix I O Performance in Workstations and Mainframes,
PDF,
Peter M. Chen, David A. Patterson,
Dept. of Electrical Engr. and Computer Science, University
of Michigan, Technical Report, CSE-TR-200-94, 1994.
- Striping in a RAID Level 5 Disk Array,
PDF,
P. Chen, P.M., AND E. Lee,
Proc. 1995 ACM SIGMETRICS Conference on Measurement and Modeling of
Computer Systems, pp.136---145, May 1995.
- Vector Processing, Vector IRAM:
- Vector Processors,
PDF,
Appendix G,
Computer Architecture: A Quantitative Approach, Third Edition.
-
Scalable Processors in the Billion Transistor Era: IRAM,
PDF,
Christoforos E. Kozyrakis et al.
IEEE Computer Special Issue: Future Microprocessors - How to use a Billion Transistors, September 1997.
|

Attending all lecture sessions is expected.
|

Week1: |
Fundamentals of Computer Architecture Design, Performance Measures Review.
Instruction Set Principles Review and Examples. CISC vs. RISC. A Typical RISC
Example: The MIPS64 Architecture. |
Week2: |
CPU Pipelining Concepts Review, Pipeline Hazards. Branch Handling: Static branch prediction
Static Compiler pipeline scheduling. Pipelining and Exception Handling. |
Week3: |
Advanced Pipelining, Exploiting Instruction-Level Parallelism (ILP).
Pipeline Scheduling, Loop Unrolling. Instruction Dependencies Analysis. |
Week4-5: |
Dynamic Instruction Scheduling: The Scoreboard Approach, The Tomasulo Approach. |
Week6: |
Dynamic Hardware-based Branch Prediction. |
Week7-8: |
Exploiting ILP Further: Multiple-Instruction Issue Approaches: Superscalar, VLIW.
Software and hardware Speculation. Compiler ILP support. |
Week9-10: |
Cache Design Issues, Memory-Hierarchy Design.
Advanced Techniques to Improve Cache Performance. |
Week11-12: |
Storage Systems, Bus Design, I/O Performance Measures and Benchmarks
Reliable Storage: Redundant Array of Inexpensive Disks (RAID).
I/O System Design Issues. |
Week13: |
Main Memory Issues and Performance. Virtual Memory Architecture. |
Week14: |
Introduction to parallel processing and multiprocessor system architecture. |
Week15-17: |
Exam/Project Presentations. |
|
 
This page is bytes long
and was last modified on:  
.
Made with at least 30% post-consumer recycled bits 
|