CS3350B Computer Architecture: Marc Moreno Maza
Introduction
http://www.csd.uwo.ca/~moreno/cs3350_moreno/index.html
Department of Computer Science
University of Western Ontario, Canada
[Figure: memory hierarchy of a multicore processor — four cores, each with its own L1 instruction and L1 data cache, two shared L2 caches, and main memory]
Example cache parameters:
Cache                  Size    Line Size   Latency     Associativity
L1 Data Cache          32 KB   64 bytes    3 cycles    8-way
L1 Instruction Cache   32 KB   64 bytes    3 cycles    8-way
L2 Cache               6 MB    64 bytes    14 cycles   24-way
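These parameters are easy to observe from software. A minimal sketch in C (the array size N and the function name sum_stride are illustrative choices; clock_gettime is the standard POSIX timer), assuming the 64-byte line size from the table: striding by one 4-byte int touches every word of a cache line, while striding by 16 ints touches each line only once, so the second loop does 1/16 of the arithmetic yet runs far slower than 1/16 of the time once the array outgrows the caches — the memory system, not the ALU, dominates.

#include <stdio.h>
#include <stdlib.h>
#include <time.h>

#define N (1 << 24)          /* 16 M ints = 64 MB, far larger than the 6 MB L2 */

/* Sum the array with a given stride (in ints). Stride 16 (= 64 bytes)
 * touches each cache line exactly once. */
static long sum_stride(const int *a, int stride) {
    long s = 0;
    for (int i = 0; i < N; i += stride)
        s += a[i];
    return s;
}

int main(void) {
    int *a = malloc(N * sizeof *a);
    for (int i = 0; i < N; i++) a[i] = 1;

    for (int stride = 1; stride <= 16; stride *= 16) {
        struct timespec t0, t1;
        clock_gettime(CLOCK_MONOTONIC, &t0);
        long s = sum_stride(a, stride);
        clock_gettime(CLOCK_MONOTONIC, &t1);
        double ms = (t1.tv_sec - t0.tv_sec) * 1e3 +
                    (t1.tv_nsec - t0.tv_nsec) / 1e6;
        printf("stride %2d: sum=%ld, %.1f ms\n", stride, s, ms);
    }
    free(a);
    return 0;
}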
Classes of computers
▸ Personal computers
▸ General purpose, variety of software
▸ Subject to cost/performance trade-off
▸ Server computers
▸ Network based
▸ High capacity, performance, reliability
▸ Range from small servers to building-sized installations
▸ Supercomputers
▸ High-end scientific and engineering calculations
▸ Highest capability, but represent a small fraction of the overall computer market
▸ Embedded computers
▸ Hidden as components of systems
▸ Stringent power/performance/cost constraints
Components of a computer
▸ Application software
▸ Written in a high-level language (HLL)
▸ System software
▸ Compiler: translates HLL code to machine code
▸ Operating system: service code
▸ Handling input/output
▸ Managing memory and storage
▸ Scheduling tasks & sharing resources
▸ Hardware
▸ Processor, memory, I/O controllers
Levels of program code
▸ High-level language
▸ Level of abstraction closer to the problem domain
▸ Provides for productivity and portability
▸ Assembly language
▸ Textual representation of machine instructions
▸ Hardware representation
▸ Binary digits (bits)
▸ Encoded instructions and data
Old-school machine structures (layers of abstraction)
New-school machine structures
Software and hardware exploit parallelism at every level:
▸ Parallel requests: assigned to a computer, e.g., search for “Katz”
▸ Parallel threads: assigned to a core, e.g., look-up, ads
▸ Parallel instructions: >1 instruction at one time, e.g., 5 pipelined instructions
▸ Parallel data: >1 data item at one time, e.g., an add of 4 pairs of words (sketched in the code after this list)
▸ Hardware descriptions: all gates working in parallel at the same time
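As an illustration of the “parallel data” level, a minimal sketch using x86 SSE2 intrinsics (the choice of SSE2 and the variable names are assumptions, not from the slide; any SIMD instruction set works the same way) that adds 4 pairs of 32-bit words with a single instruction:

#include <stdio.h>
#include <emmintrin.h>   /* SSE2 intrinsics: __m128i, _mm_add_epi32 */

int main(void) {
    int a[4] = {1, 2, 3, 4};
    int b[4] = {10, 20, 30, 40};
    int c[4];

    /* Load 4 words from each array, add all 4 pairs with ONE
     * instruction (paddd), and store the 4 results back. */
    __m128i va = _mm_loadu_si128((const __m128i *)a);
    __m128i vb = _mm_loadu_si128((const __m128i *)b);
    __m128i vc = _mm_add_epi32(va, vb);
    _mm_storeu_si128((__m128i *)c, vc);

    printf("%d %d %d %d\n", c[0], c[1], c[2], c[3]);  /* 11 22 33 44 */
    return 0;
}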
Why have computers become so complicated?
The pursuit of performance!
▸ Eight great ideas
▸ Use abstraction to simplify design
▸ Design for Moore’s Law
▸ Make the common case fast
▸ Performance via parallelism
▸ Performance via pipelining
▸ Performance via prediction
▸ Hierarchy of memories
▸ Dependability via redundancy
Great Idea #1: Abstraction
High-level language program (in C):
temp = v[k];
v[k] = v[k+1];
v[k+1] = temp;
Assembly language program (for MIPS):
lw $t0, 0($2)
lw $t1, 4($2)
sw $t1, 0($2)
sw $t0, 4($2)
# Anything can be represented as a number, i.e., data or instructions
Binary machine language program (for MIPS):
0000 1001 1100 0110 1010 1111 0101 1000
1010 1111 0101 1000 0000 1001 1100 0110
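A minimal complete program around the same snippet (the wrapper and the name swap_adjacent are illustrative additions, not from the slide); compiling with gcc -S or clang -S shows the assembly level, and objdump -d the binary level:

#include <stdio.h>

/* Swap v[k] and v[k+1] -- the same three HLL statements shown above. */
void swap_adjacent(int v[], int k) {
    int temp = v[k];
    v[k] = v[k + 1];
    v[k + 1] = temp;
}

int main(void) {
    int v[] = {5, 9, 7};
    swap_adjacent(v, 1);                      /* swap v[1] and v[2] */
    printf("%d %d %d\n", v[0], v[1], v[2]);   /* prints: 5 7 9 */
    return 0;
}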
Great idea #2: Moore’s Law
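Moore’s Law is commonly stated as transistor counts doubling roughly every two years. As a back-of-the-envelope formula (the two-year doubling period is the commonly quoted figure, not taken from the slide):

N(t) ≈ N₀ · 2^(t/2)

where N₀ is the transistor count in a reference year and t is the number of years since. For example, starting from the 2,300 transistors of the Intel 4004 (1971), twenty doublings by 2011 give about 2,300 · 2²⁰ ≈ 2.4 billion transistors, roughly the count of high-end processors of that era.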
Great idea #4: Performance via parallelism
Great idea #5: Performance via pipelining
Great idea #7: Memory hierarchy (principle of locality)
Great Idea #8: Dependability via redundancy
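Dependability is commonly quantified by availability, in terms of mean time to failure (MTTF) and mean time to repair (MTTR); this standard formula is textbook background, not on the slide itself:

Availability = MTTF / (MTTF + MTTR)

For example, MTTF = 1,000 hours and MTTR = 1 hour give 1000/1001 ≈ 99.9% availability; redundancy raises availability by raising the effective MTTF.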
Understanding performance
▸ Algorithm: determines the number of operations executed
▸ Programming language, compiler, architecture: determine the number of machine instructions executed per operation
▸ Processor and memory system: determine how fast instructions are executed (combined in the equation below)
▸ I/O system (including OS): determines how fast I/O operations are executed
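The first three factors combine in the classic CPU performance equation (standard textbook material, revisited under Performance Metrics I below):

CPU time = Instruction count × CPI × Clock cycle time = (Instruction count × CPI) / Clock rate

For example, a program that executes 10⁹ instructions at an average CPI of 2 on a 2 GHz clock takes (10⁹ × 2) / (2 × 10⁹) = 1 second.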
What you will learn
1. Introduction
▸ Machine structures: layers of abstraction
▸ Eight great ideas
2. Performance Metrics I
▸ CPU performance
▸ perf, a profiling tool
3. Memory Hierarchy
▸ The principle of locality
▸ DRAM and cache
▸ Cache misses
▸ Performance metrics II: memory performance and profiling
▸ Cache design and cache mapping techniques
4. MIPS Instruction Set Architecture (ISA)
▸ MIPS number representation
▸ MIPS instruction format, addressing modes and procedures
▸ SPIM assembler and simulator
Course Topics (cont’d)
5. Introduction to Logic Circuit Design
▸ Switches and transistors
▸ State circuits
▸ Combinational logic circuits
▸ Combinational logic blocks
▸ MIPS single-cycle and multiple-cycle CPU data-path and control
6. Instruction Level Parallelism
▸ Pipelining the MIPS ISA
▸ Pipelining hazards and solutions
▸ Multiple issue processors
▸ Loop unrolling, SSE
7. Multicore Architecture
▸ Multicore organization
▸ Memory consistency and cache coherence
▸ Thread level parallelism
8. GPU Architecture
▸ Memory model
▸ Execution model: scheduling and synchronization
Student evaluation
Acknowledgements
The lecture slides for this course are adapted from the slides accompanying the textbook and from teaching materials posted on the web by other instructors of computer architecture courses.