
CIS775: Computer Architecture
Chapter 1: Fundamentals of Computer Design
1

Course Objectives
To evaluate the issues involved in choosing and designing an instruction set.
To learn the concepts behind advanced pipelining techniques.
To understand the "hitting the memory wall" problem and the current state of the art in memory system design.
To understand the qualitative and quantitative tradeoffs in the design of modern computer systems.

What is Computer Architecture?


The functional operation of the individual HW units within a computer system, and the flow of information and control among them.

[Diagram: Computer Architecture at the intersection of Technology, Parallelism, Hardware Organization, Measurement & Evaluation, the Programming Language Interface, Interface Design (ISA), Applications, and the OS.]
3

Computer Architecture Topics


[Diagram: the major topic areas and the hardware they cover]

Input/Output and Storage: disks, WORM, tape; RAID; emerging technologies; interleaved memories
Memory Hierarchy: DRAM, L2 cache, L1 cache; VLSI; coherence, bandwidth, latency
Instruction Set Architecture: addressing, protection, exception handling
Pipelining and Instruction Level Parallelism: pipelining, hazard resolution, superscalar, reordering, prediction, speculation, vector, DSP

Computer Architecture Topics


[Diagram: processors (P), memories, and an interconnection network (processor-memory-switch organization)]

Multiprocessors: shared memory, message passing, data parallelism
Networks and Interconnections: network interfaces; topologies, routing, bandwidth, latency, reliability

Measurement and Evaluation


Architecture is an iterative process:
  searching the space of possible designs
  at all levels of computer systems

[Diagram: the design loop: creativity produces new designs; cost/performance analysis sorts them into good, bad, and mediocre ideas; the results feed the next round of design and analysis.]
6

Issues for a Computer Designer


Functional requirements analysis (target):
  Scientific computing: high-performance floating point
  Business: transactional support, decimal arithmetic
  General purpose: balanced performance for a range of tasks
Level of software compatibility:
  Programming-language level: flexible, but needs a new compiler; portability is an issue
  Binary level (e.g., the x86 architecture): little flexibility, but minimal portability requirements
OS requirements:
  Address space issues, memory management, protection
Conformance to standards:
  Languages, OS, networks, I/O, IEEE floating point

Computer Systems: Technology Trends

1988:
  Supercomputers
  Massively parallel processors
  Mini-supercomputers
  Minicomputers
  Workstations
  PCs

2002:
  Powerful PCs and SMP workstations
  Networks of SMP workstations
  Mainframes
  Supercomputers
  Embedded computers

Why Such Change in 10 Years?

Performance
  Technology advances: CMOS (complementary metal oxide semiconductor) VLSI dominates older technologies such as TTL (transistor-transistor logic) in both cost and performance.
  Computer architecture advances improve the low end: RISC, pipelining, superscalar, RAID, ...
Price: lower costs due to
  Simpler development; CMOS VLSI means smaller systems and fewer components
  Higher volumes
  Lower margins by class of computer, due to fewer services
Function: the rise of networking and local interconnection technology

Growth in Microprocessor Performance

10

Six Generations of DRAMs

11

Updated Technology Trends (Summary)

                      Capacity           Speed (latency)
Logic                 4x in 4 years      2x in 3 years
DRAM                  4x in 3 years      2x in 10 years
Disk                  4x in 2 years      2x in 10 years
Network (bandwidth)   10x in 5 years

Updates during your study period?
  BS (4 yrs), MS (2 yrs), PhD (5 yrs)
  (a quick way to compound these rates is sketched below)

12
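A growth rate of the form "Nx every M years" compounds as N^(years/M). A minimal Python sketch of the "study period" question above, using the rates quoted on this slide (the study lengths are the examples given):

    # Compound a rate of "n_times every every_years" over a study period.
    def growth(n_times, every_years, period_years):
        return n_times ** (period_years / every_years)

    rates = {
        "Logic capacity (4x / 4 yrs)":     (4, 4),
        "DRAM capacity (4x / 3 yrs)":      (4, 3),
        "Disk capacity (4x / 2 yrs)":      (4, 2),
        "Network bandwidth (10x / 5 yrs)": (10, 5),
    }

    for period in (4, 2, 5):   # BS, MS, PhD study lengths from the slide
        print(f"--- {period}-year study period ---")
        for name, (n, m) in rates.items():
            print(f"  {name}: ~{growth(n, m, period):.1f}x")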

13

Integrated Circuit Costs

    IC cost = (Die cost + Testing cost + Packaging cost) / Final test yield

    Die cost = Wafer cost / (Dies per wafer * Die yield)

    Dies per wafer = [pi * (Wafer_diam / 2)^2 / Die_Area] - [pi * Wafer_diam / sqrt(2 * Die_Area)] - Test dies

    Die yield = Wafer yield * (1 + Defects_per_unit_area * Die_Area / alpha)^(-alpha)

where alpha is an empirical process parameter (around 3).

Die cost goes roughly with (die area)^4.
(a small cost sketch follows below)

14
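A minimal Python sketch of the die-cost formulas above (the wafer size, wafer cost, defect density, and alpha below are illustrative values, not figures from the slides):

    import math

    def dies_per_wafer(wafer_diam_cm, die_area_cm2, test_dies=0):
        # Gross dies from wafer area, minus edge loss and test dies.
        return (math.pi * (wafer_diam_cm / 2) ** 2 / die_area_cm2
                - math.pi * wafer_diam_cm / math.sqrt(2 * die_area_cm2)
                - test_dies)

    def die_yield(wafer_yield, defects_per_cm2, die_area_cm2, alpha=3.0):
        return wafer_yield * (1 + defects_per_cm2 * die_area_cm2 / alpha) ** (-alpha)

    def die_cost(wafer_cost, wafer_diam_cm, die_area_cm2,
                 wafer_yield=1.0, defects_per_cm2=1.0, alpha=3.0):
        return wafer_cost / (dies_per_wafer(wafer_diam_cm, die_area_cm2)
                             * die_yield(wafer_yield, defects_per_cm2, die_area_cm2, alpha))

    # Doubling the die area far more than doubles the cost.
    for area in (0.5, 1.0, 2.0):   # die area in cm^2
        print(f"die area {area} cm^2 -> cost ~${die_cost(1000, 20, area):.2f}")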


Performance Trends (Summary)

Workstation performance (measured in SPECmarks) improves roughly 50% per year (2x every 18 months).
Improvement in cost/performance is estimated at 70% per year.

15

Computer Engineering Methodology

[Diagram: an iterative loop: evaluate existing systems for bottlenecks (driven by benchmarks), simulate new designs and organizations (driven by workloads), and implement the next-generation system (guided by technology trends and implementation complexity), then repeat.]
16

How to Quantify Performance?


Plane               DC to Paris   Speed      Passengers   Throughput (pmph)
Boeing 747          6.5 hours     610 mph    470          286,700
BAC/Sud Concorde    3 hours       1350 mph   132          178,200

Time to run the task (ExTime): execution time, response time, latency
Tasks per day, hour, week, sec, ns, ... (Performance): throughput, bandwidth

17

The Bottom Line: Performance and Cost, or Cost and Performance?

"X is n times faster than Y" means:

    n = ExTime(Y) / ExTime(X) = Performance(X) / Performance(Y)

Speed of Concorde vs. Boeing 747
Throughput of Boeing 747 vs. Concorde
Cost is also an important parameter in the equation, which is why Concordes are being put out to pasture! (a quick check of the ratios is sketched below)
18
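A minimal Python check of the two comparisons above, using the numbers from the plane table (a sketch; the dictionary layout and names are mine):

    # Speed favors the Concorde; throughput (passenger-mph) favors the 747.
    planes = {
        "Boeing 747":       {"speed_mph": 610,  "passengers": 470},
        "BAC/Sud Concorde": {"speed_mph": 1350, "passengers": 132},
    }

    for name, p in planes.items():
        p["throughput_pmph"] = p["speed_mph"] * p["passengers"]
        print(f"{name}: throughput = {p['throughput_pmph']:,} pmph")

    speed_ratio = planes["BAC/Sud Concorde"]["speed_mph"] / planes["Boeing 747"]["speed_mph"]
    tput_ratio = planes["Boeing 747"]["throughput_pmph"] / planes["BAC/Sud Concorde"]["throughput_pmph"]
    print(f"Concorde is {speed_ratio:.1f}x faster by speed")
    print(f"The 747 is {tput_ratio:.1f}x 'faster' by throughput")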

Measurement Tools
Benchmarks, Traces, Mixes
Hardware: Cost, delay, area, power estimation
Simulation (many levels)
ISA, RT, Gate, Circuit

Queuing Theory
Rules of Thumb
Fundamental Laws/Principles
Understanding the limitations of any
measurement tool is crucial.
19

Metrics of Performance

[Diagram: levels of the system and the metric typically quoted at each level]
Application: answers per month, operations per second
Programming language / compiler: (millions of) instructions per second (MIPS)
ISA: (millions of) floating-point operations per second (MFLOP/s)
Datapath / control: megabytes per second
Function units: cycles per second (clock rate)
Transistors, wires, pins

20

Cases of Benchmark Engineering


The motivation is to tune the system to the benchmark to achieve peak performance.
At the architecture level:
  Specialized instructions
At the compiler level (compiler flags):
  Blocking in SPEC89: a factor-of-9 speedup
  Incorrect compiler optimizations/reordering that work fine on the benchmark but not on other programs
At the I/O level:
  SPEC92 spreadsheet program (sp): companies noticed that the output was always written to a file, so they kept the results in a memory buffer and only flushed it at the end, which was not measured. One company eliminated the I/O altogether.

21

After putting in a blazing performance on the benchmark test, Sun issued a glowing press release claiming that it had outperformed Windows NT systems on the test.
Pendragon president Ivan Phillips cried foul, saying the results weren't representative of real-world Java performance and that Sun had gone so far as to duplicate the test's code within Sun's Just-In-Time compiler. That's cheating, says Phillips, who claims that benchmark tests and real-world applications aren't the same thing.
Did Sun issue a denial or a mea culpa? Initially, Sun neither denied optimizing for the benchmark test nor apologized for it. "If the test results are not representative of real-world Java applications, then that's a problem with the benchmark," Sun's Brian Croll said.
After taking a beating in the press, though, Sun retreated and issued an apology for the optimization. [Excerpted from PC Online, 1997]

22

Issues with Benchmark Engineering

Motivated by the bottom dollar: good performance on classic suites means more customers and better sales.
Benchmark engineering limits the longevity of benchmark suites.
Technology and applications also limit the longevity of benchmark suites.
23

SPEC: System Performance Evaluation Cooperative

First round, 1989: 10 programs yielding a single number (SPECmarks).
Second round, 1992: SPECint92 (6 integer programs) and SPECfp92 (14 floating-point programs); compiler flags unlimited (March 93).
Third round, 1995: a new set of programs, SPECint95 (8 integer programs) and SPECfp95 (10 floating point); benchmarks intended to be useful for 3 years; a single flag setting for all programs: SPECint_base95, SPECfp_base95.
SPEC CPU2000: 11 integer benchmarks (CINT2000) and 14 floating-point benchmarks (CFP2000).

24

SPEC 2000 (CINT2000) Results

25

SPEC 2000 (CFP2000) Results

26

Reporting Performance Results


Reproducibility: apply them to publicly available benchmarks.

Pecking/picking order:
  Real programs
  Real kernels
  Toy benchmarks
  Synthetic benchmarks
27

How to Summarize Performance

Arithmetic mean (weighted arithmetic mean) tracks execution time: sum(Ti)/n or sum(Wi*Ti).
Harmonic mean (weighted harmonic mean) of rates (e.g., MFLOPS) also tracks execution time: n/sum(1/Ri) or 1/sum(Wi/Ri).
Normalized execution time is handy for scaling performance (e.g., "X times faster than a SPARCstation 10").
But do not take the arithmetic mean of normalized execution times; use the geometric mean = (Product(Ri))^(1/n). (a sketch of these means follows below)

28
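A minimal Python sketch of these summary statistics (the example times, weights, and rates are made up purely for illustration):

    import math

    times = [2.0, 4.0, 8.0]        # execution times Ti (seconds)
    weights = [0.5, 0.3, 0.2]      # workload weights Wi (sum to 1)
    rates = [100.0, 200.0, 400.0]  # rates Ri (e.g., MFLOPS)
    norm = [0.5, 2.0, 1.0]         # execution times normalized to a reference machine

    arith_mean = sum(times) / len(times)
    weighted_arith_mean = sum(w * t for w, t in zip(weights, times))
    harmonic_mean = len(rates) / sum(1.0 / r for r in rates)
    weighted_harmonic_mean = 1.0 / sum(w / r for w, r in zip(weights, rates))
    geo_mean = math.prod(norm) ** (1.0 / len(norm))  # the right mean for normalized times

    print(arith_mean, weighted_arith_mean, harmonic_mean,
          weighted_harmonic_mean, geo_mean)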

Performance Evaluation

For better or worse, benchmarks shape a field.
Good products are created when you have:
  Good benchmarks
  Good ways to summarize performance
Given that sales are partly a function of performance relative to the competition, companies invest in improving the product as reported by the performance summary.
If the benchmarks or summary are inadequate, a company must choose between improving its product for real programs and improving it to get more sales; sales almost always win!
Execution time is the measure of computer performance!

29

Simulations

When are simulations useful?
What are their limitations, i.e., what real-world phenomena do they not account for?
The larger the simulation trace, the less tractable the post-processing analysis.
30

Queueing Theory
What are the distributions of arrival rates
and values for other parameters?
Are they realistic?
What happens when the parameters or
distributions are changed?
31

Quantitative Principles of Computer Design

Make the common case fast
  Amdahl's Law
CPU performance equation
  Clock cycle time
  CPI
  Instruction count
Principle of locality
Take advantage of parallelism
32

Amdahl's Law

Speedup due to enhancement E:

    Speedup(E) = ExTime(without E) / ExTime(with E) = Performance(with E) / Performance(without E)

Suppose that enhancement E accelerates a fraction F of the task by a factor S, and the remainder of the task is unaffected.
33

Amdahl's Law

    ExTime_new = ExTime_old x [(1 - Fraction_enhanced) + Fraction_enhanced / Speedup_enhanced]

    Speedup_overall = ExTime_old / ExTime_new
                    = 1 / [(1 - Fraction_enhanced) + Fraction_enhanced / Speedup_enhanced]

(a code sketch of this formula follows below)

34
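A minimal Python sketch of the overall-speedup formula above (function and variable names are mine, not from the slides):

    def amdahl_speedup(fraction_enhanced, speedup_enhanced):
        # Overall speedup when only `fraction_enhanced` of the original execution
        # time benefits from a local speedup of `speedup_enhanced`.
        new_time_ratio = (1 - fraction_enhanced) + fraction_enhanced / speedup_enhanced
        return 1.0 / new_time_ratio

    # The FP example worked later in these slides: 10% of the time, 2x faster.
    print(amdahl_speedup(0.10, 2.0))    # ~1.053

    # Even an unbounded local speedup is capped by the untouched 90%.
    print(amdahl_speedup(0.10, 1e12))   # ~1.111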

Amdahl's Law (example)

Floating-point instructions improved to run 2x, but only 10% of actual instructions are FP.

ExTime_new =
Speedup_overall =

35

CPU Performance Equation

    CPU time = Seconds/Program = (Instructions/Program) x (Cycles/Instruction) x (Seconds/Cycle)

Which design levels affect each factor:

                  Inst Count   CPI    Clock Rate
    Program           X
    Compiler          X        (X)
    Inst. Set         X         X
    Organization                X         X
    Technology                            X

(a small numeric sketch follows below)
36
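A minimal Python sketch of the CPU time equation (the instruction count, CPI, and clock rate are illustrative numbers only):

    def cpu_time(instruction_count, cpi, clock_rate_hz):
        # CPU time = Instructions x (Cycles/Instruction) x (Seconds/Cycle)
        return instruction_count * cpi / clock_rate_hz

    # Illustrative: 1 billion instructions, CPI of 1.5, 2 GHz clock -> 0.75 s.
    print(cpu_time(1_000_000_000, 1.5, 2_000_000_000))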

Cycles Per Instruction

Average cycles per instruction:

    CPI = (CPU time * Clock rate) / Instruction count = Cycles / Instruction count

    CPU time = CycleTime * sum_i (CPI_i * I_i)

Instruction frequency:

    CPI = sum_i (CPI_i * F_i),   where F_i = I_i / Instruction count

Invest resources where time is spent!


37

Example: Calculating CPI

Base machine (Reg / Reg), typical mix:

    Op       Freq   Cycles   CPI(i)   (% Time)
    ALU      50%    1        0.5      (33%)
    Load     20%    2        0.4      (27%)
    Store    10%    2        0.2      (13%)
    Branch   20%    2        0.4      (27%)
                    Total    1.5

(the sketch below reproduces these numbers)

38
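A minimal Python sketch reproducing the CPI and %-of-time columns from the table above:

    # (frequency, cycles) per instruction class, taken from the table.
    mix = {
        "ALU":    (0.50, 1),
        "Load":   (0.20, 2),
        "Store":  (0.10, 2),
        "Branch": (0.20, 2),
    }

    cpi = sum(freq * cycles for freq, cycles in mix.values())
    print(f"Overall CPI = {cpi}")   # 1.5

    for op, (freq, cycles) in mix.items():
        contribution = freq * cycles
        print(f"{op}: CPI contribution {contribution:.1f}, "
              f"% of time {100 * contribution / cpi:.0f}%")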

Chapter Summary, #1

Designing to last through trends:

            Capacity         Speed
    Logic   2x in 3 years    2x in 3 years
    DRAM    4x in 3 years    2x in 10 years
    Disk    4x in 3 years    2x in 10 years

6 years to graduate => 16x CPU speed, DRAM/disk size

Time to run the task (ExTime): execution time, response time, latency
Tasks per day, hour, week, sec, ns, ... (Performance): throughput, bandwidth

"X is n times faster than Y" means:

    n = ExTime(Y) / ExTime(X) = Performance(X) / Performance(Y)

39

Chapter Summary, #2

Amdahl's Law:

    Speedup_overall = ExTime_old / ExTime_new
                    = 1 / [(1 - Fraction_enhanced) + Fraction_enhanced / Speedup_enhanced]

CPI Law:

    CPU time = Seconds/Program = (Instructions/Program) x (Cycles/Instruction) x (Seconds/Cycle)

Execution time is the REAL measure of computer performance!
Good products are created when you have good benchmarks and good ways to summarize performance.
Die cost goes roughly with (die area)^4.

40

Food for Thought

Two companies report results on two benchmarks: one on a Fortran benchmark suite and the other on a C++ benchmark suite.
Company A's product outperforms Company B's on the Fortran suite; the reverse holds true for the C++ suite. Assume the performance differences are similar in both cases.
Do you have enough information to compare the two products? What information would you need?
41

Food for Thought II

In the CISC vs. RISC debate, a key argument of the RISC movement was that, because of its simplicity, RISC would always remain ahead:
  If there were enough transistors to implement a CISC on chip, then those same transistors could implement a pipelined RISC.
  If there were enough to allow for a pipelined CISC, there would be enough to have an on-chip cache for RISC. And so on.
After 20 years of this debate, what do you think?
Hint: think of commercial PCs, Moore's law, and some of the data in the first chapter of the book (and on these slides).
42

Amdahl's Law (answer)

Floating-point instructions improved to run 2x, but only 10% of actual instructions are FP.

    ExTime_new = ExTime_old x (0.9 + 0.1/2) = 0.95 x ExTime_old

    Speedup_overall = 1 / 0.95 = 1.053

43
