Lec 8 Performance Enhancement - Computer Architecture
PERFORMANCE ENHANCEMENT
LECTURE 8
Performance enhancement
Can be achieved through:
■ Cache memory
■ Pipelining
■ Instruction prefetch
Cache Memory:
Cache memory is a semiconductor storage device placed between the CPU and main memory. It is a fast memory device, faster than main memory.
A cache hit occurs when the requested data can be found in the cache, while a cache miss occurs when it cannot. Cache hits are served by reading data from the cache, which is faster than re-computing a result or reading from a slower data store; thus, the more requests that can be served from the cache, the faster the system performs.
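The effect of the hit rate can be quantified with the standard average memory access time relation, AMAT = hit time + miss rate × miss penalty. A minimal check in C (the 1 ns and 50 ns timings and the 5% miss rate are illustrative assumptions, not measurements):

    #include <stdio.h>

    /* Average memory access time with illustrative numbers:
       AMAT = hit_time + miss_rate * miss_penalty. */
    int main(void) {
        double hit_time = 1.0;        /* ns: cache access */
        double miss_penalty = 50.0;   /* ns: main-memory access on a miss */
        double miss_rate = 0.05;      /* 5% of accesses miss */
        double amat = hit_time + miss_rate * miss_penalty;
        printf("AMAT = %.1f ns\n", amat);  /* 1 + 0.05 * 50 = 3.5 ns */
        return 0;
    }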
A cache's sole purpose is to reduce accesses to the underlying slower
storage. Cache is also usually an abstraction layer that is designed to be
invisible from the perspective of neighboring layers.
Cache Mapping Techniques
There are three different types of mapping used for cache memory, which are as follows: direct mapping, associative mapping, and set-associative mapping.
Direct Mapping
The simplest technique, known as direct mapping, maps each block of main memory into only one possible cache line. In other words, direct mapping assigns each memory block to a specific line in the cache. If that line is already occupied by a memory block when a new block needs to be loaded, the old block is overwritten.
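A minimal sketch in C of how a direct-mapped cache decomposes an address, assuming a hypothetical geometry of 64-byte blocks and 128 cache lines; the line index is simply the block number modulo the number of lines:

    #include <stdint.h>
    #include <stdio.h>

    #define BLOCK_SIZE 64   /* bytes per block (assumed) */
    #define NUM_LINES  128  /* cache lines (assumed) */

    int main(void) {
        uint32_t addr  = 0x0001ABCDu;          /* example address */
        uint32_t block = addr / BLOCK_SIZE;    /* main-memory block number */
        uint32_t line  = block % NUM_LINES;    /* the ONE line this block can occupy */
        uint32_t tag   = block / NUM_LINES;    /* remaining bits are stored as the tag */
        printf("block=%u line=%u tag=%u\n", block, line, tag);
        return 0;
    }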
Associative Mapping
In this type of mapping, associative memory is used to store both the content and the address of the memory word. Any block can go into any line of the cache. This means that the word ID bits are used to identify which word in the block is needed, while the tag becomes all of the remaining bits. This enables the placement of any block at any line in the cache memory. It is considered to be the fastest and the most flexible mapping form.
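By contrast with direct mapping, a fully associative lookup must compare the tag against every line, since a block may reside anywhere. A sketch under an assumed 128-line geometry:

    #include <stdbool.h>
    #include <stdint.h>

    #define NUM_LINES 128

    struct line { bool valid; uint32_t tag; };
    static struct line cache[NUM_LINES];

    /* In fully associative mapping the whole block number serves as the
       tag, so a lookup searches all lines. */
    bool lookup(uint32_t block) {
        for (int i = 0; i < NUM_LINES; i++)
            if (cache[i].valid && cache[i].tag == block)
                return true;   /* hit */
        return false;          /* miss: the block may be placed in ANY free line */
    }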
Set-associative Mapping
This form of mapping is an enhanced form of direct mapping that removes its drawbacks. Set-associative mapping addresses the problem of possible thrashing in the direct mapping method: instead of having exactly one line that a block can map to in the cache, a few lines are grouped together to create a set. A block in memory can then map to any one of the lines of a specific set.
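A sketch of the corresponding set-associative lookup, assuming a hypothetical 4-way cache with 32 sets; only the four lines of the selected set are searched:

    #include <stdbool.h>
    #include <stdint.h>

    #define NUM_SETS 32
    #define WAYS     4   /* 4-way set associative: 4 lines per set */

    struct line { bool valid; uint32_t tag; };
    static struct line cache[NUM_SETS][WAYS];

    bool lookup(uint32_t block) {
        uint32_t set = block % NUM_SETS;   /* a block maps to exactly one set... */
        uint32_t tag = block / NUM_SETS;
        for (int w = 0; w < WAYS; w++)     /* ...but to any line within that set */
            if (cache[set][w].valid && cache[set][w].tag == tag)
                return true;   /* hit */
        return false;
    }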
Pipelining
■ It is observed that organizational enhancements to the CPU can improve performance. We have already seen that the use of multiple registers rather than a single accumulator, and the use of cache memory, improve performance considerably. Another organizational approach, which is quite common, is instruction pipelining.
PIPELINING
■ A pipeline is a set of data processing elements connected in series,
where the output of one element is the input of the next one.
■ It allows storing and executing instructions in an orderly process. It is
also known as pipeline processing.
■ Pipelining increases the overall instruction throughput.
■ In a pipeline system, each segment consists of an input register followed by a combinational circuit. The register holds data and the combinational circuit performs operations on it. The output of the combinational circuit is applied to the input register of the next segment (a toy simulation follows this list).
■ A pipeline system is like a modern-day assembly line in a factory. For example, in a car manufacturing plant, huge assembly lines are set up and, at each point, robotic arms perform a certain task; the car then moves on to the next arm.
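The register-plus-combinational-circuit structure can be mimicked in software. In the following toy simulation (the two operations and the input values are purely illustrative), each "clock tick" latches every segment's output into the next segment's input register, so two items are in flight at once:

    #include <stdio.h>

    enum { EMPTY = -1 };   /* marker for an empty pipeline register */

    int main(void) {
        int input[] = {10, 20, 30};
        int n = 3;
        int reg1 = EMPTY, reg2 = EMPTY;   /* input registers of the two segments */
        for (int t = 0, fed = 0; t < n + 2; t++) {
            int done = (reg2 != EMPTY) ? reg2 * 2 : EMPTY; /* segment 2's combinational circuit */
            reg2 = (reg1 != EMPTY) ? reg1 + 1 : EMPTY;     /* segment 1's output is latched onward */
            reg1 = (fed < n) ? input[fed++] : EMPTY;       /* a new item enters segment 1 */
            if (done != EMPTY)
                printf("t=%d result=%d\n", t, done);
        }
        return 0;
    }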
Pipelining cont’d
■ To apply the concept of pipelining to instruction execution, it is required to break the instruction into different tasks. Each task is executed in a different processing element of the CPU.
■ As we know that there are two distinct phases of instruction
execution: one is instruction fetch and the other one is instruction
execution. Therefore, the processor executes a program by fetching
and executing instructions, one after another.
Let us refer to the fetch and execute steps for an instruction. Execution of a program consists of a sequence of fetch and execute steps, as shown on the next slide.
Pipelining cont’d
As a simple approach, consider subdividing instruction processing into two
stages:
■ fetch instruction and
■ execute instruction.
There are times during the execution of an instruction when main memory is
not being accessed. This time could be used to fetch the next instruction in
parallel with the execution of the current one.
The pipeline has two independent stages.
The first stage fetches an instruction and buffers it. When the second stage is
free, the first stage passes it the buffered instruction. While the second stage
is executing the instruction, the first stage takes advantage of any unused
memory cycles to fetch and buffer the next instruction. This is called instruction prefetch or fetch overlap.
Note that this approach, which involves instruction buffering, requires more registers. In general, pipelining requires registers to store data between stages.
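A toy illustration of fetch overlap, assuming a one-slot instruction buffer: while instruction i is "executing", the fetch stage refills the buffer with instruction i+1 (the program array and its contents are illustrative):

    #include <stdio.h>

    int main(void) {
        int program[] = {11, 22, 33, 44};
        int n = 4;
        int buffer = program[0];            /* stage 1 fetches and buffers the first instruction */
        for (int i = 0; i < n; i++) {
            int current = buffer;           /* stage 2 takes the buffered instruction */
            if (i + 1 < n)
                buffer = program[i + 1];    /* stage 1 uses the idle memory cycle to prefetch */
            printf("executing instruction %d\n", current);
        }
        return 0;
    }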
Instruction decomposition
To gain further speedup, the pipeline must have more stages. Let us consider the following decomposition of the instruction processing:
■ Fetch instruction (FI): Read the next expected instruction into a
buffer.
■ Decode instruction (DI): Determine the opcode and the operand
specifiers.
■ Calculate operands (CO): Calculate the effective address of each
source operand. This may involve displacement, register indirect,
indirect, or other forms of address calculation.
■ Fetch operands (FO): Fetch each operand from memory. Operands in
registers need not be fetched.
■ Execute instruction (EI): Perform the indicated operation and store the result, if any, in the specified destination operand location.
■ Write operand (WO): Store the result in memory.
Pipelining cont’d
■ With this decomposition, the various stages will be of more nearly equal duration. For the sake of illustration, let us assume equal duration. Using this assumption, Figure 12.10 shows that a six-stage pipeline can reduce the execution time for 9 instructions from 54 time units to 14 time units (an arithmetic check of this figure follows below).
■ Several comments are in order: The diagram assumes that each
instruction goes through all six stages of the pipeline. This will not
always be the case. For example, a load instruction does not need the
WO stage. However, to simplify the pipeline hardware, the timing is
set up assuming that each instruction requires all six stages. Also, the
diagram assumes that all of the stages can be performed in parallel.
In particular, it is assumed that there are no memory conflicts.
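The 54-versus-14 figure follows from the usual pipeline timing relation: with k stages of unit duration and n instructions, the unpipelined time is n × k, while the pipelined time is k + (n − 1). A quick check in C:

    #include <stdio.h>

    int main(void) {
        int k = 6;                    /* stages: FI DI CO FO EI WO */
        int n = 9;                    /* instructions */
        int unpipelined = n * k;      /* 9 * 6 = 54 time units */
        int pipelined = k + (n - 1);  /* fill the pipe once, then one result per unit: 14 */
        printf("unpipelined=%d pipelined=%d\n", unpipelined, pipelined);
        return 0;
    }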
Pipeline Hazards
A pipeline hazard occurs when the pipeline, or some portion of the
pipeline, must stall because conditions do not permit continued
execution. Such a pipeline stall is also referred to as a pipeline bubble.
There are three types of hazards: resource, data, and control.
■ RESOURCE HAZARDS/STRUCTURAL HAZARD.
A resource hazard occurs when two (or more) instructions that are already in the pipeline need the same resource. The result is that the instructions must be executed serially rather than in parallel for a portion of the pipeline. A resource hazard is sometimes referred to as a structural hazard.
■ CONTROL HAZARDS/BRANCH HAZARD.
A control hazard, also known as a branch hazard, occurs when the pipeline makes the wrong decision on a branch prediction and therefore brings instructions into the pipeline that must subsequently be discarded. We discuss approaches to dealing with control hazards next.
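A small C analogue of the problem (the function and values are illustrative): which instruction executes after the branch depends on a condition that is resolved only late in the pipeline, so the fetch stage must either stall or guess, and a wrong guess means discarding the fetched instructions:

    /* The comparison result is not known until the branch is evaluated,
       yet the fetch stage must already choose which path's instructions
       to bring into the pipeline. */
    int branch_example(int r1) {
        if (r1 == 0)    /* branch condition resolves late */
            return 10;  /* taken path */
        return 20;      /* fall-through path */
    }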
Pipeline Hazards
■ DATA HAZARDS
A data hazard occurs when there is a conflict in the access of an operand
location. In general terms, we can state the hazard in this form: Two
instructions in a program are to be executed in sequence and both
access a particular memory or register operand. If the two instructions
are executed in strict sequence, no problem occurs. However, if the
instructions are executed in a pipeline, then it is possible for the operand
value to be updated in such a way as to produce a different result than
would occur with strict sequential execution. In other words, the program
produces an incorrect result because of the use of pipelining.
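A minimal read-after-write (RAW) example, with C statements standing in for two machine instructions (the register names r1..r5 are illustrative):

    /* Instruction 2 reads r1, which instruction 1 writes. In strict
       sequence this is fine; in a pipeline, instruction 2's operand
       fetch (FO) can occur before instruction 1's write-back (WO), so a
       stale r1 would be read unless the pipeline stalls or forwards. */
    int raw_example(int r2, int r3, int r5) {
        int r1 = r2 + r3;   /* instruction 1: writes r1 in its WO stage */
        int r4 = r1 + r5;   /* instruction 2: needs r1 in its FO stage */
        return r4;
    }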
Pipeline Conflicts
■ There are some factors that cause the pipeline to deviate from its normal performance. Some of these factors are given below:
1. Timing Variations
All stages cannot take the same amount of time. This problem generally occurs in instruction processing, where different instructions have different operand requirements and thus different processing times.
2. Data Hazards
When several instructions are in partial execution, a problem arises if they reference the same data. We must ensure that the next instruction does not attempt to access the data before the current instruction has finished with it, because this would lead to incorrect results.
Pipeline Conflicts
3. Branching
In order to fetch and execute the next instruction, we must know what that instruction is. If the present instruction is a conditional branch, whose result determines the next instruction, then the next instruction may not be known until the current one is processed.
4. Interrupts
Interrupts inject unwanted instructions into the instruction stream and thus affect the execution of instructions.
5. Data Dependency
It arises when an instruction depends upon the result of a previous instruction, but this result is not yet available.
Advantages of Pipelining
■ The cycle time of the processor is reduced.
■ It increases the throughput of the system.
■ It makes the system reliable.
Disadvantages of Pipelining
■ The design of a pipelined processor is complex and costly to manufacture.
■ Instruction latency increases.
Instruction prefetch
■ In computer architecture, instruction prefetch is a technique used in central processing units to speed up the execution of a program by reducing wait states.
■ Prefetching occurs when a processor requests an instruction or data block from main memory before it is actually needed. Once the block comes back from memory, it is placed in a cache. When the instruction/data block is actually needed, it can be accessed much more quickly from the cache than if the processor had to make a request to memory.
■ Since programs are generally executed sequentially, performance is likely to be best when instructions are prefetched in program order (a toy sequential prefetcher is sketched after this list).
■ Alternatively, the prefetch may be part of a complex branch prediction algorithm, where the processor tries to anticipate the result of a calculation and fetch the right instructions in advance.
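A toy next-line prefetcher in C, under the simplifying assumption that a request for block b also brings block b+1 into the cache (the sizes and the access pattern are illustrative):

    #include <stdbool.h>
    #include <stdio.h>

    #define NUM_BLOCKS 16

    static bool cached[NUM_BLOCKS];

    void access_block(int b) {
        if (cached[b])
            printf("block %d: hit (served from cache)\n", b);
        else
            printf("block %d: miss (fetched from memory)\n", b);
        cached[b] = true;
        if (b + 1 < NUM_BLOCKS)
            cached[b + 1] = true;   /* prefetch the next sequential block */
    }

    int main(void) {
        access_block(0);   /* miss, and block 1 is prefetched */
        access_block(1);   /* hit thanks to the prefetch */
        return 0;
    }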
Types of prefetching
Prefetching can be classified in many ways:
■ Data or instruction prefetching. As the name implies, prefetching can be performed for either data blocks or instruction blocks. Since data access patterns show less regularity than instruction patterns, accurate data prefetching is generally more challenging than instruction prefetching (a software data-prefetch sketch follows).
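Data prefetching can also be requested explicitly by software. A sketch assuming a GCC-compatible compiler, which provides the __builtin_prefetch hint (the look-ahead distance of 16 elements is an arbitrary illustrative choice):

    /* Sums an array while hinting the hardware to start loading data
       about 16 elements ahead of the current position. */
    long sum_with_prefetch(const long *a, int n) {
        long s = 0;
        for (int i = 0; i < n; i++) {
            if (i + 16 < n)
                __builtin_prefetch(&a[i + 16]);  /* request the block before it is needed */
            s += a[i];
        }
        return s;
    }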