Module 4 - Parallel & Pipeline Processing - Final
Page 1
Introduction to parallel processing concepts
What is Parallel Processing?
• Parallel processing is a large class of techniques used to perform data-processing tasks simultaneously, with the goal of increasing the computational speed of a computer system.
• Instead of processing each instruction sequentially, as in a conventional computer, a parallel processing system performs concurrent data processing to achieve a faster execution time.
• The system may have two or more ALUs and be able to execute two or more instructions at the same time.
• The system may have two or more processors operating concurrently.
Page 2
Purpose of Parallel Processing:
To speed up the computer's processing capability.
To increase its throughput.
Page 3
Flynn's Classification of Computers
• M. J. Flynn proposed a classification of computer organizations based on the number of instruction streams and data streams that are manipulated simultaneously.
• The sequence of instructions read from memory constitutes an instruction stream, and the operations performed on the data in the processor constitute a data stream.
• Parallel processing may occur in the instruction stream, in the data stream, or in both.
Flynn's classification divides computers into four major groups:
1. Single instruction stream, single data stream (SISD)
2. Single instruction stream, multiple data stream (SIMD)
3. Multiple instruction stream, single data stream (MISD)
4. Multiple instruction stream, multiple data stream (MIMD)
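The SISD/SIMD distinction can be sketched in plain Python. This is only an illustrative model, not real hardware: the "SIMD" version stands in for applying one instruction to a whole vector of data at once, while the "SISD" version processes one data item per step.

```python
# Illustrative sketch (not real hardware): contrast SISD-style scalar
# processing with SIMD-style whole-vector processing.
a = [1, 2, 3, 4]
b = [10, 20, 30, 40]

# SISD: a single instruction stream handles one data item per step.
sisd_result = []
for x, y in zip(a, b):          # one add per iteration
    sisd_result.append(x + y)

# SIMD: one "add" instruction is applied to all data elements at once
# (modelled here as a single vectorised expression).
simd_result = [x + y for x, y in zip(a, b)]

print(sisd_result)  # [11, 22, 33, 44]
print(simd_result)  # [11, 22, 33, 44]
```

Both produce the same result; the difference lies in how many data items one instruction touches per step.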
Page 4
Single instruction stream, single data stream (SISD)
Page 5
Single instruction stream, multiple data stream (SIMD)
Page 6
Multiple instruction stream, single data stream (MISD)
Page 7
Multiple instruction stream, multiple data stream (MIMD)
Page 8
Instruction Level Parallelism
• Instruction Level Parallelism (ILP) refers to architectures in which multiple operations can be performed in parallel within a single process, which has its own set of resources (address space, registers, identifiers, state, program counter).
• It also refers to the compiler techniques and processor designs that execute operations, such as memory loads and stores, integer addition, and floating-point multiplication, in parallel to improve processor performance.
• Examples of architectures that exploit ILP are VLIW and superscalar architectures.
• A typical ILP processor allows multiple-cycle operations to be pipelined.
• ILP is a measure of how many of the operations in a computer program can be performed simultaneously.
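The last point, ILP as a measure, can be made concrete with a toy dependence graph: ILP is the number of operations divided by the length of the longest dependency chain. The three-operation program below is an assumed example for illustration.

```python
# Sketch: ILP = (number of operations) / (critical-path length).
# Assumed toy program: t1 = a+b; t2 = c+d; t3 = t1*t2
deps = {"t1": [], "t2": [], "t3": ["t1", "t2"]}

def level(op, d=deps):
    """Depth of an operation in the dependence graph (1 = no inputs)."""
    return 1 + max((level(p, d) for p in d[op]), default=0)

critical_path = max(level(op) for op in deps)  # t3 depends on t1, t2 -> 2
ilp = len(deps) / critical_path                # 3 operations / 2 levels
print(ilp)  # 1.5
```

Here t1 and t2 are independent and could execute simultaneously, so on average 1.5 operations per step are available.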
Page 9
Classification
• ILP architectures can be classified in the following ways:
• Sequential Architecture:
• The program is not expected to convey any explicit information regarding parallelism to the hardware.
• Dependence Architectures:
• The program explicitly conveys information regarding dependencies between operations, as in dataflow architectures.
• Independence Architecture:
• The program specifies which operations are independent of each other, so that they can be executed instead of 'nop's.
Page 10
Pipeline processing
Page 11
Pipeline processing
• Pipelining is the arrangement of the hardware elements of the CPU such that its overall performance is increased.
• Simultaneous execution of more than one instruction takes place in a pipelined processor.
• In pipelining, multiple instructions are overlapped in execution.
• General structure of n segment pipeline
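For an n-segment pipeline, the standard timing relations can be sketched as code: n tasks in a k-segment pipeline take k + (n - 1) clock cycles, versus n * k cycles without pipelining. The numbers below (5 segments, 100 tasks) are an assumed example.

```python
def pipeline_cycles(k, n):
    """Clock cycles to finish n tasks in a k-segment pipeline:
    k cycles for the first task, then 1 cycle per remaining task."""
    return k + (n - 1)

def speedup(k, n):
    """Speedup over non-pipelined execution, which needs n * k cycles."""
    return (n * k) / pipeline_cycles(k, n)

print(pipeline_cycles(5, 100))    # 104
print(round(speedup(5, 100), 2))  # 4.81, approaching k = 5 for large n
```

As n grows, the speedup approaches k, the number of pipeline segments.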
Page 12
Example
Page 13
Pipeline stages
Page 14
Pipeline stages- Three Stage Pipeline
Page 15
Pipeline stages
Page 16
• The first instruction completes in 5 clock cycles.
• After the completion of the first instruction, a new instruction completes its execution in every subsequent clock cycle.
• Observe that as soon as the instruction fetch of the first instruction is completed, the instruction fetch of the second instruction starts in the next clock cycle.
• This way the hardware never sits idle; it is always performing some operation.
• However, no two instructions can occupy the same stage in the same clock cycle.
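The schedule described above can be sketched as a small model of an ideal 5-stage pipeline (the stage names IF, ID, OF, EX, WB are assumed): instruction i occupies stage s in cycle i + s, so no two instructions ever share a stage in the same cycle.

```python
# Sketch: ideal 5-stage pipeline schedule; instruction i (0-based)
# occupies stage s in clock cycle 1 + i + s (cycles numbered from 1).
stages = ["IF", "ID", "OF", "EX", "WB"]

occupancy = {}  # (cycle, stage) -> instruction number
for i in range(3):                     # three instructions
    for s, name in enumerate(stages):
        cycle = 1 + i + s
        # No two instructions may share a stage in the same cycle.
        assert (cycle, name) not in occupancy
        occupancy[(cycle, name)] = i

first_done = max(c for (c, st), instr in occupancy.items() if instr == 0)
print(first_done)  # 5: the first instruction completes in 5 clock cycles
```

After cycle 5, one instruction completes in every subsequent cycle, matching the description above.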
Page 17
Advantages and Disadvantages of Pipelining
• Advantages of Pipelining
• Instruction throughput increases.
• Increase in the number of pipeline stages increases the number of instructions executed simultaneously.
• Faster ALU can be designed when pipelining is used.
• Pipelining increases the overall performance of the CPU.
• Disadvantages of Pipelining
• Designing of the pipelined processor is complex.
• The throughput of a pipelined processor is difficult to predict.
Page 18
Instruction pipelining
• Pipeline processing can occur not only in the data stream but in the instruction
stream as well.
• Most digital computers with complex instructions require an instruction
pipeline to carry out operations such as fetching, decoding, and executing instructions.
• In general, the computer needs to process each instruction with the following
sequence of steps.
1. Fetch instruction from memory.
2. Decode the instruction.
3. Calculate the effective address.
4. Fetch the operands from memory.
5. Execute the instruction.
6. Store the result in the proper place.
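The six steps above can be sketched as straight-line code. The instruction encoding and memory layout are assumed purely for illustration (a tuple of opcode and three register/address fields), and the effective address calculation is trivial here.

```python
# Toy sketch of the six-step instruction sequence (encoding assumed).
# Address 0 holds the instruction; addresses 1-3 hold data.
memory = {0: ("ADD", 1, 2, 3), 1: 10, 2: 32, 3: None}

pc = 0
instr = memory[pc]                       # 1. fetch instruction from memory
op, src1, src2, dst = instr              # 2. decode the instruction
# 3. calculate effective addresses (here, the literal operand fields)
a, b = memory[src1], memory[src2]        # 4. fetch the operands from memory
result = a + b if op == "ADD" else None  # 5. execute the instruction
memory[dst] = result                     # 6. store the result
print(memory[3])  # 42
```

A pipelined processor overlaps these steps across consecutive instructions rather than finishing all six for one instruction before starting the next.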
Page 19
Instruction pipelining
Page 20
Four-segment instruction pipeline
• Segment 1:
• The instruction fetch segment can be
implemented using a first-in, first-out
(FIFO) buffer.
• Segment 2:
• The instruction fetched from memory is
decoded in the second segment, and
eventually, the effective address is
calculated in a separate arithmetic circuit.
• Segment 3:
• An operand from memory is fetched in the
third segment.
• Segment 4:
• The instructions are finally executed in the
last segment of the pipeline organization.
Page 21
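The four segments above can be sketched as a loop over a FIFO fetch buffer. The instruction format and memory contents are assumed for illustration only.

```python
from collections import deque

# Minimal sketch of the four-segment organisation
# (instruction format and memory layout are assumed).
memory = {"A": 7, "B": 35}
program = [("ADD", "A", "B", "R1")]

fifo = deque(program)                  # Segment 1: FIFO instruction fetch buffer
registers = {}

while fifo:
    instr = fifo.popleft()             # next fetched instruction
    op, src1, src2, dst = instr        # Segment 2: decode (+ address calc)
    a, b = memory[src1], memory[src2]  # Segment 3: operand fetch
    registers[dst] = a + b             # Segment 4: execute
    
print(registers["R1"])  # 42
```

In hardware all four segments work concurrently on different instructions; this sequential loop only shows what each segment does.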
Types of Pipelining
• Arithmetic Pipelining
• Instruction Pipelining
• Processor Pipelining
• Unifunction vs. Multifunction Pipelining
• Static vs. Dynamic Pipelining
• Scalar vs. Vector Pipelining
Page 22
Advantages of Pipelining
• The cycle time of the processor is decreased, which improves instruction
throughput. Pipelining does not reduce the time it takes to execute a single
instruction; rather, it increases the number of instructions that can be
processed at once and reduces the delay between completed instructions
(the throughput).
• If pipelining is used, the CPU's arithmetic logic unit can be designed to run
faster, though it becomes more complex.
• Pipelining speeds up execution over an un-pipelined core by a factor of
roughly the number of stages, provided the clock frequency increases by a
similar factor and the code is well suited to pipelined execution.
• Pipelined CPUs frequently run at a higher clock frequency than the RAM
(as of 2008 technology, RAM operates at a low frequency compared with
CPU frequencies), increasing the computer's overall performance.
Page 23
Pipeline Hazards
In a pipelined system, certain situations prevent the next instruction from performing its planned task in a
particular clock cycle.
"Pipeline hazards are situations that prevent the next instruction from executing during its designated
clock cycle." These hazards introduce what are known as stall cycles.
Page 25
Structural Hazard/ Resource conflict
• This type of Hazard occurs when two different Inputs try to use the same resource simultaneously.
• These hazards are caused by access to memory by two instructions at the same time. These conflicts can
be slightly resolved by using separate instruction and data memories.
• Structural hazards occur when the processor's hardware is not capable of executing all the
instructions in the pipeline simultaneously.
• Structural hazards within a single pipeline are rare modern processors because the instruction set
architecture is designed to support pipelining.
Page 26
Structural Hazard/ Resource conflict continue
• During clock cycle 3, I1 is fetching its operand (OF), so no other instruction can access memory during that
cycle; the same applies to I2 in the following cycle.
• Instruction 3 (I3) is delayed by 2 cycles because it cannot be fetched while memory is being accessed by the
other instructions.
• Thus resource dependency can deteriorate the overall performance of pipelined execution.
• The above problem can be solved by using separate instruction and data memories.
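The conflict above can be sketched by listing the cycles in which each instruction touches a single shared memory (stage names and timing are assumed: IF in the first stage, OF in the third).

```python
def memory_cycles(i, stages=("IF", "ID", "OF", "EX", "WB")):
    """1-based cycles in which instruction i (0-based) touches a single
    shared memory: instruction fetch (IF) and operand fetch (OF)."""
    return {i + 1 + stages.index("IF"), i + 1 + stages.index("OF")}

# I1 is instruction 0, I3 is instruction 2.
conflict = memory_cycles(0) & memory_cycles(2)
print(sorted(conflict))  # [3]: I3's fetch collides with I1's operand fetch
```

With separate instruction and data memories, IF and OF no longer compete for the same port, and the conflict set is empty.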
Page 27
Data Hazard / Data Dependency
• Instruction 1 is processed through its full cycle: it is fetched, decoded, its operands are fetched, it is executed,
and its result is written back.
• When instruction i+1 is processed, it is fetched and decoded, but its operand cannot be fetched, because the
result of R2 and R3 is stored in R1, and that updated value is needed as an operand of the next instruction.
• So for instruction i+1 we cannot fetch the operand until the R1 value is updated. We therefore have to delay
the second instruction's operand fetch until the write-back of the first instruction is completed, and this
situation is called a hazard.
• The result in R1 is required as an input to the next instruction: the value of R1 in the second instruction
depends on the result of the first instruction. This is called a Data Dependency, and because of it the
pipeline incurs two stall cycles while executing the instructions.
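The two stall cycles can be derived from the stage positions (stage names and the assumption that the result is available only after WB are taken from the description above):

```python
# Sketch: count stall cycles for a RAW dependency in a 5-stage pipeline
# (IF, ID, OF, EX, WB), assuming the result is usable only after WB.
stages = ["IF", "ID", "OF", "EX", "WB"]

of_index = stages.index("OF")     # 2
wb_index = stages.index("WB")     # 4

# Without the hazard, instruction i+1 would do OF in cycle 1 + of_index
# (0-based); with the hazard it must wait until the first instruction's
# WB finishes in cycle wb_index.
ideal_of_cycle = 1 + of_index     # cycle 3
earliest_of_cycle = wb_index + 1  # cycle 5, after WB completes
stalls = earliest_of_cycle - ideal_of_cycle
print(stalls)  # 2 stall cycles, matching the text
```

Forwarding hardware, which passes the EX result straight to the next instruction, can reduce or eliminate these stalls; that refinement is not modelled here.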
Page 28
There are three situations in which a data hazard can occur:
1. Read after write (RAW), a true dependency
2. Write after read (WAR), an anti-dependency
3. Write after write (WAW), an output dependency
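The three cases can be classified mechanically from the destination and source registers of an instruction pair (the three-address tuple form `(dest, src1, src2)` is assumed):

```python
def hazards(first, second):
    """Classify data hazards between two instructions given as
    (dest, src1, src2), with `second` following `first` in program order."""
    d1, *s1 = first
    d2, *s2 = second
    found = []
    if d1 in s2:
        found.append("RAW")   # second reads what first writes: true dependency
    if d2 in s1:
        found.append("WAR")   # second writes what first reads: anti-dependency
    if d1 == d2:
        found.append("WAW")   # both write the same register: output dependency
    return found

print(hazards(("R1", "R2", "R3"), ("R4", "R1", "R5")))  # ['RAW']
print(hazards(("R1", "R2", "R3"), ("R2", "R4", "R5")))  # ['WAR']
print(hazards(("R1", "R2", "R3"), ("R1", "R4", "R5")))  # ['WAW']
```

Only RAW reflects a real flow of data; WAR and WAW are name conflicts that register renaming can remove.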
Page 29
Branch hazards
• Branch instructions, particularly conditional branches, create control dependencies between the branch
instruction and the instructions that follow it into the fetch stage of the pipeline.
• Since the branch instruction computes the address of the next instruction that the fetch stage should fetch,
this takes some time, and additional time is required to flush the pipeline and fetch instructions from the
target location.
• This wasted time is called the branch penalty.
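The cost of the branch penalty on overall throughput can be sketched with the usual effective-CPI relation, CPI = 1 + f_branch * penalty. The penalty of 2 cycles and branch frequency of 20% below are assumed example values, not figures from the text.

```python
# Sketch: effective CPI with a branch penalty (assumed example values).
penalty = 2      # cycles flushed per taken branch
f_branch = 0.2   # fraction of instructions that are taken branches
cpi = 1 + f_branch * penalty
print(cpi)  # 1.4 cycles per instruction instead of the ideal 1.0
```

Branch prediction and delayed branching aim to reduce the effective penalty toward zero.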
Page 30
Example:
MOV R0, 77H
MOV R1, 73H
ADD R0, R1
JC NEXT
Here JC (jump if carry) is the conditional branch that introduces the branch hazard: the next instruction to fetch is not known until the ADD has set or cleared the carry flag.
Page 31