
Unit-4

Pipeline and Vector Processing


Contents
● Parallel Processing
● Pipelining
● Arithmetic Pipeline
● Instruction Pipeline
● RISC Pipeline
● Vector Processing
● Array Processors
Parallel Processing
● Uses simultaneous data-processing tasks for the purpose of increasing the
computational speed of a computer system.
● Performs concurrent data processing to achieve faster execution time.
● For example:
» while an instruction is being executed in the ALU, the next instruction
can be read from memory.
» The system may have two or more ALUs and be able to execute two
or more instructions at the same time .
» The system may have two or more processors operating concurrently.
● The purpose of parallel processing is to speed up the computer's
processing capability and increase its throughput.
● Throughput: the amount of processing that can be accomplished
during a given interval of time.
Parallel processing is established by distributing the data among the
multiple functional units.
Processor with Multiple Functional Units (Fig. 9-1):
● Separates the execution unit into eight functional units operating in
parallel.
● The adder and integer multiplier perform the arithmetic operations
with integer numbers.
● The floating-point operations are separated into three circuits
operating in parallel.
● The logic, shift, and increment operations can be performed
concurrently on different data.
● All units are independent of each other, so one number can be shifted
while another number is being incremented.
● A multifunctional organization is usually associated with a complex
control unit to coordinate all the activities among the various
components.
Parallel processing can be classified in a number of ways, for example by:
● The internal organization of the processors
● The interconnection structure between processors
● The flow of information through the system
One widely used classification, introduced by M. J. Flynn, considers the number of
instructions and data items that are manipulated simultaneously.

Parallel processing may occur in the instruction stream, the data stream, or both.
Instruction stream: The sequence of instructions read from memory constitutes
an instruction stream.
Data stream: The operations performed on the data in the processor constitute a
data stream.
Flynn's classification divides computers into four major groups as follows:
● Single instruction stream, single data stream – SISD
● Single instruction stream, multiple data stream – SIMD
● Multiple instruction stream, single data stream – MISD
● Multiple instruction stream, multiple data stream – MIMD
SISD –
● Represents the organization of a single computer containing a control
unit, a processor unit, and a memory unit.
● Instructions are executed sequentially.
● Parallel processing may be achieved by means of multiple functional
units or by pipeline processing

SIMD – Includes multiple processing units with a single control unit. All
processors receive the same instruction from the control unit, but operate
on different data.

MISD – Structure is only of theoretical interest since no practical system
has been constructed using this organization.

MIMD – A computer system capable of processing several programs at
the same time.
Applications of Parallel Processing
● Numeric weather prediction
● Finite element analysis
● Artificial Intelligence and Automation
● Genetic Engineering
● Weapon Research and Defense
● Medical Applications

One type of parallel processing that does not fit Flynn's classification is
pipelining.
Pipelining
● Pipelining is a technique of decomposing a sequential process into
suboperations, with each subprocess being executed in a special
dedicated segment that operates concurrently with all other segments
● Each segment performs partial processing dictated by the way the
task is partitioned
● The result obtained from the computation in each segment is
transferred to the next segment in the pipeline
● The final result is obtained after the data have passed through all
segments
● Each segment can be viewed as an input register followed by a
combinational circuit
● A clock is applied to all registers after enough time has elapsed to
perform all segment activity
● The information flows through the pipeline one step at a time.
(Car-assembly analogy: although each car still takes three hours to build,
with pipelining we can produce one car every hour rather than one every
three hours.)
Example: Ai * Bi + Ci for i = 1, 2, 3, ..., 7
Each suboperation is to be implemented in a segment within a pipeline.
Each segment has one or two registers and a combinational circuit as
shown in Fig.9-2.

R1 through R5 are registers that receive new data with every clock pulse.
The multiplier and adder are combinational circuits.
The suboperations performed in each segment within a pipeline are:
R1←Ai , R2←Bi (Input Ai & Bi)
R3←R1 * R2, R4←Ci (Multiply & input Ci)
R5 ← R3 + R4 (Add Ci to product)
The five registers are loaded with new data every clock pulse. The effect
of each clock is shown in Table 9-1 .

It takes three clock pulses to fill up the pipe and retrieve the first output
from R5. From there on, each clock produces a new output and moves the
data one step down the pipeline.
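The loading pattern of Table 9-1 can be reproduced with a short simulation. The following is a minimal Python sketch (not part of the original notes); the operand values are invented, and the simultaneous register load is modelled by computing all new values from the old register contents before assigning them:

```python
# Minimal sketch of the three-segment pipeline of Fig. 9-2 computing Ai*Bi + Ci.
# Registers R1..R5 are modelled as variables that all load simultaneously on
# each clock pulse; the operand values below are made up for the illustration.

A = [1, 2, 3, 4, 5, 6, 7]          # Ai
B = [7, 6, 5, 4, 3, 2, 1]          # Bi
C = [10, 20, 30, 40, 50, 60, 70]   # Ci
n = len(A)

R1 = R2 = R3 = R4 = R5 = None
print(f"{'pulse':>5}  {'R1':>4} {'R2':>4} {'R3':>4} {'R4':>4} {'R5':>5}")
for clock in range(1, n + 3):       # n tasks need n + 2 pulses; first result at pulse 3
    # Compute the new values from the *old* register contents, then load them
    # all at once, mimicking a common clock applied to every register.
    new_R5 = R3 + R4 if R3 is not None else None            # segment 3: add
    new_R3 = R1 * R2 if R1 is not None else None            # segment 2: multiply
    new_R4 = C[clock - 2] if 2 <= clock <= n + 1 else None  # segment 2: input Ci
    new_R1 = A[clock - 1] if clock <= n else None           # segment 1: input Ai
    new_R2 = B[clock - 1] if clock <= n else None           # segment 1: input Bi
    R1, R2, R3, R4, R5 = new_R1, new_R2, new_R3, new_R4, new_R5
    print(f"{clock:>5}  {R1!s:>4} {R2!s:>4} {R3!s:>4} {R4!s:>4} {R5!s:>5}")
```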
● Any operation that can be decomposed into a sequence of suboperations of about
the same complexity can be implemented by a pipeline processor
● The technique is efficient for those applications that need to repeat the same task
many times with different sets of data

The general structure of a four-segment pipeline is illustrated in Fig. 9-3.


● The operands pass through all four segments in a fixed sequence. Each segment
consists of a combinational circuit Si that performs a suboperation over the data
stream flowing through the pipe.
● The segments are separated by registers Ri that hold the intermediate results
between the stages.
● Information flows between adjacent stages under the control of a common clock
applied to all the registers simultaneously.
● A task is the total operation performed going through all segments of a pipeline
The behavior of a pipeline can be illustrated with a space-time diagram:
● This shows the segment utilization as a function of time
● The horizontal axis displays the time in clock cycles and the vertical axis gives the
segment number.
● The diagram shows six tasks T1 through T6 executed in four segments.
● Initially, task T1 is handled by segment 1. After the first clock, segment 2 is busy
with T1, while segment 1 is busy with task T2. Continuing in this manner, the first
task T1 is completed after the fourth clock cycle.
● From then on, the pipe completes a task every clock cycle.
● No matter how many segments there are in the system, once the pipeline is full, it
takes only one clock period to obtain an output
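The space-time diagram itself is easy to generate. The sketch below (an illustration only, assuming task Ti occupies segment j during clock cycle i + j – 1) prints the diagram for the six-task, four-segment case:

```python
# Sketch: print the space-time diagram of a k-segment pipeline executing n tasks
# (task Ti occupies segment j during clock cycle i + j - 1).

def space_time_diagram(k, n):
    total = k + n - 1                       # total clock cycles needed
    print("cycle:  " + "".join(f"{c:>4}" for c in range(1, total + 1)))
    for seg in range(1, k + 1):
        cells = []
        for cycle in range(1, total + 1):
            t = cycle - seg + 1             # task index occupying this segment now
            cells.append(f"T{t}" if 1 <= t <= n else "-")
        print(f"seg {seg}:  " + "".join(f"{c:>4}" for c in cells))

space_time_diagram(k=4, n=6)                # the six-task, four-segment example (Fig. 9-4)
```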
● Consider a k-segment pipeline with a clock cycle time tp to execute n
tasks
● The first task T1 requires time k tp to complete
● The remaining n-1 tasks finish at the rate of one task per clock cycle and
will be completed after time (n-1)tp
● The total time to complete the n tasks is [k+n-1]tp
● The example of Figure 9-4 requires [4 + 6 – 1] = 9 clock cycles to finish

● Consider a nonpipeline unit that performs the same operation and takes tn
time to complete each task
● The total time to complete n tasks would be n tn

● The speedup of pipeline processing over an equivalent nonpipeline
processing is defined by the ratio
S = n tn / [(k + n – 1) tp]
● As the number of tasks n increases, k + n – 1 approaches n, and the
speedup becomes
S = tn / tp
● If we assume that the time to process a task is the same in both
circuits, tn = k tp, so that
S = k tp / tp = k
● Therefore, the theoretical maximum speedup that a pipeline can
provide is k (the number of segments)
Example:
● Cycle time = tp = 20 ns
● # of segments = k = 4
● # of tasks = n = 100
The pipeline system will take (k + n – 1) tp = (4 + 100 – 1) * 20 ns = 2060 ns
Assuming that tn = k tp = 4 * 20 = 80 ns,
a nonpipeline system requires n tn = 100 * 80 = 8000 ns
The speedup ratio = 8000/2060 = 3.88
As the number of tasks increases, the speedup will approach 4, which is
equal to the number of segments in the pipeline.
If instead we assume that tn = 60 ns, the speedup becomes 60/20 = 3.
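The same arithmetic can be checked with a few lines of Python; the values are those of the example above, and tn = k·tp is the stated assumption for the equivalent nonpipeline unit:

```python
# Quick check of the example above (values from the text).
k, n, tp = 4, 100, 20e-9            # segments, tasks, pipeline clock cycle (20 ns)
tn = k * tp                         # assumed nonpipeline task time = 80 ns

pipeline_time    = (k + n - 1) * tp       # (4 + 100 - 1) * 20 ns = 2060 ns
nonpipeline_time = n * tn                 # 100 * 80 ns = 8000 ns
speedup          = nonpipeline_time / pipeline_time

print(f"pipeline time    : {pipeline_time * 1e9:.0f} ns")
print(f"nonpipeline time : {nonpipeline_time * 1e9:.0f} ns")
print(f"speedup          : {speedup:.2f}  (approaches k = {k} as n grows)")
```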
The pipeline cannot operate at its maximum theoretical rate:
● One reason is that different segments may take different times to
complete their suboperations; the clock cycle must be chosen to equal
the time delay of the segment with the maximum propagation time.
This causes all other segments to waste time while waiting for the
next clock.
● Moreover, a nonpipeline circuit will not always have the same time delay
as that of an equivalent pipeline circuit, since many of the intermediate
registers are not needed in a single-unit circuit
● Nevertheless, the pipeline technique provides a faster operation over
a purely serial sequence even though the maximum theoretical speed
is never fully achieved.
Arithmetic Pipeline
● Pipeline arithmetic units are usually found in very high speed computers.
● They are used to implement floating-point operations, multiplication of fixed-
point numbers, and similar computations encountered in scientific problems.

Example: floating-point addition and subtraction
● Inputs are two normalized floating-point binary numbers
X = A x 2^a
Y = B x 2^b
● A and B are two fractions that represent the mantissas
● a and b are the exponents
Four segments are used to perform the floating-point addition and subtraction:
● Compare the exponents
● Align the mantissas
● Add or subtract the mantissas
● Normalize the result
● Consider the two normalized floating-point numbers:
X = 0.9504 x 10^3
Y = 0.8200 x 10^2
● The two exponents are subtracted in the first segment to obtain 3-2=1
● The larger exponent 3 is chosen as the exponent of the result.
● Segment 2 shifts the mantissa of Y to the right to obtain
X = 0.9504 x 10^3
Y = 0.0820 x 10^3
● The mantissas are now aligned.
● Segment 3 produces the sum Z = 1.0324 x 10^3
● Segment 4 normalizes the result so that it has a fraction with a
nonzero first digit, by shifting the mantissa once to the right and
incrementing the exponent by one to obtain
Z = 0.10324 x 10^4
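As an illustration only, the four segments can be sketched in Python using decimal (mantissa, exponent) pairs; the function name fp_add and the decimal representation are choices made for this sketch rather than anything defined in the notes, and binary floating point may show small rounding in the printed mantissa:

```python
# Illustrative sketch of the four floating-point addition segments, using
# decimal (mantissa, exponent) pairs as in the worked example above.

def fp_add(x, y):
    (a_man, a_exp), (b_man, b_exp) = x, y

    # Segment 1: compare the exponents (by subtraction) and choose the larger one.
    diff = a_exp - b_exp
    exp = max(a_exp, b_exp)

    # Segment 2: align the mantissas by shifting the smaller operand to the right.
    if diff > 0:
        b_man = b_man / (10 ** diff)
    else:
        a_man = a_man / (10 ** -diff)

    # Segment 3: add (or subtract) the aligned mantissas.
    man = a_man + b_man

    # Segment 4: normalize so the fraction has a nonzero first digit
    # (underflow normalization to the left is omitted for brevity).
    while abs(man) >= 1.0:
        man /= 10
        exp += 1
    return round(man, 6), exp   # rounded only to keep the printed decimal tidy

# X = 0.9504 x 10^3, Y = 0.8200 x 10^2  ->  Z = 0.10324 x 10^4
print(fp_add((0.9504, 3), (0.8200, 2)))
```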
Instruction Pipeline
● An instruction pipeline reads consecutive instructions from memory
while previous instructions are being executed in other segments.
● This causes the instruction fetch and execute phases to overlap and perform
simultaneous operations.
● If a branch out of sequence occurs, the pipeline must be emptied and all the
instructions that have been read from memory after the branch instruction must
be discarded.
● Consider a computer with an instruction fetch unit and an instruction execution
unit forming a two segment pipeline.
● A FIFO buffer can be used for the fetch segment.
● Thus, an instruction stream can be placed in a queue, waiting for decoding and
processing by the execution segment.
● This reduces the average access time to memory for reading instructions.
● Whenever there is space in the buffer, the control unit initiates the next
instruction fetch phase.
Instruction Pipeline
The following steps are needed to process each instruction:
1) Fetch the instruction from memory
2) Decode the instruction
3) Calculate the effective address
4) Fetch the operands from memory
5) Execute the instruction
6) Store the result in the proper place

Example: Four-segment instruction pipeline

1. FI is the segment that fetches an instruction.
2. DA is the segment that decodes the instruction and calculates the effective
address.
3. FO is the segment that fetches the operand.
4. EX is the segment that executes the instruction.

(Assume that most of the instructions store the result in a register so that the
execution and storing of the result can be combined in one segment.)
Instruction cycle in the
CPU processed with a
four-segment pipeline.
● Figure shows the operation of the instruction pipeline. The time in the horizontal axis
is divided into steps of equal duration.
● Up to four suboperations in the instruction cycle can overlap and up to four different
instructions can be in progress of being processed at the same time
● It is assumed that the processor has separate instruction and data memories
so that the operations in FI and FO can proceed at the same time.
● Assume that instruction 3 is a branch instruction. As soon as this instruction is
decoded in segment DA in step 4, the transfer from FI to DA of the other instructions
is halted until the branch instruction is executed in step 6. (A simplified simulation of
this behaviour is sketched below.)
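The following Python sketch illustrates this behaviour under a simplifying assumption (it does not reproduce the figure exactly): the instruction after a branch enters FI only once the branch has completed its EX segment.

```python
# Simplified sketch (assumed stall policy): instructions flow through
# FI-DA-FO-EX one segment per step; after a branch is fetched, the next
# instruction enters FI only once the branch has completed EX.

def schedule(n_instr, branch_at=None):
    """Return {instruction number: {step: segment}} for the 4-segment pipeline."""
    segments = ["FI", "DA", "FO", "EX"]
    timeline = {}
    start = 1                            # step at which the next instruction enters FI
    for i in range(1, n_instr + 1):
        timeline[i] = {start + s: segments[s] for s in range(4)}
        # A branch fetched at `start` executes (EX) at start+3; the instruction
        # after it is only fetched in the following step.
        start += 4 if i == branch_at else 1
    return timeline

timeline = schedule(5, branch_at=3)      # instruction 3 is the branch
last = max(t for steps in timeline.values() for t in steps)
print("step:    " + " ".join(f"{t:>2}" for t in range(1, last + 1)))
for instr, steps in timeline.items():
    row = [steps.get(t, "--") for t in range(1, last + 1)]
    print(f"instr {instr}: " + " ".join(f"{s:>2}" for s in row))
```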
Instruction Pipeline
There are three major difficulties that cause the instruction pipeline to
deviate from its normal operation :
● Resource conflicts caused by access to memory by two segments
at the same time. Most of these conflicts can be resolved by using
separate instruction and data memories.
● Data dependency conflicts arise when an instruction depends on
the result of a previous instruction, but this result is not yet
available.
● Branch difficulties arise from branch and other instructions that
change the value of the PC.
RISC Pipeline
The simplicity of the RISC instruction set can be utilized to implement an
instruction pipeline using a small number of suboperations, with each being
executed in one clock cycle.
The instruction cycle can be divided into three suboperations and
implemented in three segments:
I: Instruction fetch: The I segment fetches the instruction from program
memory.
A: ALU operation : The instruction is decoded and an ALU operation is
performed in the A segment. The ALU is used for three different functions,
depending on the decoded instruction. It performs an operation for a data
manipulation instruction, it evaluates the effective address for a load or store
instruction, or it calculates the branch address for a program control
instruction.
E: Execute instruction : The E segment directs the output of the ALU to one
of three destinations, depending on the decoded instruction. It transfers the
result of the ALU operation into a destination register in the register file, it
transfers the effective address to a data memory for loading or storing, or it
transfers the branch address to the program counter.
Consider now the operation of the
following four instructions:
1. LOAD: R1 <-- M[address 1]
2. LOAD: R2 <-- M[address 2]
3. ADD: R3 <-- R1 + R2
4. STORE: M[address 3] <-- R3

The E segment in clock cycle 4 is in the process of placing the memory data
into R2. The A segment in clock cycle 4 is using the data from R2, but the
value in R2 will not be the correct value since it has not yet been
transferred from memory.
It is up to the compiler to make sure that the instruction immediately
following the load does not use the data being fetched from memory. If the
compiler cannot find a useful instruction to put after the load, it inserts a
no-op (no-operation) instruction. This is a type of instruction that is
fetched from memory but has no operation, thus wasting a clock cycle.
This concept of delaying the use of the data loaded from memory is
referred to as delayed load.
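A small Python sketch (illustrative only; the shortened mnemonics and the one-instruction-per-cycle schedule are assumptions of the sketch) prints the three-segment timing for the program above, with and without the compiler-inserted no-op:

```python
# Illustrative sketch: three-segment RISC pipeline (I = instruction fetch,
# A = decode/ALU, E = execute), one instruction entering per cycle.
# The mnemonics are shortened forms of the four instructions above.

def show(program):
    n_cycles = len(program) + 2                  # (k + n - 1) with k = 3 segments
    print("cycle:      " + "".join(f"{c:>6}" for c in range(1, n_cycles + 1)))
    for i, instr in enumerate(program, start=1):
        cells = ["  ----"] * n_cycles
        for s, seg in enumerate(["I", "A", "E"]):
            cells[i - 1 + s] = f"{seg:>6}"       # instruction i is in segment s at cycle i+s
        print(f"{instr:<12}" + "".join(cells))

conflict = ["LOAD R1", "LOAD R2", "ADD R3", "STORE R3"]
fixed    = ["LOAD R1", "LOAD R2", "NOP", "ADD R3", "STORE R3"]

print("Without a no-op: ADD's A segment uses R2 in cycle 4, while LOAD R2's E")
print("segment is still writing it in the same cycle (data conflict):")
show(conflict)
print("\nWith the compiler-inserted no-op (delayed load), ADD uses R2 in cycle 5:")
show(fixed)
```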
Practice Questions
Q1: Draw the space-time diagram for a 4-segment pipeline executing
six tasks.

Q2: Specify the pipeline configuration to carry out the computation
(Ai + Bi)(Ci + Di). Also list the contents of the registers in the pipeline for
i = 1 through 6.

Q3: Perform the addition of the following floating point numbers using
arithmetic pipeline.
X = 0.9504 x 10^3
Y = 0.8200 x 10^2
Also draw the diagram representing pipeline for the floating point
addition and subtraction.
Practice Questions
Q4: Determine the number of clock cycles it takes to process 200
tasks in a 6 segment pipeline.
Solution : k = 6 segments, n = 200 tasks
(k + n – 1) = 6 + 200 – 1 = 205 cycles

Q5: A nonpipeline system takes 50 ns to process a task. The same task can
be processed in a six-segment pipeline with a clock cycle of 10 ns.
Determine the speedup ratio of the pipeline for 100 tasks. What is the
maximum speedup that can be achieved?
Solution: tn = 50 ns, k = 6, tp = 10 ns, n = 100
Speedup S = n tn / [(k + n – 1) tp] = (100 * 50) / (105 * 10) = 5000/1050 ≈ 4.76
Maximum speedup = tn / tp = 50/10 = 5
Practice Questions
Q6: The time delays of the four segments in the pipeline are as follows:
t1 = 50 ns, t2 = 30 ns, t3 = 95 ns, and t4 = 45 ns. The interface register
delay time = 5 ns.
a. How long would it take to add 100 pairs of numbers in the
pipeline?
b. How can we reduce the total time to about one-half of the time
calculated in part (a)?
Solution:
a. The clock cycle must equal the largest segment delay plus the register
delay: tp = 95 + 5 = 100 ns. Total time = (k + n – 1) tp = (4 + 100 – 1)(100)
= 10,300 ns.
b. Subdivide the 95-ns segment into two segments of roughly equal delay;
the clock cycle then drops to about 50 + 5 = 55 ns, and with k = 5 the total
time is (5 + 100 – 1)(55) = 5720 ns, roughly half of part (a).
Practice Questions
Q7: Consider the four instructions in the following program.
Suppose that the first instruction starts from step 1 in the
instruction pipeline discussed above. Specify what operations are
performed in the four segments during step 4.
Solution:
