Unit 4 COA
Parallel computing is a form of computing in which jobs are broken into discrete parts that can be executed concurrently. Each part is further broken down into a series of instructions, and instructions from each part execute simultaneously on different CPUs. Parallel systems deal with the simultaneous use of multiple computer resources, which can include a single computer with multiple processors, a number of computers connected by a network to form a parallel processing cluster, or a combination of both. Parallel systems are more difficult to program than computers with a single processor because the architecture of parallel computers varies widely and the processes running on multiple CPUs must be coordinated and synchronized.
The crux of parallel processing is the CPU. Based on the number of instruction streams and data streams that can be processed simultaneously, computing systems are classified into four major categories, known as Flynn's classification: SISD (Single Instruction stream, Single Data stream), SIMD (Single Instruction stream, Multiple Data streams), MISD (Multiple Instruction streams, Single Data stream), and MIMD (Multiple Instruction streams, Multiple Data streams).
The speed of the processing element in the SISD model is limited by the rate at which the computer can transfer information internally. Dominant representative SISD systems are the IBM PC and workstations.
MIMD machines are broadly categorized into shared-memory MIMD and distributed-memory MIMD based on the way the PEs (processing elements) are coupled to the main memory.
In the shared-memory MIMD model (tightly coupled multiprocessor systems), all the PEs are connected to a single global memory and all have access to it. Communication between PEs in this model takes place through the shared memory; a modification of the data stored in the global memory by one PE is visible to all the other PEs. Dominant representative shared-memory MIMD systems are Silicon Graphics machines and Sun/IBM SMP (Symmetric Multi-Processing) machines.
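As an illustration of the shared-memory model, below is a minimal sketch using POSIX threads to stand in for PEs: two threads update a single global variable, and a mutex provides the coordination mentioned above. The two-thread setup and the summation workload are assumptions made only for this example (compile with -lpthread).

/* Shared-memory MIMD sketch: two "PEs" (threads) share one global memory. */
#include <pthread.h>
#include <stdio.h>

static long shared_sum = 0;                       /* the single global memory  */
static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

static void *pe_work(void *arg)
{
    long start = (long)arg;
    for (long i = start; i < start + 1000; i++) {
        pthread_mutex_lock(&lock);                /* coordinate access         */
        shared_sum += i;                          /* update visible to all PEs */
        pthread_mutex_unlock(&lock);
    }
    return NULL;
}

int main(void)
{
    pthread_t pe[2];
    pthread_create(&pe[0], NULL, pe_work, (void *)0L);
    pthread_create(&pe[1], NULL, pe_work, (void *)1000L);
    pthread_join(pe[0], NULL);
    pthread_join(pe[1], NULL);
    printf("shared_sum = %ld\n", shared_sum);     /* sum of 0..1999 = 1999000  */
    return 0;
}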
In distributed-memory MIMD machines (loosely coupled multiprocessor systems), all PEs have a local memory. Communication between PEs in this model takes place through the interconnection network (the inter-process communication, or IPC, channel). The network connecting the PEs can be configured as a tree, a mesh, or another topology in accordance with the requirements.
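For contrast, here is a minimal sketch of the distributed-memory model: each PE is modelled as a separate process with its own address space, and a pipe plays the role of the IPC channel over which results are exchanged. The partial-sum workload is again an assumption made only for this example.

/* Distributed-memory MIMD sketch: each "PE" is a process with local memory,
 * and a pipe acts as the interconnection (IPC) channel between them. */
#include <stdio.h>
#include <unistd.h>
#include <sys/wait.h>

int main(void)
{
    int channel[2];                                /* [0] = read end, [1] = write end */
    if (pipe(channel) == -1)
        return 1;

    if (fork() == 0) {                             /* child PE with its own memory    */
        long local_sum = 0;                        /* invisible to the parent PE      */
        for (long i = 0; i < 1000; i++)
            local_sum += i;
        write(channel[1], &local_sum, sizeof local_sum);   /* send result over IPC    */
        _exit(0);
    }

    long child_part = 0, parent_sum = 0;
    for (long i = 1000; i < 2000; i++)             /* parent PE works on its own half */
        parent_sum += i;
    read(channel[0], &child_part, sizeof child_part);      /* receive child's result  */
    wait(NULL);
    printf("total = %ld\n", parent_sum + child_part);      /* sum of 0..1999          */
    return 0;
}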
The shared-memory MIMD architecture is easier to program but is less tolerant of failures and harder to extend than the distributed-memory MIMD model. A failure in a shared-memory MIMD system affects the entire system, whereas this is not the case in the distributed model, in which each of the PEs can be easily isolated. Moreover, shared-memory MIMD architectures are less likely to scale, because the addition of more PEs leads to memory contention; this situation does not arise with distributed memory, in which each PE has its own memory. As a result of practical outcomes and users' requirements, the distributed-memory MIMD architecture is generally considered superior to the other existing models.
Parallel processing is used to increase the computational speed of computer systems by performing multiple data-processing operations simultaneously. For example, while an instruction is being executed in the ALU, the next instruction can be read from memory. The system can have two or more ALUs and be able to execute multiple instructions at the same time. In addition, a system may have two or more processors operating concurrently. Parallel processing increases a computer's processing capacity, but the amount of hardware grows with it, and with it the cost of the system. However, technological development has reduced hardware costs to the point where parallel processing methods are economically feasible.
Parallel processing can be established at various levels of complexity. At the lowest level, parallel and serial operations are distinguished by the type of registers used: shift registers operate one bit at a time in a serial fashion, while parallel registers operate simultaneously on all the bits of a word. At higher levels of complexity, parallel processing is achieved by providing a plurality of functional units that perform identical or different operations simultaneously, and by distributing the data among these functional units. For example, the arithmetic, shift, and logic operations can be separated into three units, with the operands diverted to each unit under the supervision of a control unit. One possible method of dividing the execution unit into eight functional units operating in parallel is shown in the figure. Depending on the operation specified by the instruction, the operands in the registers are transferred to the unit associated with that operation. The operation performed in each functional unit is denoted in each block of the diagram.
The arithmetic operations with integer numbers are performed by the adder and integer multiplier.
Floating-point operations can be divided among three circuits operating in parallel. Logic, shift, and increment operations are performed concurrently on different data, and all the units are independent of each other, so one number can be shifted while another number is being incremented. Generally, a multi-functional organization is associated with a complex control unit that coordinates all the activities among the several components.
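The routing of operands to separate functional units can be sketched, very roughly, as a dispatcher that sends the operands to an adder, a shifter, or an incrementer according to the operation specified. The opcode names and the particular set of units below are hypothetical, and in real hardware the units would operate simultaneously rather than being invoked one after another.

/* Sketch: operands are routed to one of several functional units according to
 * the operation specified by the instruction (hypothetical opcodes). */
#include <stdio.h>

typedef enum { OP_ADD, OP_SHIFT_LEFT, OP_INCREMENT } opcode_t;

static int adder(int a, int b)     { return a + b;  }   /* integer adder unit */
static int shifter(int a, int amt) { return a << amt; } /* shift unit         */
static int incrementer(int a)      { return a + 1;  }   /* incrementer unit   */

/* The "control unit": selects the functional unit for the given operation. */
static int dispatch(opcode_t op, int a, int b)
{
    switch (op) {
    case OP_ADD:        return adder(a, b);
    case OP_SHIFT_LEFT: return shifter(a, b);
    case OP_INCREMENT:  return incrementer(a);
    }
    return 0;
}

int main(void)
{
    printf("%d %d %d\n",
           dispatch(OP_ADD, 3, 4),          /* 7  */
           dispatch(OP_SHIFT_LEFT, 1, 3),   /* 8  */
           dispatch(OP_INCREMENT, 9, 0));   /* 10 */
    return 0;
}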
The main advantage of parallel processing is that it provides better utilization of system resources by increasing resource multiplicity, which improves overall system throughput.
COA Pipelining
The term Pipelining refers to a technique of decomposing a sequential process into sub-operations,
with each sub-operation being executed in a dedicated segment that operates concurrently with all
other segments.
The most important characteristic of a pipeline technique is that several computations can be in
progress in distinct segments at the same time. The overlapping of computation is made possible by
associating a register with each segment in the pipeline. The registers provide isolation between
each segment so that each can operate on distinct data simultaneously.
The structure of a pipeline organization can be represented simply by including an input register for
each segment followed by a combinational circuit.
Let us consider an example of combined multiplication and addition operation to get a better
understanding of the pipeline organization.
The combined multiplication and addition operation is performed on a stream of numbers of the form Ai * Bi + Ci for i = 1, 2, 3, ...
The operation to be performed on the numbers is decomposed into sub-operations, with each sub-operation implemented in a segment within a pipeline.
The sub-operations performed in each segment of the pipeline are defined as:
R1 ← Ai, R2 ← Bi        Input Ai and Bi
R3 ← R1 * R2, R4 ← Ci   Multiply and input Ci
R5 ← R3 + R4            Add Ci to product
The following block diagram represents the combined as well as the sub-operations performed in
each segment of the pipeline.
Registers R1, R2, R3, and R4 hold the data, and the combinational circuits operate on it in a particular segment.
The output generated by the combinational circuit in a given segment is applied to the input register of the next segment. For instance, from the block diagram, we can see that register R3 is used as one of the input registers for the combinational adder circuit.
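A small simulation of this three-segment pipeline is sketched below, assuming the sub-operations listed earlier. Each loop iteration represents one clock pulse, and the sample values of Ai, Bi, and Ci are arbitrary; the point is that once the pipeline has filled, one result R5 emerges on every subsequent clock pulse.

/* Simulation of the three-segment pipeline computing Ai*Bi + Ci. Segments are
 * evaluated back-to-front so that each register still holds the value it was
 * given on the previous clock pulse. */
#include <stdio.h>

#define N 7

int main(void)
{
    int A[N] = {1, 2, 3, 4, 5, 6, 7};   /* arbitrary sample streams */
    int B[N] = {7, 6, 5, 4, 3, 2, 1};
    int C[N] = {1, 1, 1, 1, 1, 1, 1};
    int R1 = 0, R2 = 0, R3 = 0, R4 = 0, R5 = 0;

    for (int clock = 0; clock < N + 2; clock++) {        /* +2 pulses to drain     */
        R5 = R3 + R4;                                    /* segment 3: add Ci      */
        R3 = R1 * R2;                                    /* segment 2: multiply    */
        R4 = (clock >= 1 && clock <= N) ? C[clock - 1] : 0;   /* ... and load Ci   */
        if (clock < N) { R1 = A[clock]; R2 = B[clock]; } /* segment 1: load Ai, Bi */
        if (clock >= 2)                                  /* valid after the fill   */
            printf("pulse %d: R5 = A*B + C = %d\n", clock + 1, R5);
    }
    return 0;
}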
In general, the pipeline organization is applicable to two areas of computer design:
1. Arithmetic Pipeline
2. Instruction Pipeline
1. Arithmetic Pipeline:
Components:
o Addition Stage: In this stage, the pipeline performs the addition operation. Addition is a fundamental arithmetic operation and is frequently broken down into sub-operations for efficient processing.
o Division Stage: Division is another arithmetic operation that can take advantage of pipelining. Dividing two numbers involves more than one step, and breaking the procedure down into pipeline stages can improve the overall speed of execution.
Advantages:
o Optimized Resource Utilization: The pipeline structure allows processing resources to be used efficiently. While one arithmetic operation is in the multiplication stage, another can be in the addition stage, maximizing the utilization of the processor.
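The stages of an arithmetic pipeline are not spelled out above, so the sketch below uses the standard textbook decomposition of floating-point addition (compare exponents, align mantissas, add mantissas, normalize) as a stand-in; the operand values and the decimal representation are assumptions chosen for readability, and only one operand pair is traced through the four stages (a real pipeline would hold a different pair in every stage on each clock pulse). Compile with -lm.

/* Sketch of a four-stage floating-point addition pipeline, traced for one
 * operand pair: X = 0.9504 * 10^3 and Y = 0.8200 * 10^2 (assumed values). */
#include <stdio.h>
#include <math.h>

int main(void)
{
    double mx = 0.9504, my = 0.8200;    /* mantissas */
    int    ex = 3,      ey = 2;         /* exponents */

    /* Stage 1: compare the exponents and choose the larger one. */
    int e = (ex > ey) ? ex : ey;

    /* Stage 2: align the mantissa belonging to the smaller exponent. */
    if (ex > ey) my /= pow(10.0, ex - ey);
    else         mx /= pow(10.0, ey - ex);

    /* Stage 3: add the mantissas. */
    double m = mx + my;                 /* 0.9504 + 0.0820 = 1.0324 */

    /* Stage 4: normalize the result. */
    if (m >= 1.0) { m /= 10.0; e += 1; }

    printf("result = %.5f * 10^%d\n", m, e);   /* 0.10324 * 10^4 */
    return 0;
}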
2. Instruction Pipeline:
Components:
o Instruction Fetch (IF): The first stage involves fetching the instruction from memory. The program counter is used to determine the address of the next instruction.
o Instruction Decode (ID): In this stage, the fetched instruction is decoded to determine the operation to be performed and to identify the operands involved.
o Execution (EX): The actual computation or operation specified by the instruction takes place in this stage. It may involve arithmetic or logical operations.
o Memory Access (MEM): If the instruction requires access to memory, this is the stage in which data is read from or written to memory.
o Write Back (WB): The final stage involves writing the results back to a register or to memory, completing the execution of the instruction.
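A compact sketch of how these five stages overlap is given below: on every clock cycle each instruction in flight advances one stage, so a new instruction can enter IF while older ones occupy ID, EX, MEM, and WB. The program prints the usual space-time diagram; the number of instructions is an arbitrary assumption.

/* Space-time diagram for a five-stage instruction pipeline (IF ID EX MEM WB).
 * Instruction i (counting from 0) occupies stage s during clock cycle i + s. */
#include <stdio.h>

int main(void)
{
    const char *stage[5] = {"IF ", "ID ", "EX ", "MEM", "WB "};
    int n = 4;                                   /* number of instructions (assumed) */
    int cycles = n + 5 - 1;                      /* cycles needed to finish them all */

    printf("cycle:");
    for (int c = 0; c < cycles; c++)
        printf(" %3d", c + 1);
    printf("\n");

    for (int i = 0; i < n; i++) {                /* one row per instruction */
        printf("  I%d: ", i + 1);
        for (int c = 0; c < cycles; c++) {
            int s = c - i;                       /* stage occupied in cycle c */
            if (s >= 0 && s < 5) printf(" %s", stage[s]);
            else                 printf("    ");
        }
        printf("\n");
    }
    return 0;
}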
Advantages:
o Improved Throughput: The instruction pipeline allows a continuous flow of instructions through the processor, improving overall throughput. While one instruction is in the execution stage, another can be in the decoding stage, resulting in better resource utilization.
o Faster Program Execution: By overlapping the execution of instructions, the time taken to execute a sequence of instructions is reduced. This results in faster program execution, a vital element in improving the overall performance of a computer system.
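To make the throughput claim concrete, a commonly used estimate compares n tasks on a k-segment pipeline with clock period tp against the same tasks executed one at a time in tn time units each, giving a speedup of S = (n * tn) / ((k + n - 1) * tp), which approaches k for large n. The figures used below (k = 5 segments, tp = 20 ns, tn = k * tp) are assumptions chosen only to show the trend.

/* Pipeline speedup estimate S = (n * tn) / ((k + n - 1) * tp) under assumed
 * figures: k = 5 segments, tp = 20 ns per pulse, tn = k * tp per task. */
#include <stdio.h>

int main(void)
{
    int    k  = 5;                     /* number of pipeline segments       */
    double tp = 20.0;                  /* pipeline clock period (ns)        */
    double tn = k * tp;                /* non-pipelined time per task (ns)  */

    for (int n = 1; n <= 1000; n *= 10) {
        double s = (n * tn) / ((k + n - 1) * tp);
        printf("n = %4d  speedup = %.2f\n", n, s);   /* tends toward k = 5 */
    }
    return 0;
}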
Conclusion
In short, pipelining stands as a cornerstone of processor design, offering a systematic and effective technique for improving overall performance through parallelism. Its applications range from simple instruction pipelines to the superscalar architectures seen in modern CPUs. The evolution of pipelining techniques, coupled with improvements in the memory hierarchy and in instruction-level parallelism (ILP), continues to increase the power of computer systems, pushing the bounds of their computational capabilities. Going forward, the principles of pipelining will likely remain central to the continued pursuit of faster and more reliable computing systems.