Zareen 6

Uploaded by

Jehangir Vakil

Chapter 2

Data Level Parallelism

Book 1 – Computer Architecture: A Quantitative Approach, Hennessy and Patterson,
5th Edition, Morgan Kaufmann, 2012
Chapter 4 – Data-Level Parallelism in Vector, SIMD, and GPU Architectures
Parallelism

Classes of parallelism in applications

• Data Level Parallelism

o Many data items can be operated on at the same time

• Task Level Parallelism

o Different tasks are created that can operate independently and largely in parallel
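
The two classes can be contrasted with a minimal sketch. Python is used purely for illustration, and the function names (`scale_all`, `total`, `largest`, `run_tasks`) are hypothetical, not from the slides. The first function applies one operation to many independent data items; the second runs two different tasks that do not depend on each other.

```python
from concurrent.futures import ThreadPoolExecutor


def scale_all(values, factor):
    """Data-level parallelism: the same operation on every element.

    No iteration depends on another, so hardware could process
    many elements at the same time.
    """
    return [v * factor for v in values]


def total(values):
    """One independent task: sum the values."""
    return sum(values)


def largest(values):
    """A different independent task: find the maximum."""
    return max(values)


def run_tasks(values):
    """Task-level parallelism: two different tasks submitted side by side."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        f_sum = pool.submit(total, values)
        f_max = pool.submit(largest, values)
        return f_sum.result(), f_max.result()
```

The thread pool only shows the structure of independent tasks; it is not a claim about how any particular machine schedules them.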
Flynn’s Taxonomy
SIMD
True SIMD
• One CPU (control unit) + multiple ALUs (processing elements, PEs), each with its own memory (the memory can also be shared)

Pipelined SIMD
• One CPU (control unit) + a pipelined ALU
• The ALU stages work in a pipelined manner, not independently
Data Level Parallelism: Single Instruction Stream, Multiple Data Stream (SIMD)
Three variants

• Vector Architectures

• Multimedia SIMD Extensions

• GPUs, APUs
Vector Processing
• Vector – a set of scalar data elements, all of the same type, stored in memory
• Vector Processor – an ensemble of hardware resources, including vector
registers, functional pipelines, processing elements, and register counters for
performing vector operations
• Vector Processing occurs when arithmetic and logical operations are applied to
vectors
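
As a rough software analogy for these definitions (an illustrative sketch, not how a vector processor is implemented), Python's `array` type holds scalar elements that all share one type, and the hypothetical helper `vector_op` below applies one arithmetic or logical operation across two such vectors.

```python
from array import array
import operator


def vector_op(op, a, b):
    """Apply one binary operation element by element to two vectors.

    Both vectors must have the same element type and the same length,
    matching the definition of a vector as same-type scalar elements.
    """
    if a.typecode != b.typecode or len(a) != len(b):
        raise ValueError("vectors must match in element type and length")
    return array(a.typecode, (op(x, y) for x, y in zip(a, b)))


# Arithmetic operation on 64-bit floating-point elements.
sums = vector_op(operator.add, array("d", [1.0, 2.0]), array("d", [0.5, 0.5]))

# Logical operation on 64-bit integer elements.
masked = vector_op(operator.and_, array("q", [6, 5]), array("q", [3, 4]))
```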
Properties of Vector Processors

• Vector operations: arithmetic (add, sub, mul, div), memory accesses, effective address calculations
• Multiple vector instructions can be in progress at the same time => more parallelism
• Applications that benefit
o Large scientific and engineering applications (simulations, weather forecasting, applications involving large matrix operations)
o Multimedia applications (video codecs, image processing, audio processing)
Basic Vector Architectures
• Vector processor: ordinary pipelined scalar unit + vector unit
• Types of vector processors
o Memory-memory processors: all vector operations are memory to memory (CDC)
o Vector-register processors: all vector operations except load and store are among the vector registers (CRAY-1, CRAY-2, X-MP, Y-MP)
➢ VMIPS: vector processor as an extension of the 5-stage MIPS processor
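
One way to see the difference between the two types is to count memory traffic for the same computation. The sketch below is a deliberately simplified model, not a measurement of any real machine: it takes Y = a*X + Y on n-element vectors, ignores the scalar a, and assumes that a memory-memory machine writes the intermediate product to a temporary vector T in memory. The function names are hypothetical.

```python
def memory_memory_traffic(n):
    """Vector elements moved to or from memory when every
    vector operation is memory to memory."""
    mul = 2 * n  # T = a * X : read X, write T
    add = 3 * n  # Y = T + Y : read T, read Y, write Y
    return mul + add


def vector_register_traffic(n):
    """Vector elements moved when only loads and stores touch memory."""
    # Load X, load Y, store Y; the multiply and add stay in vector registers.
    return 3 * n
```

Under these assumptions, a 64-element vector moves 320 elements in the memory-memory model and 192 in the vector-register model.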
Components of VMIPS Processor
• Vector registers: each vector register is a fixed-length bank holding a single vector
➢ Each vector register has at least 2 read ports and 1 write port
➢ Typically 8 to 32 vector registers, each holding 64 to 128 64-bit elements
➢ VMIPS: 8 vector registers, each holding 64 elements of 64 bits (16 read ports and 8 write ports in total)

• Vector Functional Units (FUs): fully pipelined, can start a new operation every clock cycle
➢ Typically 4 to 8 FUs: FP add, FP multiply, FP reciprocal, integer add, logical, shift
➢ May have multiple copies of the same unit
➢ VMIPS: 5 FUs (FP add/sub, FP mul, FP div, integer, logical)
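
The value of a fully pipelined functional unit can be sketched with a simple cycle count. This is an idealized model with no stalls, and the pipeline depth is an assumed parameter, not a VMIPS figure from the slides. Because a new operation can start every clock cycle, the first result appears after `depth` cycles and one more result follows on each later cycle.

```python
def pipelined_cycles(n_elements, depth):
    """Cycles for n independent element operations on a fully pipelined unit."""
    if n_elements <= 0:
        return 0
    # First result after `depth` cycles, then one result per cycle.
    return depth + n_elements - 1


def unpipelined_cycles(n_elements, depth):
    """Cycles if each operation had to finish before the next could start."""
    return depth * n_elements
```

For a 64-element vector and an assumed depth of 6, the pipelined unit needs 69 cycles, against 384 without pipelining.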
Components of VMIPS Processor (continued)

• Vector Load-Store Units (LSUs)
➢ Fully pipelined
➢ May have multiple LSUs
➢ VMIPS: 1 vector LSU; bandwidth is 1 word per cycle after an initial delay
• Scalar registers
➢ Supply a single element: an FP scalar or an address
➢ VMIPS: 32 GPRs and 32 FPRs; their values are read out and latched at one input of the FUs
• Crossbar connecting the FUs, LSUs, and registers
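
The VMIPS figures quoted in this section can be checked with a little arithmetic. The sketch below records them in a hypothetical `VMIPSConfig` class (the class and function names are illustrative only) and derives the register file size and port totals. The `startup_cycles` argument of `load_cycles` stands for the initial delay of the load-store unit; its value is an assumption, since the slides do not give one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VMIPSConfig:
    vector_registers: int = 8
    elements_per_register: int = 64
    bits_per_element: int = 64
    read_ports_per_register: int = 2
    write_ports_per_register: int = 1

    def register_file_bytes(self):
        """Total vector register storage in bytes."""
        bits = (self.vector_registers
                * self.elements_per_register
                * self.bits_per_element)
        return bits // 8

    def total_read_ports(self):
        return self.vector_registers * self.read_ports_per_register

    def total_write_ports(self):
        return self.vector_registers * self.write_ports_per_register


def load_cycles(n_words, startup_cycles):
    """Cycles to load n words at 1 word per cycle after an initial delay."""
    return startup_cycles + n_words


cfg = VMIPSConfig()
```

With these numbers the vector register file holds 4096 bytes and exposes 16 read ports and 8 write ports, matching the totals above.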
