Vector (Array) Processing and Superscalar Processors

Vector processors can process multiple data items simultaneously using a single instruction, improving efficiency over scalar processors which operate on single data items sequentially. Superscalar processors can execute multiple instructions simultaneously by utilizing multiple functional pipelines within the CPU, improving performance over scalar processors. Both approaches aim to increase parallelism and throughput compared to traditional scalar processors.

Uploaded by

karunakar


A scalar processor is a conventional processor that works on one simple instruction at a time,
with each instruction operating on a single data item. For workloads that involve large amounts
of data, this approach is inefficient, because the overall processing of instructions is slow.

What is Vector (Array) Processing?

There is a class of computational problems that are beyond the practical capabilities of a
conventional computer. These problems require a vast number of computations on
multiple data items, which would take a conventional computer (with a scalar processor) days
or even weeks to complete.
Operations of this kind, which act on multiple data items at the same time, require a
better way of executing instructions, and vector processors provide it.
Scalar CPUs can manipulate only one or two data items at a time, which is not very efficient.
Even a simple instruction such as "ADD A to B and store the result in C" carries overhead:
the operands are given as addresses that point to the memory locations holding the data,
so the data must be looked up before the operation can run. Until the data arrives, the
CPU sits idle, which is a significant performance problem.
This is where the instruction pipeline comes in. In a pipeline, each instruction
passes through several sub-units in turn. The sub-units perform independent
functions: for example, the first decodes the instruction, the second
fetches the data, and the third performs the arithmetic. While the
data is being fetched for one instruction, the CPU does not sit idle; it is already decoding
the next instruction, so the whole unit works like an assembly line.
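The assembly-line effect can be made concrete with a small cycle count. The sketch below is a simplified model, not a description of any real CPU: it assumes the three stages named above (decode, fetch data, execute), that every stage takes exactly one clock cycle, and that no stalls occur. The function names are illustrative.

```python
# Simplified model: three one-cycle stages, no stalls (assumptions, see text).
STAGES = ["decode", "fetch data", "execute"]


def cycles_without_pipeline(n_instructions, n_stages=len(STAGES)):
    """Each instruction passes through every stage before the next one starts."""
    return n_instructions * n_stages


def cycles_with_pipeline(n_instructions, n_stages=len(STAGES)):
    """After the pipeline fills, one instruction completes every cycle."""
    if n_instructions == 0:
        return 0
    return n_stages + (n_instructions - 1)


def pipeline_timeline(n_instructions, stages=STAGES):
    """Return, per clock cycle, which instruction occupies each stage (None = empty)."""
    total = cycles_with_pipeline(n_instructions, len(stages))
    timeline = []
    for cycle in range(total):
        row = {}
        for s, name in enumerate(stages):
            instr = cycle - s  # instruction i reaches stage s at cycle i + s
            row[name] = instr if 0 <= instr < n_instructions else None
        timeline.append(row)
    return timeline


if __name__ == "__main__":
    for cycle, row in enumerate(pipeline_timeline(4)):
        print(cycle, row)
```

Under these assumptions, four instructions need 12 cycles without pipelining and 6 cycles with it. In cycle 2 the model shows instruction 2 being decoded while instruction 1 fetches its data and instruction 0 executes.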
A vector processor not only uses an instruction pipeline but also pipelines the data,
working on multiple data items at the same time.
A normal scalar instruction would be ADD A, B, which adds two
operands. But what if we could instruct the processor to add one group of
numbers (say, memory locations 0 to n) to another group of numbers (say,
memory locations n to k)? This is what vector processors do.
In a vector processor, a single instruction can request multiple data operations. This
saves time, because the instruction is decoded once and then keeps operating on different
data items.
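The "decode once, operate many times" idea can be illustrated with a toy model. Everything in the sketch below is hypothetical: `ToyMachine`, `add` and `vadd` are invented names, memory is a plain Python list, and the loop inside `vadd` models only the result of a vector instruction, not the pipelined or parallel hardware that would produce it.

```python
class ToyMachine:
    """Toy model: flat memory plus a counter of decoded instructions."""

    def __init__(self, memory):
        self.memory = list(memory)
        self.decoded = 0

    def add(self, src_a, src_b, dst):
        """Scalar ADD: one decode, one addition."""
        self.decoded += 1
        self.memory[dst] = self.memory[src_a] + self.memory[src_b]

    def vadd(self, base_a, base_b, base_dst, length):
        """Vector ADD: one decode, then `length` element-wise additions."""
        self.decoded += 1
        for i in range(length):
            self.memory[base_dst + i] = (
                self.memory[base_a + i] + self.memory[base_b + i]
            )


def scalar_sum(machine, base_a, base_b, base_dst, length):
    """The same work expressed as a loop of scalar ADD instructions."""
    for i in range(length):
        machine.add(base_a + i, base_b + i, base_dst + i)


if __name__ == "__main__":
    data = [1, 2, 3, 4, 10, 20, 30, 40, 0, 0, 0, 0]

    scalar = ToyMachine(data)
    scalar_sum(scalar, 0, 4, 8, 4)

    vector = ToyMachine(data)
    vector.vadd(0, 4, 8, 4)

    print(scalar.memory[8:], scalar.decoded)  # four decodes
    print(vector.memory[8:], vector.decoded)  # one decode
```

Both machines end with the same sums in memory, but the scalar version decodes four instructions while the vector version decodes one.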
Applications of Vector Processors
Computers with vector processing capabilities are in demand in specialized applications.
The following are some areas where vector processing is used:

1. Petroleum exploration.
2. Medical diagnosis.
3. Data analysis.
4. Weather forecasting.
5. Aerodynamics and space flight simulations.
6. Image processing.
7. Artificial intelligence.

Superscalar Processors
The term superscalar dates from 1987. A superscalar machine is designed to improve the
performance of the scalar processor. In most applications, most of the operations are on
scalar quantities, so the superscalar approach yields high-performance general-purpose
processors.
The main principle of the superscalar approach is that independent instructions are
executed in different pipelines. As noted earlier, instruction pipelining
introduces parallelism and thereby speeds up instruction processing. In a
superscalar processor, several such pipelines are provided for different operations,
which further increases parallelism.
There are multiple functional units, each of which is implemented as a pipeline. Each
pipeline consists of multiple stages and can therefore hold several instructions at once,
which supports parallel execution of instructions.
Throughput increases because the CPU can execute more than one instruction per clock
cycle. A superscalar processor can therefore be considerably faster than a scalar processor
running at the same clock rate.
A scalar processor works on one or two data items per instruction, while a vector processor
works on multiple data items per instruction. A superscalar processor is a mixture of the two:
each instruction processes one data item, but there are multiple execution units within the
CPU, so multiple instructions can be processing separate data items concurrently.
Although a superscalar CPU is typically also pipelined, superscalar execution and pipelining
are two different performance enhancement techniques. It is possible to have a non-pipelined
superscalar CPU or a pipelined non-superscalar CPU. The superscalar technique is associated
with the following characteristics:

1. Instructions are issued from a sequential instruction stream.
2. The CPU dynamically checks for data dependencies between instructions.
3. The CPU accepts multiple instructions per clock cycle.
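The three characteristics can be sketched together in a few lines. This is a minimal illustration under stated assumptions, not how any particular CPU works: instructions are issued strictly in program order, every instruction completes in one cycle, and the only hazard checked is whether an instruction reads or overwrites a register written by an instruction issued in the same cycle. `Instr` and `issue_in_order` are hypothetical names.

```python
from typing import List, NamedTuple, Tuple


class Instr(NamedTuple):
    dest: str               # register written by the instruction
    srcs: Tuple[str, ...]   # registers read by the instruction


def issue_in_order(program: List[Instr], width: int = 2) -> List[List[Instr]]:
    """Group a sequential instruction stream into per-cycle issue packets."""
    if width < 1:
        raise ValueError("width must be at least 1")
    cycles = []
    pc = 0
    while pc < len(program):
        packet = []
        written = set()  # registers written earlier in this same cycle
        while pc < len(program) and len(packet) < width:
            instr = program[pc]
            # Dynamic dependency check against the current packet.
            depends = instr.dest in written or any(s in written for s in instr.srcs)
            if depends:
                break  # in-order issue: later instructions must wait too
            packet.append(instr)
            written.add(instr.dest)
            pc += 1
        cycles.append(packet)
    return cycles


if __name__ == "__main__":
    program = [
        Instr("r1", ("a", "b")),
        Instr("r2", ("c", "d")),
        Instr("r3", ("r1", "r2")),  # needs both earlier results
        Instr("r4", ("r3", "e")),   # needs r3
    ]
    for cycle, packet in enumerate(issue_in_order(program)):
        print(cycle, [i.dest for i in packet])
```

With a width of two, the first two instructions issue together because they are independent. The last two each wait for a result produced one cycle earlier, so the four instructions take three cycles rather than two.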

Vector (Array) Processor and its Types
Array processors are also known as multiprocessors or vector processors. They perform
computations on large arrays of data. Thus, they are used to improve the performance of the
computer.

Types of Array Processors

There are basically two types of array processors:

1. Attached Array Processors

2. SIMD Array Processors

Attached Array Processors

An attached array processor is a processor which is attached to a general purpose computer and
its purpose is to enhance and improve the performance of that computer in numerical
computational tasks. It achieves high performance by means of parallel processing with multiple
functional units.
SIMD Array Processors
SIMD is the organization of a single computer containing multiple processors operating in
parallel. The processing units operate under the control of a common control unit,
thus providing a single instruction stream and multiple data streams.
In its general form, an array processor contains a set of identical
processing elements (PEs), each of which has a local memory M. Each processing element
includes an ALU and registers. The master control unit controls all the operations of the
processing elements. It also decodes the instructions and determines how each instruction is
to be executed.
The main memory is used for storing the program. The control unit is responsible for fetching
the instructions. Vector instructions are sent to all PEs simultaneously, and the results are
returned to the memory.
The best-known SIMD array processor is the ILLIAC IV computer, built by
the Burroughs Corporation. SIMD processors are highly specialized computers. They are suitable
only for numerical problems that can be expressed in vector or matrix form, and they are not
suitable for other types of computations.
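The organization described above, one control unit broadcasting a decoded instruction to identical processing elements that each hold their own data, can be sketched as follows. The class names, the two-entry instruction set and the `gather` helper are assumptions made for illustration, and the Python loop stands in for what the hardware does in all PEs at once.

```python
import operator


class ProcessingElement:
    """One PE: an ALU (Python operators here) plus its own local memory M."""

    def __init__(self, local_memory):
        self.local = dict(local_memory)

    def execute(self, op, dst, src_a, src_b):
        self.local[dst] = op(self.local[src_a], self.local[src_b])


class ControlUnit:
    """Master control unit: decodes once and sends the operation to every PE."""

    OPS = {"ADD": operator.add, "MUL": operator.mul}  # hypothetical instruction set

    def __init__(self, pes):
        self.pes = list(pes)

    def broadcast(self, mnemonic, dst, src_a, src_b):
        op = self.OPS[mnemonic]      # decode step, done a single time
        for pe in self.pes:          # conceptually simultaneous in hardware
            pe.execute(op, dst, src_a, src_b)

    def gather(self, name):
        """Collect one named value from every PE's local memory."""
        return [pe.local[name] for pe in self.pes]


if __name__ == "__main__":
    a = [1, 2, 3, 4]
    b = [10, 20, 30, 40]
    # Each PE holds one element of each vector in its local memory.
    pes = [ProcessingElement({"a": x, "b": y}) for x, y in zip(a, b)]
    cu = ControlUnit(pes)
    cu.broadcast("ADD", "c", "a", "b")
    print(cu.gather("c"))
```

Each PE sees the same instruction but applies it to different data, which is the single instruction stream and multiple data streams arrangement in miniature.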
Why use the Array Processor?

• Array processors increase the overall instruction processing speed.
• Because most array processors operate asynchronously from the host CPU, they
improve the overall capacity of the system.
• Array processors have their own local memory, which provides extra memory for systems
with low memory.

ARCHIVED: What is superscalar architecture?
This content has been archived, and is no longer maintained by Indiana
University. Information here may no longer be accurate, and links may
no longer be available or reliable.
Superscalar architecture is a method of parallel computing used in many processors. In a
superscalar computer, the central processing unit (CPU) manages multiple instruction pipelines
to execute several instructions concurrently during a clock cycle. This is achieved by feeding the
different pipelines through a number of execution units within the processor. To successfully
implement a superscalar architecture, the CPU's instruction fetching mechanism must
intelligently retrieve and delegate instructions. Otherwise, pipeline stalls may occur, resulting in
execution units that are often idle.

To visualize how this works, consider a hospital surgical unit that consists of areas for
admittance, surgery, and recovery. Patients can move in only one direction, from admittance to
recovery, and it takes the same amount of time to go through each of the areas. Assume the
admitting area can handle three patients at a time and there are three surgical teams, each of
which can work on a single patient. Also assume the recovery area has an indeterminate number
of beds, but can accommodate only one person per bed. When the unit is working correctly, the
admitting area processes three patients at a time, sends one to each of the teams, and immediately
processes another three patients. Even though the surgical teams can handle only one patient at a
time, because there are three of them, they will have passed their charges on by the time the new
ones arrive. The paths the three patients take are analogous to instructions flowing through three
pipelines in a CPU clock cycle. The admitting area is like a fetching mechanism, the surgery
teams are like execution units, and the recovery room is like the registers or cache to which the
units write their results.

To illustrate the kind of problems that can occur in superscalar architectures, consider what
would happen if the staff of the admitting area in the example were not very competent. For
example, if they passed a patient in need of a kidney transplant to a surgical team before the
donor kidney was available, the team wouldn't be able to go to work. Suddenly, there would be a
bottleneck at the admitting area because only two surgical teams would be available for new
patients. Another bottleneck could occur if a surgical team tried to assign a patient to an already
occupied bed in the recovery area. Again, a bottleneck would appear because the team would not
be available until the bed was emptied and the team could move the current patient into it. Stalls
like this happen in processors when an execution unit tries to perform a task that is dependent on
the results of as yet uncalculated instructions. This is why it is important that CPUs carefully
manage the order in which they process instructions.
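To see why the order matters, the sketch below picks, in each cycle, any instructions whose inputs are already available instead of stopping at the first blocked one. It is an illustrative model only: it assumes one-cycle latency, an unlimited number of registers with each register written once (so only read-after-write dependencies exist), and a hypothetical `schedule_ready_first` function operating on `(dest, sources)` pairs.

```python
def schedule_ready_first(program, width=2):
    """Return, per cycle, the indices of the instructions issued in that cycle.

    program: list of (dest, srcs) pairs in program order.
    """
    if width < 1:
        raise ValueError("width must be at least 1")
    done = set()                        # instructions whose results are available
    remaining = list(range(len(program)))
    cycles = []
    while remaining:
        packet = []
        for i in remaining:
            if len(packet) == width:
                break
            _, srcs = program[i]
            # Earlier instructions that produce one of this instruction's inputs.
            producers = [j for j in range(i) if program[j][0] in srcs]
            if all(j in done for j in producers):
                packet.append(i)
        done.update(packet)             # results become visible next cycle
        remaining = [i for i in remaining if i not in packet]
        cycles.append(packet)
    return cycles


if __name__ == "__main__":
    program = [
        ("r1", ("a", "b")),
        ("r2", ("r1", "c")),   # waits for r1
        ("r3", ("d", "e")),    # independent of the first two
        ("r4", ("r3", "f")),   # waits for r3
    ]
    print(schedule_ready_first(program))
```

For this stream the scheduler pairs instruction 0 with instruction 2 and then instruction 1 with instruction 3, finishing in two cycles. A machine with the same width that stalled at the first blocked instruction would issue instruction 0 alone in the first cycle and need three cycles, which is the kind of bottleneck the admitting-area analogy describes.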
This is document aett in the Knowledge Base.
