Chapter 1 - Parallel Architectures
References
• Michael J. Quinn. Parallel Computing: Theory and Practice. McGraw-Hill.
• Albert Y. Zomaya. Parallel and Distributed Computing Handbook. McGraw-Hill.
• Ian Foster. Designing and Building Parallel Programs. Addison-Wesley.
• Ananth Grama, Anshul Gupta, George Karypis, Vipin Kumar. Introduction to Parallel Computing, Second Edition. Addison-Wesley.
• Joseph JaJa. An Introduction to Parallel Algorithms. Addison-Wesley.
• Nguyễn Đức Nghĩa. Parallel Computing (Tính toán song song). Hanoi, 2003.
1.1 Parallel Computing Theory
What is Parallel Computing? (1)
• Traditionally, software has been written for serial
computation:
• To be run on a single computer having a single Central
Processing Unit (CPU);
• A problem is broken into a discrete series of instructions.
• Instructions are executed one after another.
• Only one instruction may execute at any moment in time.
What is Parallel Computing? (2)
(Figure: serial computation, a single stream of instructions executed one at a time on one CPU.)
What is Parallel Computing? (3)
• In the simplest sense, parallel computing is the simultaneous
use of multiple compute resources to solve a computational
problem.
• To be run using multiple CPUs
• A problem is broken into discrete parts that can be solved
concurrently
• Each part is further broken down to a series of instructions
• Instructions from each part execute simultaneously on different CPUs (a minimal sketch follows below)
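• A minimal sketch of this idea (not from the original slides), using only the Python standard library; the chunking scheme and worker count are arbitrary illustration choices. The problem (summing a list) is broken into discrete parts, each part is a series of instructions, and the parts run concurrently on different CPUs:

# Break one problem into discrete parts and solve the parts concurrently,
# then combine the partial results. Python 3, standard library only.
from concurrent.futures import ProcessPoolExecutor

def partial_sum(chunk):
    # Each worker executes the same series of instructions on its own part.
    return sum(chunk)

if __name__ == "__main__":
    data = list(range(1_000_000))
    n_parts = 4
    chunks = [data[i::n_parts] for i in range(n_parts)]   # discrete parts
    with ProcessPoolExecutor(max_workers=n_parts) as pool:
        total = sum(pool.map(partial_sum, chunks))        # parts run on different CPUs
    print(total)                                          # 499999500000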
Parallel Computing: Resources
• The compute resources can include:
• A single computer with multiple processors;
• A single computer with (multiple) processor(s)
and some specialized computer resources (GPU,
FPGA …)
• An arbitrary number of computers connected by
a network;
• A combination of both.
Parallel Computing: The Computational Problem
• The computational problem usually demonstrates characteristics such as the ability to be:
• Broken apart into discrete pieces of work that can be solved simultaneously;
• Executed as multiple program instructions at any moment in time;
• Solved in less time with multiple compute resources than with a single compute resource.
Parallel Computing: what for? (1)
• Parallel computing is an evolution of serial computing that
attempts to emulate what has always been the state of
affairs in the natural world: many complex, interrelated
events happening at the same time, yet within a sequence.
• Some examples:
• Planetary and galactic orbits
• Weather and ocean patterns
• Tectonic plate drift
• Rush hour traffic in Paris
• Automobile assembly line
• Daily operations within a business
• Building a shopping mall
• Ordering a hamburger at the drive through.
Parallel Computing: what for? (2)
• Traditionally, parallel computing has been
considered to be "the high end of computing" and
has been motivated by numerical simulations of
complex systems and "Grand Challenge Problems"
such as:
• weather and climate
• chemical and nuclear reactions
• biology and the human genome
• geology and seismic activity
• mechanical devices, from prosthetics to spacecraft
• electronic circuits
• manufacturing processes
Parallel Computing: what for? (3)
• Today, commercial applications are providing an equal or
greater driving force in the development of faster
computers. These applications require the processing of
large amounts of data in sophisticated ways. Example
applications include:
• parallel databases, data mining
• oil exploration
• web search engines, web based business services
• computer-aided diagnosis in medicine
• management of national and multi-national corporations
• advanced graphics and virtual reality, particularly in the
entertainment industry
• networked video and multi-media technologies
• collaborative work environments
• Ultimately, parallel computing is an attempt to maximize the infinite but seemingly scarce commodity called time.
Why Parallel Computing? (1)
• This is a legitimate question! Parallel computing is complex in every respect!
Why Parallel Computing? (2)
• Other reasons might include:
• Taking advantage of non-local resources - using
available compute resources on a wide area network, or
even the Internet when local compute resources are
scarce.
• Cost savings - using multiple "cheap" computing
resources instead of paying for time on a
supercomputer.
• Overcoming memory constraints - single computers
have very finite memory resources. For large problems,
using the memories of multiple computers may
overcome this obstacle.
Limitations of Serial Computing
• Limits to serial computing - both physical and practical reasons
pose significant constraints to simply building ever faster serial
computers.
• Transmission speeds - the speed of a serial computer is directly
dependent upon how fast data can move through hardware.
Absolute limits are the speed of light (30 cm/nanosecond) and the
transmission limit of copper wire (9 cm/nanosecond). Increasing
speeds necessitate increasing proximity of processing elements.
• Limits to miniaturization - processor technology is allowing an
increasing number of transistors to be placed on a chip. However,
even with molecular or atomic-level components, a limit will be
reached on how small components can be.
• Economic limitations - it is increasingly expensive to make a single
processor faster. Using a larger number of moderately fast
commodity processors to achieve the same (or better) performance
is less expensive.
The future
• During the past 10 years, the trends indicated by
ever faster networks, distributed systems, and multi-
processor computer architectures (even at the
desktop level) clearly show that parallelism is the
future of computing.
• It will take multiple forms, mixing general-purpose solutions (your PC …) and very specialized solutions such as the IBM Cell, ClearSpeed, and GPGPU from NVIDIA …
Who and What? (1)
• Top500.org provides statistics on parallel computing
users - the charts below are just a sample. Some
things to note:
• Sectors may overlap - for example, research may be
classified research. Respondents have to choose between
the two.
• "Not Specified" is by far the largest application -
probably means multiple applications.
Who and What? (2)
(Charts: Top500.org statistics on user sectors and application areas.)
1.2 Parallel Platforms
Von Neumann Architecture
• For over 40 years, virtually all computers have followed a common machine model known as the von Neumann computer, named after the Hungarian mathematician John von Neumann.
Basic Design
• Basic design
• Memory is used to store both program instructions and data
• Program instructions are coded data which tell the computer to do something
• Data is simply information to be used by the program
• A central processing unit (CPU) gets instructions and/or data from memory, decodes the instructions, and then sequentially performs them (a toy sketch of this fetch-decode-execute cycle follows below).
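• To make the fetch-decode-execute cycle concrete, here is a toy, purely illustrative Python sketch (not part of the original slides); the four-instruction set and the program are invented for illustration:

# Toy von Neumann machine: one memory holds both instructions and data;
# the CPU fetches, decodes, and executes one instruction at a time.
memory = {
    0: ("LOAD", 100),    # acc <- mem[100]
    1: ("ADD", 101),     # acc <- acc + mem[101]
    2: ("STORE", 102),   # mem[102] <- acc
    3: ("HALT", None),
    100: 2, 101: 3, 102: 0,    # data lives in the same memory as the program
}

pc, acc = 0, 0                  # program counter and accumulator
while True:
    op, arg = memory[pc]        # fetch and decode
    pc += 1
    if op == "LOAD":
        acc = memory[arg]
    elif op == "ADD":
        acc += memory[arg]
    elif op == "STORE":
        memory[arg] = acc
    elif op == "HALT":
        break
print(memory[102])              # prints 5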
Flynn's Classical Taxonomy
• There are different ways to classify parallel
computers. One of the more widely used
classifications, in use since 1966, is called Flynn's
Taxonomy.
• Flynn's taxonomy distinguishes multi-processor
computer architectures according to how they can
be classified along the two independent dimensions
of Instruction and Data. Each of these dimensions
can have only one of two possible states: Single or
Multiple.
Flynn Matrix
• The matrix below defines the four possible classifications according to Flynn (the original figure is reconstructed as text below).
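• The four classes, arranged along the two dimensions (reconstructed from the classes described on the following slides):

                         Single Data    Multiple Data
  Single Instruction        SISD            SIMD
  Multiple Instruction      MISD            MIMD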
Single Instruction, Single Data (SISD)
• A serial (non-parallel) computer
• Single instruction: only one instruction
stream is being acted on by the CPU during
any one clock cycle
• Single data: only one data stream is being
used as input during any one clock cycle
• Deterministic execution
• This is the oldest and, until recently, the most prevalent form of computer
• Examples: most PCs, single-CPU workstations and mainframes
Single Instruction, Multiple Data
(SIMD)
• A type of parallel computer
• Single instruction: All processing units execute the same
instruction at any given clock cycle
• Multiple data: Each processing unit can operate on a different
data element
• This type of machine typically has an instruction dispatcher, a very high-bandwidth internal network, and a very large array of very small-capacity processing units.
• Best suited for specialized problems characterized by a high degree of regularity, such as image processing.
• Synchronous (lockstep) and deterministic execution
• Two varieties: Processor Arrays and Vector Pipelines (a data-parallel sketch follows below)
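• A small data-parallel sketch (not from the original slides), assuming the third-party NumPy library is available; one arithmetic operation is applied to every element of large arrays, which most CPUs map onto vector (SIMD) instructions:

# SIMD-style computation: the same instruction applied to many data elements.
# Assumes NumPy is installed (pip install numpy).
import numpy as np

a = np.arange(1_000_000, dtype=np.float32)
b = np.ones_like(a)
c = a * 2.0 + b      # identical multiply/add on every element, in lockstep spirit
print(c[:3])         # [1. 3. 5.]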
Multiple Instruction, Single Data
(MISD)
• A single data stream is fed into multiple processing units.
• Each processing unit operates on the data independently via
independent instruction streams.
• Few actual examples of this class of parallel computer have
ever existed. One is the experimental Carnegie-Mellon
C.mmp computer (1971).
• Some conceivable uses might be:
• multiple frequency filters operating on a single signal
stream
• multiple cryptography algorithms attempting to crack a single coded message (a small sketch of the idea follows below).
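• A small MISD-flavoured sketch (not from the original slides), standard library only; several independent "instruction streams" (filters) operate on the same single data stream:

# One data stream, several independent instruction streams.
from concurrent.futures import ThreadPoolExecutor
import statistics

signal = [0.0, 1.0, 0.5, 0.75, 0.25, 1.0]      # the single data stream

filters = {                                     # independent instruction streams
    "mean":   lambda s: statistics.fmean(s),
    "peak":   lambda s: max(s),
    "energy": lambda s: sum(x * x for x in s),
}

with ThreadPoolExecutor() as pool:
    futures = {name: pool.submit(f, signal) for name, f in filters.items()}
print({name: fut.result() for name, fut in futures.items()})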
Multiple Instruction, Multiple
Data (MIMD)
• Currently, the most common type of parallel computer.
Most modern computers fall into this category.
• Multiple Instruction: every processor may be executing a
different instruction stream
• Multiple Data: every processor may be working with a
different data stream
• Execution can be synchronous or asynchronous,
deterministic or non-deterministic
• Examples: most current supercomputers, networked parallel computer "grids", and multi-processor SMP computers, including some types of PCs (a small sketch follows below).
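• A small MIMD-flavoured sketch (not from the original slides), standard library only; each worker runs a different instruction stream on a different data stream:

# Different instruction streams on different data streams.
from multiprocessing import Pool

def count_words(text):            # one instruction stream, one data stream
    return len(text.split())

def sum_squares(numbers):         # a different instruction stream and data stream
    return sum(n * n for n in numbers)

if __name__ == "__main__":
    with Pool(processes=2) as pool:
        words = pool.apply_async(count_words, ("parallel computers are fun",))
        squares = pool.apply_async(sum_squares, ([1, 2, 3, 4],))
        print(words.get(), squares.get())    # 4 30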
Some General Parallel Terminology(1)
• Task
• A logically discrete section of computational work. A task is
typically a program or program-like set of instructions that is
executed by a processor.
• Parallel Task
• A task that can be executed by multiple processors safely
(yields correct results)
• Serial Execution
• Execution of a program sequentially, one statement at a time. In the simplest sense, this is what happens on a one-processor machine. However, virtually every parallel program has sections that must be executed serially.
Some General Parallel Terminology(2)
• Parallel Execution
• Execution of a program by more than one task, with each task being
able to execute the same or different statement at the same moment in
time.
• Shared Memory
• From a strictly hardware point of view, describes a computer
architecture where all processors have direct (usually bus based) access
to common physical memory. In a programming sense, it describes a
model where parallel tasks all have the same "picture" of memory and
can directly address and access the same logical memory locations
regardless of where the physical memory actually exists.
• Distributed Memory
• In hardware, refers to network-based memory access for physical memory that is not common. As a programming model, tasks can only logically "see" local machine memory and must use communications to access memory on other machines where other tasks are executing (the sketch below contrasts the two models).
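• A minimal, illustrative contrast of the two programming models (not from the original slides), using only the Python standard library: threads share one address space and access the same logical memory location directly, while processes have separate memories and must exchange explicit messages:

# Shared-memory model vs. distributed-memory model, in miniature.
import threading
from multiprocessing import Process, Queue

counter = {"value": 0}            # one logical memory location, shared by all threads
lock = threading.Lock()

def work_shared():
    for _ in range(10_000):
        with lock:                # synchronization guards the shared location
            counter["value"] += 1

def work_distributed(q):
    q.put(sum(range(10_000)))     # send a message; no memory is shared

if __name__ == "__main__":
    threads = [threading.Thread(target=work_shared) for _ in range(4)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    print("shared counter:", counter["value"])        # 40000

    q = Queue()                   # the Queue plays the role of the network
    procs = [Process(target=work_distributed, args=(q,)) for _ in range(4)]
    for p in procs:
        p.start()
    total = sum(q.get() for _ in procs)               # receive one message per process
    for p in procs:
        p.join()
    print("sum of messages:", total)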
Some General Parallel Terminology(3)
• Communications
• Parallel tasks typically need to exchange data. There are
several ways this can be accomplished, such as through a
shared memory bus or over a network, however the actual
event of data exchange is commonly referred to as
communications regardless of the method employed.
• Synchronization
• The coordination of parallel tasks in real time, very often
associated with communications. Often implemented by
establishing a synchronization point within an application
where a task may not proceed further until another task(s)
reaches the same or logically equivalent point.
• Synchronization usually involves waiting by at least one task, and can therefore cause a parallel application's wall-clock execution time to increase (a barrier sketch follows below).
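• A small synchronization-point sketch (not from the original slides), standard library only; no task proceeds past the barrier until every task has reached it, so the faster tasks spend time waiting (parallel overhead):

# Barrier synchronization among four tasks.
import threading, time, random

barrier = threading.Barrier(4)

def phase_worker(tid):
    time.sleep(random.random())        # unequal amounts of "work"
    print(f"task {tid} reached the synchronization point")
    barrier.wait()                     # faster tasks wait here
    print(f"task {tid} continues")

threads = [threading.Thread(target=phase_worker, args=(i,)) for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()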
Some General Parallel Terminology(4)
• Granularity
• In parallel computing, granularity is a qualitative measure of
the ratio of computation to communication.
• Coarse: relatively large amounts of computational work are
done between communication events
• Fine: relatively small amounts of computational work are
done between communication events
• Observed Speedup
• Observed speedup of a code which has been parallelized, defined as:
      speedup = (wall-clock time of serial execution) / (wall-clock time of parallel execution)
• One of the simplest and most widely used indicators of a parallel program's performance.
Some General Parallel Terminology(5)
• Parallel Overhead
• The amount of time required to coordinate parallel tasks,
as opposed to doing useful work. Parallel overhead can
include factors such as:
• Task start-up time
• Synchronizations
• Data communications
• Software overhead imposed by parallel compilers, libraries,
tools, operating system, etc.
• Task termination time
• Massively Parallel
• Refers to the hardware that comprises a given parallel system: one having many processors. The meaning of "many" keeps increasing, but currently IBM's BlueGene/L pushes this number to six digits.
Some General Parallel Terminology(6)
• Scalability
• Refers to a parallel system's (hardware and/or
software) ability to demonstrate a proportionate
increase in parallel speedup with the addition of
more processors. Factors that contribute to
scalability include:
• Hardware, particularly memory-CPU bandwidth and network communications
• Application algorithm
• Parallel overhead
• Characteristics of your specific application and coding
Amdahl's Law
Amdahl's Law states that potential program speedup is defined by the fraction of code (P) that can be parallelized:

      speedup = 1 / (1 - P)
Amdahl's Law
• It soon becomes obvious that there are limits to the scalability of parallelism. For example, at P = .50, .90 and .99 (50%, 90% and 99% of the code is parallelizable):

                      speedup
      N        P = .50    P = .90    P = .99
      10         1.82       5.26       9.17
      100        1.98       9.17      50.25
      1000       1.99       9.91      90.99
      10000      1.99       9.91      99.02
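• These numbers follow from Amdahl's Law written for N processors, speedup = 1 / (P/N + (1 - P)). A small sketch (not from the original slides) that reproduces the table:

# Amdahl's Law with N processors; (1 - P) is the serial fraction.
def amdahl_speedup(p, n):
    return 1.0 / (p / n + (1.0 - p))

print(f"{'N':>6} {'P=.50':>8} {'P=.90':>8} {'P=.99':>8}")
for n in (10, 100, 1000, 10000):
    row = " ".join(f"{amdahl_speedup(p, n):8.2f}" for p in (0.50, 0.90, 0.99))
    print(f"{n:>6} {row}")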
Amdahl's Law
• However, certain problems demonstrate increased performance by increasing the problem size. For example:

      2D grid calculations    85 seconds    85%
      Serial fraction         15 seconds    15%

• We can increase the problem size by doubling the grid dimensions and halving the time step. This results in four times the number of grid points and twice the number of time steps. The timings then look like:

      2D grid calculations   680 seconds   97.84%
      Serial fraction         15 seconds    2.16%

• Problems that increase the percentage of parallel time with their size are more scalable than problems with a fixed percentage of parallel time (the arithmetic is checked in the sketch below).
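• A quick check of those percentages (a sketch, not from the original slides): the parallel fraction grows from 85 / (85 + 15) to 680 / (680 + 15) as the problem scales, while the serial part stays fixed:

# Scaled problem size: the parallel work grows 8x (4x grid points, 2x time steps),
# while the serial part stays at 15 seconds.
parallel, serial = 85.0, 15.0
print(f"original parallel fraction: {parallel / (parallel + serial):.2%}")   # 85.00%

scaled_parallel = parallel * 4 * 2    # 680 seconds
print(f"scaled parallel fraction:   {scaled_parallel / (scaled_parallel + serial):.2%}")  # 97.84%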
Memory architectures
• Shared Memory
• Distributed Memory
• Hybrid Distributed-Shared Memory
Shared Memory
• Shared memory parallel computers vary widely, but generally have in
common the ability for all processors to access all memory as global
address space.
• Multiple processors can operate independently but share the same memory
resources.
• Changes in a memory location effected by one processor are visible to all
other processors.
• Shared memory machines can be divided into two main classes based upon
memory access times: UMA and NUMA.
Shared Memory : UMA vs. NUMA
• Uniform Memory Access (UMA):
• Most commonly represented today by Symmetric
Multiprocessor (SMP) machines
• Identical processors
• Equal access and access times to memory
• Sometimes called CC-UMA - Cache Coherent UMA. Cache
coherent means if one processor updates a location in shared
memory, all the other processors know about the update. Cache
coherency is accomplished at the hardware level.
• Non-Uniform Memory Access (NUMA):
• Often made by physically linking two or more SMPs
• One SMP can directly access memory of another SMP
• Not all processors have equal access time to all memories
• Memory access across link is slower
• If cache coherency is maintained, then may also be called CC-
NUMA - Cache Coherent NUMA
Shared Memory: Pro and Con
• Advantages
• Global address space provides a user-friendly programming
perspective to memory
• Data sharing between tasks is both fast and uniform due to the
proximity of memory to CPUs
• Disadvantages:
• Primary disadvantage is the lack of scalability between memory and CPUs. Adding more CPUs can geometrically increase traffic on the shared memory-CPU path and, for cache coherent systems, geometrically increase traffic associated with cache/memory management.
• The programmer is responsible for synchronization constructs that ensure "correct" access of global memory.
• Expense: it becomes increasingly difficult and expensive to design and produce shared memory machines with ever increasing numbers of processors.
Distributed Memory
• Like shared memory systems, distributed memory systems vary widely but
share a common characteristic. Distributed memory systems require a
communication network to connect inter-processor memory.
• Processors have their own local memory. Memory addresses in one
processor do not map to another processor, so there is no concept of global
address space across all processors.
• Because each processor has its own local memory, it operates
independently. Changes it makes to its local memory have no effect on the
memory of other processors. Hence, the concept of cache coherency does
not apply.
• When a processor needs access to data in another processor, it is usually
the task of the programmer to explicitly define how and when data is
communicated. Synchronization between tasks is likewise the
programmer's responsibility.
• The network "fabric" used for data transfer varies widely; it can be as simple as Ethernet (a message-passing sketch follows below).
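• A minimal message-passing sketch of this responsibility (not from the original slides), assuming the third-party mpi4py package and an MPI runtime are available (e.g. run with mpiexec -n 2 python script.py); the programmer explicitly decides how and when data moves between the separate memories:

# Explicit communication between processors that have separate memories.
from mpi4py import MPI

comm = MPI.COMM_WORLD
rank = comm.Get_rank()

if rank == 0:
    local_data = {"grid_chunk": [1, 2, 3]}     # exists only in rank 0's memory
    comm.send(local_data, dest=1, tag=0)       # programmer-defined communication
elif rank == 1:
    received = comm.recv(source=0, tag=0)      # rank 1 cannot "see" rank 0's memory
    print("rank 1 received:", received)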
Distributed Memory: Pro and Con
• Advantages
• Memory is scalable with number of processors. Increase the
number of processors and the size of memory increases
proportionately.
• Each processor can rapidly access its own memory without
interference and without the overhead incurred with trying to
maintain cache coherency.
• Cost effectiveness: can use commodity, off-the-shelf processors
and networking.
• Disadvantages
• The programmer is responsible for many of the details
associated with data communication between processors.
• It may be difficult to map existing data structures, based on
global memory, to this memory organization.
• Non-uniform memory access (NUMA) times
Comparison of Shared and Distributed Memory Architectures

Architecture      CC-UMA                         CC-NUMA                            Distributed
Examples          SMPs, Sun Vexx, DEC/Compaq,    Bull NovaScale, SGI Origin,        Cray T3E, Maspar,
                  SGI Challenge, IBM POWER3      HP Exemplar, DEC/Compaq,           IBM SP2, IBM BlueGene
                                                 IBM POWER4 (MCM)
Communications    MPI, Threads, OpenMP, shmem    MPI, Threads, OpenMP, shmem        MPI
Scalability       to 10s of processors           to 100s of processors              to 1000s of processors
Drawbacks         Memory-CPU bandwidth           Memory-CPU bandwidth,              System administration;
                                                 non-uniform access times           programming is hard to
                                                                                    develop and maintain
Software          many 1000s of ISVs             many 1000s of ISVs                 100s of ISVs
availability
Hybrid Distributed-Shared Memory
• The largest and fastest computers in the world today employ
both shared and distributed memory architectures.
• The shared memory component is usually a cache coherent
SMP machine. Processors on a given SMP can address that
machine's memory as global.
• The distributed memory component is the networking of
multiple SMPs. SMPs know only about their own memory -
not the memory on another SMP. Therefore, network
communications are required to move data from one SMP
to another.
• Current trends seem to indicate that this type of memory
architecture will continue to prevail and increase at the high
end of computing for the foreseeable future.
• Advantages and Disadvantages: whatever is common to
both shared and distributed memory architectures.
Some Types of Parallel Computer
• Multi-core processor
• Symmetric multiprocessing
• Distributed computing
Multi-core processor
• Multiple cores on a single chip
• Supports multi-threaded execution
Symmetric multiprocessing
• Identical CPUs
• Shared memory
• Each CPU executes a different task
• Requires a high-speed bus
Distributed computing
• Distributed memory system
• CPUs are linked over the network
• Some types:
• Cluster computing.
• Massively parallel processing.
• Grid computing.
Thank you for your attention!