Notes 03
1 Overview
In the last lecture, we started introducing the various models of parallel computation. We first
covered the shared-memory model, where each processor is connected to a common global memory.
There are multiple forms of the shared-memory model, distinguished mainly by how they handle
the situation where two or more processors attempt to read from or write to the same memory
location at the same time. The next model we covered was the local-memory model, where each
processor has its own private memory and is connected to an interconnection network that allows
the processors to communicate with one another. We then looked at the modular-memory model,
where each processor is connected to an interconnection network, which is in turn connected to
$k$ memory modules; all processors can access each of the $k$ memory modules. Lastly, we looked
at mixed models, such as the parallel external memory (PEM) model and the GPU architecture.
We finished up the last lecture by examining the different types of interconnection networks, such
as Ethernet, ring, and 2D mesh.
In this lecture, we will first introduce the circuit model. Then, we will look at how the circuit
model is related to parallel models.
2 Circuit Model
The circuit model is composed of gates; each gate has a constant fan-in and executes a primitive
operation. Some examples of primitive operations of a circuit are addition, subtraction, multiplication,
and Boolean operations. Each gate works in parallel and starts its execution once all of its
inputs are ready. Hence, we can combine gates to compute more complex functions [BM04]. The
formal definition of a circuit is as follows:
Definition 1. A circuit for a particular problem is a family of directed acyclic graphs (DAGs), one
graph per input size, where each node is a primitive operation and each edge is a dependency
between operations [JaJ92].
Therefore, in the circuit model, the runtime is equal to the depth of the circuit, which is the length
of the longest directed path in the DAG, and the work is equal to the size of the circuit, which is
the number of nodes in the DAG [BM04]. Notice that there will be at most $kn$ edges in the DAG,
where $n$ is the number of nodes in the DAG and $k$ is the maximum fan-in of the gates. Thus, if
$k = O(1)$, the number of edges in the DAG is $\Theta(n)$.
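To make these definitions concrete, here is a minimal sketch in Python (the example circuit, the
node names, and the depth_and_size helper are illustrative assumptions, not part of the lecture;
inputs are modeled as zero-depth source nodes) that computes the work and runtime of a circuit
given as a DAG:

\begin{verbatim}
from functools import lru_cache

# Hypothetical circuit for (x + y) + (y * z), given as a DAG:
# each node maps to the nodes that depend on its output.
circuit = {
    "x": ["a"], "y": ["a", "b"], "z": ["b"],
    "a": ["out"],   # a = x + y
    "b": ["out"],   # b = y * z
    "out": [],      # out = a + b
}

def depth_and_size(dag):
    """Return (depth, size): the length of the longest directed path
    (the circuit's runtime) and the number of nodes (its work)."""
    @lru_cache(maxsize=None)
    def longest_from(node):
        # Longest path, counted in edges, starting at `node`.
        succs = dag[node]
        return 1 + max(longest_from(s) for s in succs) if succs else 0
    return max(longest_from(v) for v in dag), len(dag)

print(depth_and_size(circuit))  # (2, 6): runtime 2, work 6
\end{verbatim}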
2.1 Relation to Parallel Models
Theorem 2 (Brent's Theorem). A circuit for a particular problem with time $T(n)$ and work $W(n)$
can be simulated on a CREW PRAM with $p$ processors in time
$$t(n) \le \frac{W(n)}{p} + T(n).$$
Proof. We simulate the circuit level by level. Let level $l$ be the set of gates $g_1, \dots, g_{n_l}$
whose inputs are all computed at earlier levels, where each $g_i$ is a gate, and let $n_l$ be the
number of gates at level $l$. It will take $\lceil n_l / p \rceil$ time to simulate level $l$ using a
$p$-processor CREW PRAM (note: we need concurrent reads in order to handle cases where two or
more gates all use a common input). By definition, we know that the number of levels in the circuit
is $T(n)$, hence we have:
$$t(n) = \sum_{l=1}^{T(n)} \left\lceil \frac{n_l}{p} \right\rceil \le \sum_{l=1}^{T(n)} \left( \frac{n_l}{p} + 1 \right) = \frac{1}{p} \sum_{l=1}^{T(n)} n_l + T(n).$$
By definition, we have that $\sum_{l=1}^{T(n)} n_l = W(n)$, therefore:
$$t(n) \le \frac{1}{p} \sum_{l=1}^{T(n)} n_l + T(n) = \frac{W(n)}{p} + T(n).$$
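The counting argument above is easy to check numerically. The following minimal sketch (the
level sizes and the simulate_time helper are assumptions made for illustration) computes the
simulated time $\sum_{l=1}^{T(n)} \lceil n_l/p \rceil$ for a levelized circuit and verifies it against
the bound $W(n)/p + T(n)$:

\begin{verbatim}
import math

def simulate_time(level_sizes, p):
    # Each level l with n_l gates takes ceil(n_l / p) parallel steps
    # on a p-processor CREW PRAM.
    return sum(math.ceil(n_l / p) for n_l in level_sizes)

# Hypothetical circuit with T(n) = 4 levels and W(n) = 14 gates.
levels = [8, 3, 2, 1]
W, T = sum(levels), len(levels)

for p in (1, 2, 4, 8):
    t = simulate_time(levels, p)
    assert t <= W / p + T
    print(f"p={p}: t(n)={t} <= W/p + T = {W / p + T:.2f}")
\end{verbatim}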
This simulation also gives a lower bound on circuit size: letting $T_A(n)$ denote the runtime of the
fastest sequential algorithm for the problem, no circuit for the problem can have size smaller than
$T_A(n)$.

Proof. We will prove by contradiction. Assume that there exists a circuit with size $W_c(n)$ such
that $W_c(n) < T_A(n)$. By Brent's Theorem, for $p = 1$ the simulation evaluates each gate exactly
once, so we have:
$$t(n) = \sum_{l=1}^{T(n)} \left\lceil \frac{n_l}{1} \right\rceil = W_c(n) < T_A(n).$$
This is a sequential algorithm that is faster than the fastest sequential algorithm, a contradiction.
Therefore, we can never have a circuit smaller than the fastest sequential algorithm's runtime. For
example, summing $n$ numbers takes $T_A(n) = \Theta(n)$ sequentially, so any circuit for it must
have $\Omega(n)$ gates.
We know that, given a circuit, we can create a CREW PRAM algorithm by converting each edge into
a memory read and each gate into an operation. Similarly, a $p$-processor CREW PRAM algorithm
with work $W(n)$ and time $T(n)$ can be converted into a circuit with $W(n)$ gates and depth $T(n)$
by converting each memory read into an incoming edge, each memory write into an outgoing edge,
and each operation into a gate.
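A minimal sketch of the PRAM-to-circuit direction follows (the program encoding and the
pram_to_circuit helper are assumptions made for illustration; it handles only straight-line
programs with no control flow):

\begin{verbatim}
# Hypothetical straight-line PRAM program: at each time step, every
# processor performs one operation (dest, op, operands). We convert it
# into a circuit DAG: one gate per operation, one edge per memory read.
program = [
    [("a", "+", ("x", "y")), ("b", "*", ("y", "z"))],  # step 1 (2 procs)
    [("out", "+", ("a", "b"))],                        # step 2
]

def pram_to_circuit(program, inputs):
    last_writer = {v: v for v in inputs}  # inputs act as source nodes
    gates, edges = [], []
    for step in program:
        for dest, op, operands in step:
            gate = f"{dest}:{op}"
            gates.append(gate)
            for v in operands:             # each memory read -> an edge
                edges.append((last_writer[v], gate))
        for dest, op, _ in step:           # writes visible at next step
            last_writer[dest] = f"{dest}:{op}"
    return gates, edges

gates, edges = pram_to_circuit(program, inputs={"x", "y", "z"})
print(len(gates), "gates (work); depth = number of steps =", len(program))
\end{verbatim}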
3 Conclusion
In conclusion, we now know that a $p$-processor CREW PRAM algorithm with work $W(n)$ and
time $T(n)$ can be converted into a CREW PRAM algorithm with $p' < p$ processors that has
work $W(n)$ and time $O\left(\frac{W(n)}{p'} + T(n)\right)$. In other words, if we design a CREW
PRAM algorithm with $p$ processors, then we can always scale the algorithm down to $p' < p$
processors. Therefore, when designing parallel algorithms, we want to design for the largest
possible $p$.
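As a concrete illustration (the particular work and time functions are assumed for this example,
not taken from the lecture): suppose we design an algorithm for $p = n$ processors with work
$W(n) = O(n)$ and time $T(n) = O(\log n)$. Scaling down to $p' = n/\log n$ processors still gives time
$$O\!\left(\frac{W(n)}{p'} + T(n)\right) = O\!\left(\frac{n}{n/\log n} + \log n\right) = O(\log n),$$
so the smaller machine matches the asymptotic runtime while doing the same work.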
References