INTRODUCTION TO PARALLEL COMPUTING
B S RAMANJANEYULU
System Software Development Group,
CDAC, Bangalore.
Presentation Outline
• Need for Parallel Computing
• Requirements of Parallel Computing
• Parallel Computing Terminology
• Parallel computer architectures
• Designing parallel algorithms
• Architectural taxonomy (SISD, SIMD, MISD and MIMD)
• Symmetric multiprocessing (SMP)
• Clusters
• Parallel programming models
How to Run Applications Faster?
There are 3 ways to improve performance:
• Work Harder
• Work Smarter
• Get Help (multiple workers)
Computer Analogy
• Use faster hardware: e.g., reduce the time per instruction
• Use optimized algorithms and techniques
• Use multiple computers to solve the problem
Sequential vs. Parallel
(Figure: sequential execution vs. parallel execution of the same job)
Sequential vs. Parallel (Contd…)
Traditional sequential programs execute one instruction at a time, using one processor.
Parallelism means executing tasks simultaneously (on multiple processors) to complete the job faster.
Parallelism is achieved by:
− Breaking up the job into smaller tasks
− Assigning the smaller tasks to multiple workers (processors) to work on simultaneously
− Coordinating the workers (processors)
Parallel problem solving is natural. Examples: building construction; automobile manufacturing.
The Need For Faster Machines
Grand Challenge Problems:
Climate Modeling
Computational Fluid Dynamics
Combustion Systems
Human Genome
Structural Mechanics
Molecular Modeling
Astrophysical Calculations
Seismic Data Processing
Data Parallelism
In data parallelism, each CPU performs the same task on its own portion of the data.
Example:
  if (cpu == 1) then
     start = 1
     finish = 50
  else if (cpu == 2) then
     start = 51
     finish = 100
  end if
  do i = start, finish
     task on d(i)
  end do
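A minimal sketch of the same decomposition in C with MPI (an illustrative addition, not from the original slides; the array size, data values, and the task_on routine are assumptions):

  #include <mpi.h>
  #include <stdio.h>

  #define N 100

  /* Illustrative stand-in for "task on d(i)". */
  static void task_on(double *d, int i) { d[i] = 2.0 * d[i]; }

  int main(int argc, char **argv) {
      double d[N];
      int rank, size;

      MPI_Init(&argc, &argv);
      MPI_Comm_rank(MPI_COMM_WORLD, &rank);
      MPI_Comm_size(MPI_COMM_WORLD, &size);

      for (int i = 0; i < N; i++)   /* illustrative data */
          d[i] = (double)i;

      /* Block decomposition: each rank owns a contiguous slice. */
      int chunk = N / size;
      int start = rank * chunk;
      int end   = (rank == size - 1) ? N : start + chunk;

      for (int i = start; i < end; i++)
          task_on(d, i);

      MPI_Finalize();
      return 0;
  }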
Task Parallelism
Multiple tasks executing concurrently on different CPUs is called task parallelism: each CPU executes a separate code block simultaneously.
Example:
  if (cpu == 1) then
     do "Task 1"
  else if (cpu == 2) then
     do "Task 2"
  end if
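A minimal C/OpenMP sketch of the same idea (an illustrative addition, not from the original slides; task_1 and task_2 are placeholder routines):

  #include <omp.h>
  #include <stdio.h>

  static void task_1(void) { printf("Task 1 on thread %d\n", omp_get_thread_num()); }
  static void task_2(void) { printf("Task 2 on thread %d\n", omp_get_thread_num()); }

  int main(void) {
      /* Each section is handed to a different thread, so the
         two tasks can execute simultaneously. */
      #pragma omp parallel sections
      {
          #pragma omp section
          task_1();
          #pragma omp section
          task_2();
      }
      return 0;
  }

Compile with an OpenMP-capable compiler, e.g. gcc -fopenmp.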
Definition
From the computer architecture point of view, a parallel computer is a "collection of processing elements that communicate and co-operate to solve large problems fast".
When this architecture is combined with a parallel algorithm, we get a 'parallel computing system'.
Sequential vs. Parallel Computing
(Figure: a sequential computer alternates fetch/store and compute; in a parallel computer each processor fetches/stores, computes, and also communicates with the other processors.)
Execution Time
• Sequential system
  – Execution time is a function of the input size
• Parallel system
  – Execution time is a function of the input size and the number of processors used
Terminology of Parallel Computing
Speedup: the speedup S(p) is defined as the ratio of the runtime of the best sequential algorithm for solving a problem to the time taken by the parallel algorithm to solve the same problem on p processors:
    S(p) = T(sequential) / T(parallel)
The p processors used by the parallel algorithm are assumed to be identical to the one used by the sequential algorithm.
Efficiency: the ratio of speedup to the number of processors:
    E = S(p) / p
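For example (illustrative numbers): a job that takes 100 s sequentially and 25 s on 8 processors has speedup S(8) = 100/25 = 4 and efficiency E = 4/8 = 0.5.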
Terminology of Parallel Computing (Contd…)
Throughput (in FLOPS): obtained by taking the clock rate of the given system and dividing it by the number of clock cycles a floating-point instruction requires.
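For example (illustrative numbers): a 1 GHz processor that needs 2 clock cycles per floating-point instruction delivers 10^9 / 2 = 500 MFLOPS.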
Cost: the cost of solving a problem on a parallel system is the product of the parallel runtime and the number of processors used:
    C = p × T(parallel)
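For example (illustrative numbers): a run that takes T(parallel) = 10 s on p = 4 processors has cost C = 40 processor-seconds.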
Requirements for Parallel Computing
Multiple processors
(The workers)
Network
(Link between workers)
OS support
Requirements for Parallel Computing (Contd…)
Parallel Programming Paradigms
Message Passing (MPI, PVM)
Data Parallel (Fortran 90 / High Performance Fortran)
Multi-Threading
Hybrid
Others (OpenMP, shmem)
Decomposition of the problem into pieces that multiple
workers can perform.
Issues in Parallel Computing
• Parallel computer architectures
• Efficient parallel algorithms
• Parallel programming models
• Parallel computer languages
• Methods for evaluating parallel algorithms
• Parallel programming tools
Designing Parallel Algorithms
Detect and exploit any inherent parallelism in an existing sequential algorithm
Invent a new parallel algorithm
Adapt another parallel algorithm that solves a similar problem
Decomposition Techniques
The process of splitting the computations in a problem into a set of concurrent tasks is referred to as decomposition.
Decomposing a problem effectively is of paramount importance in parallel computing:
− Without a good decomposition, we may not be able to achieve a high degree of concurrency.
− The decomposition must also ensure good load balance.
Decomposition Techniques (Contd…)
What is meant by a good decomposition?
− It should lead to a high degree of concurrency (fine granularity).
− The interaction among tasks should be as little as possible (coarse granularity).
The ratio of computation to communication is known as granularity.
Success Depends on the Combination of:
• Architecture
• Compiler
• Choice of the right algorithm
• Portability, maintainability, and efficient implementation
Architectural Taxonomy
Flynn's taxonomy uses the relationship of program instructions
to program data. The four categories are:
SISD – Single Instruction, Single Data Stream
SIMD – Single Instruction, Multiple Data Stream
MISD – Multiple Instruction, Single Data Stream (no practical examples)
MIMD – Multiple Instruction, Multiple Data Stream
SISD Model features
Not a parallel computer
Conventional serial, scalar von Neumann computer
A single instruction is issued in each clock cycle
Each instruction operates on a single (scalar) data element
Performance measured in MIPS
Examples: most PCs and single-CPU workstations
SIMD Model features
Also von Neumann architectures, but with more powerful instructions
Each instruction may operate on more than one data element
Usually an intermediate host executes the program logic and broadcasts instructions to the other processors
Examples: array processors and vector processors (used in the supercomputers of the 1970s and 80s)
MIMD Model features
Parallelism achieved by connecting multiple processors together
Each processor executes its own instruction stream, on its own data stream, independently of the other processors
Advantages
  Processors can execute multiple job streams simultaneously
  Each processor can perform any operation regardless of what the other processors are doing
Disadvantages
  Load-balancing overhead – synchronization is needed to coordinate processors at the end of a parallel structure in a single application
  Can be difficult to program
MIMD Block Diagram
MIMD Classification
Parallel Computer Architecture Memory Models
(Figure: shared-memory, distributed-memory, and hybrid memory organizations)
Symmetric Multiprocessors (SMP)
Symmetric Multiprocessors (SMP)
(Contd…)
• Uses commodity microprocessors with on-chip and off-chip caches.
• Processors are connected to a shared memory through a high-speed bus.
• Single address space.
• Easy application development.
• Difficult to scale.
• Difficult to repair/replace a faulty node (when compared to clusters).
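Because all processors share a single address space, loops can be parallelized with threads. A minimal C/OpenMP sketch (an illustrative addition, not from the original slides):

  #include <omp.h>
  #include <stdio.h>

  #define N 1000000

  int main(void) {
      static double a[N];

      /* Every thread sees the same shared array, so each can
         fill in its own chunk of iterations directly. */
      #pragma omp parallel for
      for (int i = 0; i < N; i++)
          a[i] = 2.0 * i;

      printf("a[N-1] = %f\n", a[N - 1]);
      return 0;
  }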
SMP, MPP and clusters
Competing Architectures
• Massively Parallel Processors (MPP) – proprietary systems built for specific purposes
  – high cost and a low performance/price ratio
• Symmetric Multiprocessors (SMP)
  – suffer from limited scalability
• Distributed Systems
  – difficult to extract high performance
• Clusters
  – High Performance Computing – with commodity processors
  – High Availability Computing – for critical applications
What is a Cluster?
A cluster is a type of parallel or distributed processing system consisting of a collection of interconnected stand-alone/complete computers that cooperatively work together as a single, integrated computing resource.
A typical cluster has:
• A faster, closer connection network than a typical LAN
• Low-latency communication protocols
• Looser coupling than an SMP
Motivation for using Clusters
The communications bandwidth between
workstations is increasing as new networking
technologies and protocols are implemented in
LANs and WANs.
Workstation clusters are easier to integrate into
existing networks than special parallel computers.
Cluster Computer Architecture
Components of Cluster Computers
• Multiple High Performance Computers
– PCs
– Workstations
– SMPs
• State-of-the-art Operating Systems
– Layered
– Micro-kernel based
• High Performance Networks/Switches
– Gigabit Ethernet
– PARAMNet
– Myrinet
• Network Interface Cards (NICs)
• Fast Communication Protocols and Services
– Active Messages (AM)
– Virtual Interface Architecture (VIA)
Components of Cluster Computers (Contd…)
• Parallel Programming Environments and Tools
– Compilers
– PVM [Parallel Virtual Machine]
– MPI [Message Passing Interface]
• Applications
– Sequential
– Parallel or Distributed
Parallel programming models -- MPI, PVM and OpenMP
• MPI – Message Passing Interface
• PVM – Parallel Virtual Machine
• Both MPI and PVM are based on the message-passing mechanism.
• Both MPI and PVM can be used with shared-memory and distributed-memory architectures.
• MPI
  – MPI is mainly for data-parallel problems.
  – Collective and asynchronous operations are more powerful in MPI.
• OpenMP – Open Multiprocessing
  – OpenMP is thread-based multiprocessing.
  – OpenMP is more suitable for SMP systems.
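As a minimal illustration of the message-passing model (an addition, not from the original slides), a C/MPI program in which every rank sends its id to rank 0:

  #include <mpi.h>
  #include <stdio.h>

  int main(int argc, char **argv) {
      int rank, size;

      MPI_Init(&argc, &argv);                /* start the MPI runtime */
      MPI_Comm_rank(MPI_COMM_WORLD, &rank);  /* this process's id     */
      MPI_Comm_size(MPI_COMM_WORLD, &size);  /* total number of ranks */

      if (rank == 0) {
          /* Rank 0 collects a message from every other rank. */
          for (int src = 1; src < size; src++) {
              int value;
              MPI_Recv(&value, 1, MPI_INT, src, 0, MPI_COMM_WORLD,
                       MPI_STATUS_IGNORE);
              printf("rank 0 received %d from rank %d\n", value, src);
          }
      } else {
          MPI_Send(&rank, 1, MPI_INT, 0, 0, MPI_COMM_WORLD);
      }

      MPI_Finalize();
      return 0;
  }

Compile with mpicc and launch with, e.g., mpirun -np 4 ./a.out (assuming an MPI installation such as MPICH or Open MPI).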
Features of CDAC’s PARAM Supercomputers
Distributed memory at the system level and shared memory at the node level.
Nodes connected by low-latency, high-throughput System Area Networks: PARAMNet and Fast/Gigabit Ethernet.
Standard Message Passing Interface (MPI) implementations: SUN MPI, IBM MPI, public-domain MPI, and C-DAC's own MPI (CMPI).
C-DAC's High Performance Computing and Communication Software (HPCC) for parallel program development and run-time support.
References
• http://www.llnl.gov/computing/tutorials/parallel_comp/
• Tutorials located in the Maui High Performance
Computing Center's "SP Parallel Programming
Workshop".
• Linux Parallel Processing HOWTO: http://www.tldp.org/HOWTO/Parallel-Processing-HOWTO.html
Thank you.