Intro HPC IITK
High Performance Computing
and its Applications
Ashish P. Kuvelkar
Senior Director (HPC- Tech)
C-DAC, Pune
• Introduction to HPC
• Architecting an HPC system
• Approach to Parallelization
• Parallelization Paradigm
• Applications in area of Science and Engineering
Definition 2 (Wikipedia)
• High Performance Computing (HPC) uses
Supercomputers and Computer Clusters to solve
advanced computation problems.
[Figure: HPC system architecture showing a networking gateway, switch fabric, compute nodes, accelerated compute nodes, storage, parallel file system, HSM/backup server, tape library/backup storage, and local network]
• Superscalar implementation
• Multicore architecture
[Figure: superscalar processor block diagram, labeled "Multiple Instructions", showing a cache/memory unit, fetch unit, decode/issue unit, three execution units (EU), and a register file]
• Multiple execution units
• Sequential instructions, multiple issue
© Centre for Development of Advanced Computing
Multicore Processor
• Single computing component with two or more
independent processing units
• Each unit is called a core, which reads and executes
program instructions
Source: Wikipedia.
• Implicit Parallelism
• Done by the compiler and runtime system
• Explicit Parallelism
• Done by the programmer
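As a minimal sketch of explicit parallelism, the Python snippet below has the programmer split a sum of squares into chunks and hand each chunk to a worker. The names `chunk_sum` and `parallel_sum_of_squares`, and the choice of `concurrent.futures`, are illustrative assumptions and not part of the original slides. In CPython, threads mainly show the structure of the decomposition; they are not expected to speed up CPU-bound work.

```python
from concurrent.futures import ThreadPoolExecutor


def chunk_sum(bounds):
    """Sum of squares over the half-open range [start, stop)."""
    start, stop = bounds
    return sum(i * i for i in range(start, stop))


def parallel_sum_of_squares(n, workers=4):
    """Explicitly decompose range(n) into chunks, roughly one per worker."""
    # Ceiling division, kept at >= 1 so range() never gets a zero step.
    step = max(1, (n + workers - 1) // workers)
    chunks = [(lo, min(lo + step, n)) for lo in range(0, n, step)]
    # The programmer, not the compiler, decides how work is divided
    # and where the partial results are combined.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return sum(pool.map(chunk_sum, chunks))
```

With implicit parallelism, by contrast, the programmer would write only the sequential loop and leave any division of work to the compiler or runtime.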
Advantages :
• Data sharing between threads is fast and uniform
• Global address space provides user friendly
programming
Disadvantages :
• Lack of scalability between memory and CPUs
• Programmer is responsible for specifying
synchronization, e.g. locks
• Expensive
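The points above appear to describe a shared-memory model. The following Python sketch, under that assumption, shows several threads updating one shared counter, with the programmer supplying the synchronization through a lock. The function name `locked_count` and its parameters are hypothetical, chosen only for illustration.

```python
import threading


def locked_count(num_threads=4, increments=10_000):
    """Increment a shared counter from several threads, guarded by a lock."""
    counter = {"value": 0}      # shared data, visible to every thread
    lock = threading.Lock()     # synchronization specified by the programmer

    def worker():
        for _ in range(increments):
            # Without the lock, the read-modify-write below could interleave
            # between threads and lose updates.
            with lock:
                counter["value"] += 1

    threads = [threading.Thread(target=worker) for _ in range(num_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter["value"]
```

Sharing is convenient here because every thread sees the same `counter`, but correctness depends entirely on the programmer remembering the lock.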
Message Passing Model
• Asynchronous parallelism
• Explicit interaction
• Phase parallel
• Divide and conquer
• Pipeline
• Process farm
• Domain Decomposition
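To make two of these paradigms concrete, here is a minimal sketch of a process farm combined with a simple domain decomposition, written in a message-passing style. It is an illustration under stated assumptions: Python threads and `queue.Queue` stand in for separate processes and a message-passing library such as MPI, and the names `farm_sum`, `tasks`, and `results` are hypothetical.

```python
import queue
import threading


def farm_sum(n, num_workers=3, chunk=100):
    """Sum range(n) with a master/worker farm that communicates via messages."""
    tasks = queue.Queue()     # master -> workers: (lo, hi) sub-ranges
    results = queue.Queue()   # workers -> master: partial sums

    def worker():
        while True:
            msg = tasks.get()
            if msg is None:   # sentinel message: no more work
                break
            lo, hi = msg
            results.put(sum(range(lo, hi)))

    workers = [threading.Thread(target=worker) for _ in range(num_workers)]
    for w in workers:
        w.start()

    # Domain decomposition: split the index range into fixed-size pieces.
    num_tasks = 0
    for lo in range(0, n, chunk):
        tasks.put((lo, min(lo + chunk, n)))
        num_tasks += 1
    for _ in workers:
        tasks.put(None)       # one sentinel per worker

    # The master combines exactly one partial result per task.
    total = sum(results.get() for _ in range(num_tasks))
    for w in workers:
        w.join()
    return total
```

The workers share no data other than the two queues, so all interaction is explicit: a worker learns about its sub-range only by receiving a message and reports back only by sending one.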
• Concurrency
• Ability to perform many actions simultaneously
• Scalability
• Resilience to increasing processor counts
• Data Locality
• High ratio of local memory accesses to remote
memory accesses (through communication)
• Modularity:
• Decomposition of complex entities into simpler
components
Massive processing power introduces an I/O challenge
• Getting data to and from the processing units can take as long
as the processing itself
• Requires careful software design and deep understanding of
algorithms and architecture of
• Processors (cache effects, memory bandwidth)
• GPU accelerators
• Interconnects (Ethernet, IB, 10 Gigabit Ethernet)
• Storage (local disks, NFS, parallel file systems)
Application Areas of HPC in
Science & Engineering
Nuclear Science
• Applications in Nuclear Power, Nuclear
Medicine (cancer etc.), Defence
Nano Science
• Applications in Semiconductor Physics,
Microfabrication, Molecular Biology,
Exploration of New Materials
DreamWorks Animation
SKG produces all its animated
movies using HPC graphic
technology