GPU Architecture and Programming
GPU Architecture and Programming
UNIT- V (9Hrs)
Parallel Programming Languages: Distributed Memory Programming with MPI: trapezoidal rule in MPI,
I/O handling, MPI derived datatype, Collective Communication, Shared Memory Programming with
Pthreads: Conditional Variables, read-write locks, Cache handling, Shared memory programming with CO5
Open MP: Parallel for directives, scheduling loops, Thread Safety, CUDA: Parallel programming in CUDA
C, Thread management, Constant memory and Event, Graphics Interoperability, Atomics, Streams.
Text Books
1. Benedict R Gaster, Lee Howes, David, R. Kaeli, Perhaad Mistry and Dana Schaa, “Heterogeneous Computing
with OpenCL”, Elsevier, 2013.
2. Aaftab Munshi, Benedict Gaster, Timothy G. Mattson, James Fung & Dan Ginsburg, “OpenCL Programming
Guide”, Addison-Wesley Professional, 2011.
3. RyojiTsuchiyama, Takashi Nakamura, TakuroIizuka & Akihiro Asahara, “The OpenCL Programming Book”,
Fixstars Corporation, 2010.
4. Matthew Scarpio, “OpenCL in Action: How to Accelerate Graphics and Computations”, Manning Publications,
2011.
Reference Books
1. P. Pacheco, Parallel Programming with MPI, Elsevier
2. Shane Cook, CUDA programming, Morgan Kaufmann
Web References
1. https://www.nvidia.com/en-us/accelerated-applications/
2. https://forums.developer.nvidia.com/c/nvidia-virtual-gpu-forums/328
COs/POs/PSOs Mapping
Program Specific
Program Outcomes (POs)
COs Outcomes (PSOs)
PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2 PSO3
1
2
3
4
5
Evaluation Method
* Application oriented / Problem solving / Design / Analytical in content beyond the syllabus