Tutorial No 3
Question
1. What does GPGPU stand for, and what does it mean?
General-Purpose computing on Graphics Processing Units: using a GPU to perform general computations that are traditionally handled by the CPU, rather than graphics rendering alone.
2. Why is CUDA described as heterogeneous computing?
Processing is divided between two different kinds of processor:
the serial, latency-sensitive code is executed by the CPU (the host)
the data-parallel, latency-tolerant code is executed by the GPU (the device)
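The split between host and device can be sketched with a small vector addition. This is a minimal illustration, not part of the original tutorial: the names vecAdd and addOnDevice, the block size of 256, and the array length are all assumptions chosen for the example.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Device (GPU) code: the data-parallel part, one thread per element.
__global__ void vecAdd(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

// Host (CPU) code: serial control flow, memory management, kernel launch.
// addOnDevice is a hypothetical helper name; it returns true on success.
bool addOnDevice(const float *a, const float *b, float *c, int n) {
    size_t bytes = static_cast<size_t>(n) * sizeof(float);
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    bool ok = cudaMalloc(&dA, bytes) == cudaSuccess &&
              cudaMalloc(&dB, bytes) == cudaSuccess &&
              cudaMalloc(&dC, bytes) == cudaSuccess &&
              cudaMemcpy(dA, a, bytes, cudaMemcpyHostToDevice) == cudaSuccess &&
              cudaMemcpy(dB, b, bytes, cudaMemcpyHostToDevice) == cudaSuccess;
    if (ok) {
        int threads = 256;                         // assumed block size
        int blocks = (n + threads - 1) / threads;  // round up to cover n
        vecAdd<<<blocks, threads>>>(dA, dB, dC, n);
        ok = cudaGetLastError() == cudaSuccess &&
             cudaMemcpy(c, dC, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(dA);
    cudaFree(dB);
    cudaFree(dC);
    return ok;
}

int main() {
    const int n = 1000;
    std::vector<float> a(n, 1.0f), b(n, 2.0f), c(n, 0.0f);
    if (!addOnDevice(a.data(), b.data(), c.data(), n)) {
        std::printf("CUDA call failed\n");
        return 1;
    }
    std::printf("c[0] = %.1f, c[%d] = %.1f\n", c[0], n - 1, c[n - 1]);
    return 0;
}
```

Everything outside vecAdd runs serially on the CPU; only the element-wise addition is handed to the GPU, which is the division of labour described in the answer above.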
3. Give the definition of the following terms:
a. Device: the GPU and its memory.
b. Kernel: a function that runs on the device. Many threads execute each kernel. In the basic model one kernel executes at a time, although newer GPUs can run several kernels concurrently.
c. Grid of thread blocks: a kernel is executed by a grid of thread blocks. Each grid is a collection of blocks, and each block is a collection of threads.
d. Warp: a group of 32 threads of the same block that are scheduled and executed together.
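The terms defined in question 3 can be seen together in one short program. This is a sketch under stated assumptions: the kernel recordIds and the helper runRecordIds are hypothetical names, the launch uses one-dimensional blocks, and unified (managed) memory is assumed to be available on the device.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Kernel: every thread of every block in the grid runs this function once.
__global__ void recordIds(int *warpInBlock, int *laneInWarp) {
    int g = blockIdx.x * blockDim.x + threadIdx.x;  // unique index in the grid
    warpInBlock[g] = threadIdx.x / warpSize;        // warp number (1D blocks)
    laneInWarp[g]  = threadIdx.x % warpSize;        // position inside the warp
}

// Hypothetical host helper: launches a grid of `blocks` blocks with
// `threads` threads each and fills two managed arrays with the results.
bool runRecordIds(int blocks, int threads, int **warp, int **lane) {
    size_t bytes = static_cast<size_t>(blocks) * threads * sizeof(int);
    *warp = nullptr;
    *lane = nullptr;
    if (cudaMallocManaged(warp, bytes) != cudaSuccess) return false;
    if (cudaMallocManaged(lane, bytes) != cudaSuccess) return false;
    recordIds<<<blocks, threads>>>(*warp, *lane);
    return cudaDeviceSynchronize() == cudaSuccess;
}

int main() {
    const int blocks = 2, threads = 64;  // grid of 2 blocks, 64 threads each
    int *warp = nullptr, *lane = nullptr;
    if (!runRecordIds(blocks, threads, &warp, &lane)) {
        std::printf("CUDA call failed\n");
        return 1;
    }
    const int samples[] = {0, 31, 32, 64, 127};
    for (int g : samples) {
        std::printf("thread %3d: block %d, warp %d, lane %2d\n",
                    g, g / threads, warp[g], lane[g]);
    }
    cudaFree(warp);
    cudaFree(lane);
    return 0;
}
```

With a warp size of 32, each 64-thread block holds two warps, so threads 31 and 32 of the same block fall into different warps even though they are neighbours in the block.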
6. Explain why constant memory is cached while global memory is not.
Constant memory is read-only from the device, so caching it adds no overhead: there is no cache coherency problem to solve.
Global memory is read/write. Caching it raises a cache coherency problem, and the overhead of maintaining coherency would be very high with thousands of threads running. (This reflects the early CUDA memory model; later GPU architectures do cache global memory in an L2 cache.)
King Saud University
College of Computer and Information Sciences
Department of Computer Science
CSC453 – Parallel Processing – Tutorial No 3 – Fall 2021