GPU Programming

● NVIDIA GPU hardware architecture
● CUDA Programming model

GPU vs CPU

● The GPU and the CPU exist because they are designed with different goals
  – The CPU is designed to execute a sequence of operations, called a thread, as fast as possible
    ● transistors are devoted to instruction control
  – The GPU is designed to execute thousands of threads in parallel
    ● transistors are devoted to data processing

GPU architecture

What is CUDA

● The CUDA parallel programming model is designed to overcome the challenge of transparently scaling application parallelism across GPUs with widely varying numbers of cores, while maintaining a low learning curve for programmers familiar with standard programming languages such as C.

CUDA Programming model

● The CUDA programming model offers three key abstractions
  – Hierarchy of thread groups
  – Shared memories
  – Barrier synchronization
● These abstractions are exposed to the programmer as a minimal set of language extensions.

Thread Hierarchy

● The programmer can partition the problem into:
  – coarse sub-problems that can be solved independently in parallel by blocks of threads,
  – finer pieces within each sub-problem that can be solved cooperatively in parallel by all threads within a block
  – the grid is composed of all the thread blocks

Thread Hierarchy

● This decomposition preserves language expressivity by allowing threads to cooperate when solving each sub-problem
● At the same time it enables automatic scalability: each block of threads can be scheduled on any of the available multiprocessors within a GPU, in any order, concurrently or sequentially, so that a compiled CUDA program can execute on any number of multiprocessors

What is CUDA

● CUDA C++ extends C++ by allowing the programmer to define C++ functions, called kernels, that, when called, are executed N times in parallel by N different CUDA threads, as opposed to only once like regular C++ functions.

CUDA Programming model

● There is a limit to the number of threads per block, since all threads of a block are expected to reside on the same streaming multiprocessor core and must share the limited memory resources of that core
  – On current GPUs, a thread block may contain up to 1024 threads
● The size of the grid depends on the data size

CUDA Programming model

● A kernel is defined using the __global__ declaration specifier, and the number of CUDA threads that execute that kernel for a given kernel call is specified using a new <<<...>>> execution configuration syntax (see the vector addition example at the end of this section)

CUDA built-in variables

● threadIdx → this variable contains the thread index within the block
● blockDim → this variable contains the number of threads per block
● blockIdx → this variable contains the block index within the grid

Dimensions of the block/grid (see the 2D example at the end of this section)

CUDA Programming model

Where to run your CUDA code?

● On your PC, if it has an NVIDIA GPU!
  – CUDA Installation Guide for Linux
● On the cloud
  – Google Colab: 3 types of NVIDIA GPU:
    ● T4
    ● A100
    ● L4
  – Kaggle
  – Amazon SageMaker Studio Lab

T4 GPU

● Number of SMs = 40
● Number of CUDA cores per SM = 64
● Total number of cores = 40 × 64 = 2560

References

● CUDA C++ Programming Guide
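Example: vector addition kernel

● A minimal sketch, not taken from the slides, illustrating the __global__ specifier, the <<<...>>> execution configuration, the built-in variables threadIdx, blockDim and blockIdx, and a grid size derived from the data size. The names vecAdd and n, the use of cudaMallocManaged, and the block size of 256 are illustrative choices, not part of the original material.

  #include <cstdio>
  #include <cuda_runtime.h>

  // Kernel: executed in parallel by N different CUDA threads
  __global__ void vecAdd(const float *a, const float *b, float *c, int n)
  {
      // Global thread index built from the built-in variables
      int i = blockIdx.x * blockDim.x + threadIdx.x;
      if (i < n)                      // guard: the grid may contain more threads than elements
          c[i] = a[i] + b[i];
  }

  int main()
  {
      const int n = 1 << 20;
      size_t bytes = n * sizeof(float);

      float *a, *b, *c;
      cudaMallocManaged(&a, bytes);   // unified memory keeps the example short
      cudaMallocManaged(&b, bytes);
      cudaMallocManaged(&c, bytes);
      for (int i = 0; i < n; ++i) { a[i] = 1.0f; b[i] = 2.0f; }

      // Execution configuration: up to 1024 threads per block on current GPUs;
      // the grid size depends on the data size (ceiling division)
      int threadsPerBlock = 256;
      int blocksPerGrid = (n + threadsPerBlock - 1) / threadsPerBlock;
      vecAdd<<<blocksPerGrid, threadsPerBlock>>>(a, b, c, n);

      cudaDeviceSynchronize();        // wait for the kernel to finish
      printf("c[0] = %f\n", c[0]);    // expected: 3.0

      cudaFree(a); cudaFree(b); cudaFree(c);
      return 0;
  }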
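Example: two-dimensional blocks and grids

● A minimal sketch, not taken from the slides, showing how the dim3 type expresses the dimensions of the block and the grid for a 2D problem. The kernel name matAdd and the matrix size are illustrative choices.

  #include <cuda_runtime.h>

  // Each thread handles one matrix element, addressed by a 2D index
  __global__ void matAdd(const float *a, const float *b, float *c, int width, int height)
  {
      int x = blockIdx.x * blockDim.x + threadIdx.x;   // column index
      int y = blockIdx.y * blockDim.y + threadIdx.y;   // row index
      if (x < width && y < height)
          c[y * width + x] = a[y * width + x] + b[y * width + x];
  }

  int main()
  {
      const int width = 1024, height = 1024;
      size_t bytes = (size_t)width * height * sizeof(float);

      float *a, *b, *c;
      cudaMallocManaged(&a, bytes);
      cudaMallocManaged(&b, bytes);
      cudaMallocManaged(&c, bytes);

      // 2D execution configuration: 16 x 16 = 256 threads per block,
      // grid dimensions derived from the matrix size (ceiling division)
      dim3 threadsPerBlock(16, 16);
      dim3 blocksPerGrid((width  + threadsPerBlock.x - 1) / threadsPerBlock.x,
                         (height + threadsPerBlock.y - 1) / threadsPerBlock.y);
      matAdd<<<blocksPerGrid, threadsPerBlock>>>(a, b, c, width, height);
      cudaDeviceSynchronize();

      cudaFree(a); cudaFree(b); cudaFree(c);
      return 0;
  }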