GPU Optimisation

This document outlines an assignment to profile and optimize GPU performance by developing a Template D program. Students are asked to implement profiling of computation phases, occupancy, bandwidth and speedup and compare GPU and CPU performance. They must submit their template and report by Monday analyzing when GPU processing provides better performance than the CPU.


Advanced Operating Systems – Programming Assignment 2

GPU Profiling & Optimisation

Your assignment is to complete our investigation into GPU speedup, occupancy, and memory
bandwidth, and to answer the question:

“Since there is an overhead in GPU processing, when does it make sense to use the GPU instead of the
CPU?”
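One way to approach this question empirically is to sweep the problem size and compare CPU wall-clock time against the total GPU time, including allocation and transfers. The sketch below is only an illustration under stated assumptions: the SAXPY workload, the helper names (saxpy_cpu, saxpy_gpu, speedup, now_ms) and the size range are hypothetical and do not come from Templates A to C.

```cuda
#include <cassert>
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>
#include <vector>

// Hypothetical workload (not from the lab templates): y[i] = a * x[i] + y[i].
__global__ void saxpy_gpu(int n, float a, const float *x, float *y) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}

void saxpy_cpu(int n, float a, const float *x, float *y) {
    for (int i = 0; i < n; ++i) y[i] = a * x[i] + y[i];
}

// Speedup is CPU time divided by GPU time; a value above 1 means the GPU wins.
double speedup(double cpu_ms, double gpu_ms) { return cpu_ms / gpu_ms; }

// Host wall-clock time in milliseconds.
static double now_ms() {
    using namespace std::chrono;
    return duration<double, std::milli>(steady_clock::now().time_since_epoch()).count();
}

int main() {
    int device_count = 0;
    if (cudaGetDeviceCount(&device_count) != cudaSuccess || device_count == 0) {
        std::printf("no CUDA device found\n");
        return 0;
    }
    cudaFree(0);  // create the context up front so its cost is not charged to the first size

    for (int n = 1 << 10; n <= (1 << 24); n <<= 2) {
        std::vector<float> x(n, 1.0f), y_cpu(n, 2.0f), y_gpu(n, 2.0f);
        size_t bytes = static_cast<size_t>(n) * sizeof(float);

        double t0 = now_ms();
        saxpy_cpu(n, 3.0f, x.data(), y_cpu.data());
        double cpu_ms = now_ms() - t0;

        // GPU wall-clock time deliberately includes allocation and both transfers.
        t0 = now_ms();
        float *dx = nullptr, *dy = nullptr;
        cudaMalloc(&dx, bytes);
        cudaMalloc(&dy, bytes);
        cudaMemcpy(dx, x.data(), bytes, cudaMemcpyHostToDevice);
        cudaMemcpy(dy, y_gpu.data(), bytes, cudaMemcpyHostToDevice);
        saxpy_gpu<<<(n + 255) / 256, 256>>>(n, 3.0f, dx, dy);
        // This blocking copy also waits for the kernel to finish.
        cudaMemcpy(y_gpu.data(), dy, bytes, cudaMemcpyDeviceToHost);
        cudaFree(dx);
        cudaFree(dy);
        double gpu_ms = now_ms() - t0;

        assert(y_cpu[0] == y_gpu[0]);  // both should hold 3 * 1 + 2 = 5
        std::printf("n=%9d  cpu=%9.4f ms  gpu=%9.4f ms  speedup=%7.3f\n",
                    n, cpu_ms, gpu_ms, speedup(cpu_ms, gpu_ms));
    }
    return 0;
}
```

The size at which the speedup column first exceeds 1, if it does at all, is the break-even point for this particular workload. A memory-bound kernel such as this one may never reach it once transfers are counted, which is itself a result worth discussing in the report.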

Refer to the templates we have developed in the lab (A, B, and C), the NVIDIA webinars on CUDA
optimisation, and the other example programs in the NVIDIA GPU Computing SDK (e.g.
simpleZeroCopy, bandwidthTest, etc.).

Develop a Template D which incorporates your understanding of the CUDA C Runtime API, and the
material covered in the lectures. Write a short report on your findings, and provide comparative results
(speed, occupancy, bandwidth), using Template C results as your baseline.

Your program should implement the following requirements:

1. Target the Fermi architecture (your GTX 470), and state in your report which features used in
your Template D are not available at CUDA compute capability 1.0.

2. Wall-clock timings of CPU and GPU computation.

3. Direct profiling of GPU computation phases, as per the events in Template C.

4. Achieve maximum GPU speedup.

5. Achieve maximum GPU occupancy.

6. Achieve maximum GPU memory bandwidth.

7. Compare 4, 5, and 6 with the CPU.
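The measurements in the list above can be sketched with CUDA events, as follows. This is a minimal illustration, not the Template C code: the scale kernel, the helper names (effective_bandwidth_gbs, occupancy, elapsed_ms) and the sizes are assumptions. It also assumes CUDA 6.5 or later, because cudaOccupancyMaxActiveBlocksPerMultiprocessor was introduced in that release; with an older toolkit the occupancy figure would have to come from the CUDA Occupancy Calculator instead.

```cuda
#include <cassert>
#include <cstdio>
#include <cuda_runtime.h>
#include <vector>

// Hypothetical kernel: one read and one write per element.
__global__ void scale(int n, float a, float *data) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= a;
}

// Effective bandwidth in GB/s (1 GB = 1e9 bytes) for `bytes` moved in `ms` milliseconds.
double effective_bandwidth_gbs(double bytes, double ms) { return bytes / (ms * 1.0e6); }

// Theoretical occupancy: resident warps per SM divided by the SM's maximum warps.
double occupancy(int blocks_per_sm, int block_size, int warp_size, int max_threads_per_sm) {
    int active_warps = blocks_per_sm * ((block_size + warp_size - 1) / warp_size);
    int max_warps = max_threads_per_sm / warp_size;
    return static_cast<double>(active_warps) / max_warps;
}

static float elapsed_ms(cudaEvent_t start, cudaEvent_t stop) {
    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    return ms;
}

int main() {
    int device_count = 0;
    if (cudaGetDeviceCount(&device_count) != cudaSuccess || device_count == 0) {
        std::printf("no CUDA device found\n");
        return 0;
    }
    const int n = 1 << 22, block = 256;
    const size_t bytes = static_cast<size_t>(n) * sizeof(float);
    std::vector<float> h(n, 1.0f);
    float *d = nullptr;
    cudaMalloc(&d, bytes);

    // One event at each phase boundary: H2D copy, kernel, D2H copy.
    cudaEvent_t ev[4];
    for (auto &e : ev) cudaEventCreate(&e);
    cudaEventRecord(ev[0]);
    cudaMemcpy(d, h.data(), bytes, cudaMemcpyHostToDevice);
    cudaEventRecord(ev[1]);
    scale<<<(n + block - 1) / block, block>>>(n, 2.0f, d);
    cudaEventRecord(ev[2]);
    cudaMemcpy(h.data(), d, bytes, cudaMemcpyDeviceToHost);
    cudaEventRecord(ev[3]);
    cudaEventSynchronize(ev[3]);
    assert(h[0] == 2.0f);

    float h2d = elapsed_ms(ev[0], ev[1]);
    float kernel = elapsed_ms(ev[1], ev[2]);
    float d2h = elapsed_ms(ev[2], ev[3]);
    std::printf("H2D    %8.3f ms  %7.2f GB/s\n", h2d, effective_bandwidth_gbs(bytes, h2d));
    std::printf("kernel %8.3f ms  %7.2f GB/s\n", kernel,
                effective_bandwidth_gbs(2.0 * bytes, kernel));  // read + write
    std::printf("D2H    %8.3f ms  %7.2f GB/s\n", d2h, effective_bandwidth_gbs(bytes, d2h));

    // Theoretical occupancy for this kernel at this block size (CUDA 6.5+ API).
    cudaDeviceProp prop;
    cudaGetDeviceProperties(&prop, 0);
    int blocks_per_sm = 0;
    cudaOccupancyMaxActiveBlocksPerMultiprocessor(&blocks_per_sm, scale, block, 0);
    std::printf("occupancy %.2f\n",
                occupancy(blocks_per_sm, block, prop.warpSize, prop.maxThreadsPerMultiProcessor));

    for (auto &e : ev) cudaEventDestroy(e);
    cudaFree(d);
    return 0;
}
```

The occupancy printed here is a theoretical upper bound derived from resource limits, not a measured value. The bandwidth figures are effective rates that depend on what is counted as bytes moved, so the report should state that choice explicitly.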

Submit your template and report to the course website no later than first thing Monday 11th October –
please note that there are NO extensions on this deadline.
