0% found this document useful (0 votes)

30 views35 pages

GPU Architecture and Function: Michael Foster and Ian Frasch

Uploaded by

Huseyn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views35 pages

GPU Architecture and Function: Michael Foster and Ian Frasch

Uploaded by

Huseyn

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 35

GPU Architecture and

Function
Michael Foster and Ian Frasch
Overview
● What is a GPU?
● How is a GPU different from a CPU?
● The graphics pipeline
● History of the GPU
● GPU architecture
● Optimizations
● GPU performance trends
● Current development
What is a GPU?
● Dedicated graphics chip that handles
all processing required for rendering
3D objects on the screen
● Typically placed on a video card,
which contains its own memory and
display interfaces (HDMI, DVI, VGA,
etc)
● Primitive GPUs were developed in
the 1980s, although the first
“complete” GPUs began in the mid
1990s.
Systems level view
● Video card connected to
motherboard through PCI-
Express or AGP (Accelerated
Graphics Port)
● Northbridge chip enables data
transfer between the CPU and
GPU
● Graphics memory on the video
card contains the pixel RGB
data for each frame
How is a GPU different from a CPU?
Throughput more important than latency
o High throughput needed for the huge amount of
computations required for graphics
o Not concerned about latency because human visual
system operates on a much longer time scale
 16 ms maximum latency at 60 Hz refresh rate
 Long pipelines with many stages; a single instruction may
thousands of cycles to get through the pipeline.

Latency
How is a GPU different from a CPU?
Extremely parallel
o Different pixels and elements of the image can be
operated on independently
o Hundreds of cores executing at the same time to
take advantage of this fundamental parallelism
Inputs and Outputs
Inputs to GPU (from the CPU/memory):
● Vertices (3D coordinates) of objects
● Texture data
● Lighting data
Outputs from GPU:
● Frame buffer
o Placed in a specific section of graphics memory
o Contains RGB values for each pixel on the screen
o Data is sent directly to display
The Graphics Pipeline: A Visual

3D coordinates

/Pixel Shader
The Graphics Pipeline
● The GPU completes every
stage of this computational
pipeline
Transformations
Camera transformation
o Convert vertices from 3D world
coordinates to 3D camera
coordinates, with the camera
(user view) as the origin
Projection transformation
o Convert vertices from 3D camera
coordinates to 2D screen view
coordinates that the user will see
Illustration of 3D-2D Projection
(With overlapping vertices)
Depth values in Z buffer
determine which triangle
will be visible if two
vertices map to the same
2D coordinate
Transformations
● These transformations simply modify
vertices, so they are done by vertex shaders
● Transform computations are heavy on matrix
multiplication
● Each vertex can be transformed
independently
o Data Parallelism
Example renderings

Vertices Primitives Textures

(Point-cloud) (Triangles)
More renderings

Simple shaded 6-vertex

cube; each vertex has a Advanced rendering
color associated with it,
the pixel shader blends
the colors.
Fixed-Function to Programmable
● Earlier GPUs were fixed-function hardware pipelines
o Software developers could set parameters (textures, light
reflection colors, blend modes) but the function was completely
controlled by the hardware
● In newer GPUs, portions of the pipeline are completely
programmable
o Pipeline stages are now programs running on processor cores
inside the GPU, instead of fixed-function ASICs
o Vertex shaders = programs running on vertex processors,
fragment shaders = programs running on fragment processors
o However, some stages are still fixed function (e.g.
rasterization)
History of the GPU
1996: 3DFX Voodoo graphics card implements texture mapping, z-
buffering, and rasterization, but no vertex processing
1999: GPUs implement the full graphics pipeline in fixed-function
hardware (Nvidia GeForce 256, ATI Radeon 7500)
2001: Programmable shader pipelines (Nvidia Geforce 3)
2006: Unified shader architecture (ATI Radeon R600, Nvidia Geforce
8, Intel GMA X3000, ATI Xenos for Xbox360)
2010: General Purpose GPUs for non-graphical compute-intensive
applications, Nvidia CUDA parallel programming API
2014: Unprecedented compute power
Nvidia Geforce GTX Titan Z - 8.2 TFLOPS
AMD Radeon R9 295X2 (dual GPU card) - 11.5 TFLOPS
GPU Architecture

(programmable) Nvidia Geforce 6

series (2004)

(programmable)
Shader processors

Vertex Shader core Fragment/Pixel Shader core

Single Instruction, Multiple Data
(SIMD)
● Shader processors are
generally SIMD
● A single instruction
executed on every
vertex or pixel
Functional block diagram
Optimizations
● Combining different types of shader cores
into a single unified shader core
● Dynamic task scheduling to balance the load
on all cores
Workload Distribution
- Frames with many
“edges” (vertices)
require more vertex
shaders

- Frames with large

primitives require
more pixel shaders
Solution: Unified Shader
● Pixel shaders, geometry shaders, and vertex
shaders run on the same core - a unified
shader core
o Unified shaders limit idle shader cores
o Instruction set shared across all shader types
o Program determines type of shader
● Modern GPUs all use unified shader cores
● Shader cores are programmed using
graphics APIs like OpenGL and Direct3D
Static Task Distribution

- Unequal task distribution leads to inefficient hardware usage

- Parallel processors should handle tasks of equal complexity
Dynamic Task Distribution

- Tasks are dynamically distributed among pixel shaders

- Slots for output are pre-allocated in output FIFO
Modern GPU Hardware
Color Key

Basic blocks in a modern GPU

Unified Architecture example

Nvidia Geforce 8 (2006)

Unified Architecture example

AMD Radeon R600 (2006)

GPU Compute Power: Recent History
GPU Memory Bandwidth: Recent History
Current Development and Future
- GPU fixed-function units are being
abstracted away
- Newest versions of CUDA and OpenGL
include instructions for general-purpose
computing
- Future GPUs will resemble multi-core CPUs
with hyper-threading
Nvidia Fermi (2010)

- 16-cores with 32-way hyper-threading per core

- 1.5 TFLOPS (peak)
Sources
D. Hower. (2013, May 21). GPU Architectures: A CPU Perspective. [Online]. Available:
http://courses.cs.washington.edu/courses/cse471/13sp/lectures/GPUsStudents.pdf

C. M. Wittenbrink, E. Kilgariff, and A. Prabhu. (2011, April). IEEE Micro. [Online]. 31(2), pp. 50 - 59. Available:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5751939

D. Luebke and G. Humphreys, “How GPUs Work,” IEEE Computer. [Online]. vol. 40, no. 2, pp. 96-100, Feb. 2007. Available:
http://www.cs.virginia.edu/~gfx/papers/pdfs/59_HowThingsWork.pdf

B. Mederos, L. Velho, and L. H. de Figueiredo, “Moving least squares multiresolution surface approximation,” Proc. XVI Brazilian Symp.
Computer Graphics and Image Processing, [Online]. Oct. 2003, pp 19-26. Available: http://w3.impa.br/~boris/mederosb_moving.pdf

K. Hagen. (2014, July 23). Introduction to Real-Time Rendering. [Online]. Available: http://www.slideshare.net/korayhagen/introduction-to-real-
time-rendering

G. Turk. (2000, Aug.). “The Stanford Bunny,” [Online]. Available: http://www.cc.gatech.edu/~turk/bunny/bunny.html.old01

“Drawing Polygons,” OpenGL. [Online]. Available: https://open.gl/drawing

K. Fatahalian. (2011). How a GPU Works. [Online]. Available: http://www.cs.cmu.edu/afs/cs/academic/class/15462-

f11/www/lec_slides/lec19.pdf

D. Luebke. (2007). GPU Architecture: Implications & Trends. [Online]. Available: http://s08.idav.ucdavis.edu/luebke-nvidia-gpu-architecture.pdf
Sources (continued)
M. Houston and A. Lefohn. (2011). GPU architecture II: Scheduling the graphics pipeline. [Online]. Available:
https://courses.cs.washington.edu/courses/cse558/11wi/lectures/08-GPU-architecture-II_BPS-2011.pdf

J. Ragan-Kelley. (2010, July 29). Keeping Many Cores Busy: Scheduling the Graphics Pipeline. [Online]. Available:
http://bps10.idav.ucdavis.edu/talks/09-raganKelley_SchedulingRenderingPipeline_BPS_SIGGRAPH2010.pdf

B. C. Johnstone, “Bandwidth Requirements of GPU Architectures,” M. S. thesis, Dept. Comp. Eng., Rochester Institute of Technology,
Rochester, NY, 2014.

J. D. Owens, M. Houston, D. Luebke, and S. Green, (2008). “GPU Computing,” Proc. IEEE, vol. 96, no. 5, pp. 879-899, [Online]. Available:
http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4490127&tag=1

T. Dalling. (2014, Feb. 24). “Explaining Homogeneous Coordinates & Projective Geometry,” [Online]. Available:
http://www.tomdalling.com/blog/modern-opengl/explaining-homogenous-coordinates-and-projective-geometry/

NVIDIA, (2009). "Whitepaper: NVIDIA's next generation CUDA compute architecture: Fermi," [Online]. Available:
http://www.nvidia.com/content/pdf/fermi_white_papers/nvidiafermicomputearchitecturewhitepaper.pdf

C. McClanahan. (2010). History and Evolution of GPU Architecture: A Paper Survey. [Online]. Available: http://mcclanahoochie.com/blog/wp-
content/uploads/2011/03/gpu-hist-paper.pdf

J. Bikker. (2014). Graphics: Universiteit Utrecht - Information and Computing Sciences. [Online]. Available:
http://www.cs.uu.nl/docs/vakken/gr/2015/index.html
Sources (continued)
K. Rupp. (2014, June 21). “CPU, GPU and MIC Hardware Characteristics over Time,” [Online]. Available: http://www.karlrupp.net/2013/06/cpu-
gpu-and-mic-hardware-characteristics-over-time/

J. Lawrence. (2012, Oct. 22). 3D Polygon Rendering Pipeline. [Online]. Available:

http://www.cs.virginia.edu/~gfx/Courses/2012/IntroGraphics/lectures/13-Pipeline.pdf

M. Christen. (2003). Tutorial 4: Varying Variables. [Online]. Available: http://www.clockworkcoders.com/oglsl/tutorial4.htm

T. S. Crow, “Evolution of the Graphical Processing Unit,” M. S. thesis, Dept. Comp. Science, University of Nevada, Reno, NV, 2004.

“Utah teapot,” Wikipedia. [Online]. Available: http://en.wikipedia.org/wiki/Utah_teapot

A. Rege. (2008). An Introduction to Modern GPU Architecture. [Online]. Available:

http://http.download.nvidia.com/developer/cuda/seminar/TDCI_Arch.pdf

HD2000 - The First GPU’s under the AMD Name. (2007, May 14) [Online]. Available: http://www.bjorn3d.com/2007/05/hd2000-the-first-gpus-
under-the-amd-name-2/

E. Kelgariff and R. Fernando. (2005). “Chapter 30. The GeForce 6 Series GPU Architecture,” GPU Gems 2. [Online]. Available:
http://http.developer.nvidia.com/GPUGems2/gpugems2_chapter30.html

P. N. Glaskowsky. (2009, Sept.). NVIDIA’s Fermi: The First Complete GPU Computing Architecture. [Online]. Available:
http://www.nvidia.com/content/PDF/fermi_white_papers/P.Glaskowsky_NVIDIA%27s_Fermi-The_First_Complete_GPU_Architecture.pdf

Linux Foundation Certified Kubernetes Administrator (CKA) Program - CKA Exam Questions (2025) - 9
No ratings yet
Linux Foundation Certified Kubernetes Administrator (CKA) Program - CKA Exam Questions (2025) - 9
5 pages
Becoming SRE Engineer
No ratings yet
Becoming SRE Engineer
3 pages
Valn 10 Ip
No ratings yet
Valn 10 Ip
6 pages
Lecture - 01 - CUDA Programming
No ratings yet
Lecture - 01 - CUDA Programming
52 pages
Graphic Processing Unit
100% (1)
Graphic Processing Unit
20 pages
Graphics Processing Unit
No ratings yet
Graphics Processing Unit
21 pages
NVIDIA GPU Computing - A Journey From PC Gaming To Deep Learning
100% (1)
NVIDIA GPU Computing - A Journey From PC Gaming To Deep Learning
91 pages
MSI Creator TRX40 Manual
No ratings yet
MSI Creator TRX40 Manual
91 pages
Chapter 9 - Multiple Core Computers
No ratings yet
Chapter 9 - Multiple Core Computers
44 pages
9.C++ Programming An Object-Oriented Approach
No ratings yet
9.C++ Programming An Object-Oriented Approach
63 pages
UNIT 4 GPU Computing - HPC
No ratings yet
UNIT 4 GPU Computing - HPC
13 pages
VNX2 Customer Software Upgrade Block
No ratings yet
VNX2 Customer Software Upgrade Block
8 pages
Kirk+Hwu GPU
No ratings yet
Kirk+Hwu GPU
92 pages
Cloud Computing - KCS713 - Assignment
No ratings yet
Cloud Computing - KCS713 - Assignment
1 page
GPU (Graphics Processing Unit)
No ratings yet
GPU (Graphics Processing Unit)
11 pages
How A GPU Works: Kayvon Fatahalian 15-462 (Fall 2011)
No ratings yet
How A GPU Works: Kayvon Fatahalian 15-462 (Fall 2011)
87 pages
Purchasing - File - Hasil Lelang Final
No ratings yet
Purchasing - File - Hasil Lelang Final
2 pages
The Evolution of Gpus For General Purpose Computing
No ratings yet
The Evolution of Gpus For General Purpose Computing
38 pages
02 - R20 I Yr M.Tech (Embedded Systems) I Sem - Syllabus
No ratings yet
02 - R20 I Yr M.Tech (Embedded Systems) I Sem - Syllabus
36 pages
Parallel 4
No ratings yet
Parallel 4
3 pages
p10 Cuda
No ratings yet
p10 Cuda
28 pages
Gpu IEEE Paper
No ratings yet
Gpu IEEE Paper
14 pages
Graphics Processing Unit: R.Raghu Ram 15P35A0419 Ivece3 T.Devi (M.Tech
No ratings yet
Graphics Processing Unit: R.Raghu Ram 15P35A0419 Ivece3 T.Devi (M.Tech
27 pages
0 Gpu Computing I Give It
No ratings yet
0 Gpu Computing I Give It
57 pages
10 GPU-IntroCUDA3
No ratings yet
10 GPU-IntroCUDA3
141 pages
Aspire One 1410
No ratings yet
Aspire One 1410
256 pages
Architectural Details of Tesla GPU Microarchitecture
No ratings yet
Architectural Details of Tesla GPU Microarchitecture
9 pages
Modern GPU Architecture
No ratings yet
Modern GPU Architecture
93 pages
Mingpu: A Minimum Gpu Library For Computer Vision: Pavel Babenko and Mubarak Shah
No ratings yet
Mingpu: A Minimum Gpu Library For Computer Vision: Pavel Babenko and Mubarak Shah
30 pages
What Is A GPU
No ratings yet
What Is A GPU
3 pages
Exercise 3 - Introduction To Scilab
No ratings yet
Exercise 3 - Introduction To Scilab
5 pages
Lecture 17-Introduction To GPU
No ratings yet
Lecture 17-Introduction To GPU
36 pages
Profibus DP Introduction
No ratings yet
Profibus DP Introduction
38 pages
LENOVO AIO M90a GEN 5
No ratings yet
LENOVO AIO M90a GEN 5
3 pages
Attia 2011
No ratings yet
Attia 2011
6 pages
How Gpus Work
No ratings yet
How Gpus Work
5 pages
Chapter 2
No ratings yet
Chapter 2
21 pages
Introduction To Graphics Hardware and Gpus Introduction To Graphics Hardware and Gpus
No ratings yet
Introduction To Graphics Hardware and Gpus Introduction To Graphics Hardware and Gpus
22 pages
Comparing Databases For An Industrial IoT Use-Case: MongoDB, TimescaleDB, InfluxDB and CrateDB
No ratings yet
Comparing Databases For An Industrial IoT Use-Case: MongoDB, TimescaleDB, InfluxDB and CrateDB
6 pages
Lecture-12-PDC - CUDA
No ratings yet
Lecture-12-PDC - CUDA
25 pages
Lecture 2
No ratings yet
Lecture 2
15 pages
GPU (Graphics Processing Unit)
No ratings yet
GPU (Graphics Processing Unit)
23 pages
Sorting Data: Implement Grep and Tar
No ratings yet
Sorting Data: Implement Grep and Tar
3 pages
Design of Graphics Processing Framework On FPGA
No ratings yet
Design of Graphics Processing Framework On FPGA
5 pages
GPU Architecture
No ratings yet
GPU Architecture
12 pages
Report On Gpu
No ratings yet
Report On Gpu
39 pages
Graphics Processing Unit (GPU)
No ratings yet
Graphics Processing Unit (GPU)
13 pages
GPU 01.intro
No ratings yet
GPU 01.intro
36 pages
Gpus
No ratings yet
Gpus
32 pages
59 HowThingsWork
No ratings yet
59 HowThingsWork
5 pages
Csi ZG518 Ec-2
No ratings yet
Csi ZG518 Ec-2
6 pages
Graphics Processing Unit
No ratings yet
Graphics Processing Unit
22 pages
Graphics Processing Unit
No ratings yet
Graphics Processing Unit
14 pages
HPC 5th Unit - 240504 - 160548
No ratings yet
HPC 5th Unit - 240504 - 160548
18 pages
GPGPU
No ratings yet
GPGPU
139 pages
Unit V Data Communication: Prepared by B.R.S.Reddy Lecturer/ ECE NIT
No ratings yet
Unit V Data Communication: Prepared by B.R.S.Reddy Lecturer/ ECE NIT
60 pages
GPUIntro
No ratings yet
GPUIntro
21 pages
Lecture GPUArchCUDA01
No ratings yet
Lecture GPUArchCUDA01
57 pages
SIMEAS P OperatingInstructionProfibus E50417 B1076 C238 A2 30082004 en
No ratings yet
SIMEAS P OperatingInstructionProfibus E50417 B1076 C238 A2 30082004 en
27 pages
Graphics Processing Unit: Shashwat Shriparv Infinitysoft
No ratings yet
Graphics Processing Unit: Shashwat Shriparv Infinitysoft
39 pages
Presentation Prepared by Saatwik Kumar 1101219423 ETC, ET-2
No ratings yet
Presentation Prepared by Saatwik Kumar 1101219423 ETC, ET-2
18 pages
How A GPU Works - Kayvon Fatahalian
No ratings yet
How A GPU Works - Kayvon Fatahalian
87 pages
Chapter 11: Indexing and Storage: Modified From: Database System Concepts, 6 Ed
No ratings yet
Chapter 11: Indexing and Storage: Modified From: Database System Concepts, 6 Ed
53 pages
Block Diagram of A Computer
No ratings yet
Block Diagram of A Computer
21 pages
Telematica 2010/2011: Test ACL 1
No ratings yet
Telematica 2010/2011: Test ACL 1
6 pages
Graphics Processing Unit
No ratings yet
Graphics Processing Unit
9 pages
Transitioning Applications From CAN 2.0 To CAN FD
No ratings yet
Transitioning Applications From CAN 2.0 To CAN FD
8 pages
Sybex CCNA 640-802: Chapter 5: Managing A Cisco Internetwork
No ratings yet
Sybex CCNA 640-802: Chapter 5: Managing A Cisco Internetwork
33 pages
Docker Resp
No ratings yet
Docker Resp
2 pages
GPU
No ratings yet
GPU
17 pages
TDCI Arch
No ratings yet
TDCI Arch
77 pages
History and Evolution of Gpu Architecture: Chris Mcclanahan
No ratings yet
History and Evolution of Gpu Architecture: Chris Mcclanahan
7 pages
Developers Had To Map Scientific Calculations Onto Problems That Could Be Represented by Triangles and Polygons
No ratings yet
Developers Had To Map Scientific Calculations Onto Problems That Could Be Represented by Triangles and Polygons
2 pages
Graphics Processing Unit (Gpu) : BY Amal Raj.R Electronics C.P.T.C
No ratings yet
Graphics Processing Unit (Gpu) : BY Amal Raj.R Electronics C.P.T.C
30 pages
Graphics Processing Unit Graphics Processing Unit: Dhan V Sagar CB - EN.P2CSE13007
No ratings yet
Graphics Processing Unit Graphics Processing Unit: Dhan V Sagar CB - EN.P2CSE13007
21 pages
GPU Fundamentals
No ratings yet
GPU Fundamentals
21 pages
Procedure To Rectify The Scripting Error On Autosequence and Reporting On Micros 3700
No ratings yet
Procedure To Rectify The Scripting Error On Autosequence and Reporting On Micros 3700
3 pages
Why GPU?: CS8803SC Software and Hardware Cooperative Computing
No ratings yet
Why GPU?: CS8803SC Software and Hardware Cooperative Computing
14 pages
Presented By:: J.Ambaji (07W81A1247)
No ratings yet
Presented By:: J.Ambaji (07W81A1247)
35 pages
Evolution of The Graphics Process Units: Dr. Zhijie Xu Z.xu@hud - Ac.uk
No ratings yet
Evolution of The Graphics Process Units: Dr. Zhijie Xu Z.xu@hud - Ac.uk
24 pages
Industrial Network Security Monitoring - ICS - NSM - POSTER
No ratings yet
Industrial Network Security Monitoring - ICS - NSM - POSTER
2 pages
6101 Fundamentals Ofcomputer and IT MCQ
100% (1)
6101 Fundamentals Ofcomputer and IT MCQ
69 pages
Parallel Processing Using GPU's
No ratings yet
Parallel Processing Using GPU's
34 pages
Unit 2 - GPU DFG
No ratings yet
Unit 2 - GPU DFG
27 pages
Taxonomy Parallel Computer Architectures Instruction Data
No ratings yet
Taxonomy Parallel Computer Architectures Instruction Data
2 pages
Arcadis SW Installation
100% (1)
Arcadis SW Installation
62 pages
Shader: Exploring Visual Realms with Shader: A Journey into Computer Vision
From Everand
Shader: Exploring Visual Realms with Shader: A Journey into Computer Vision
Fouad Sabry
No ratings yet
Sega Saturn Architecture: Architecture of Consoles: A Practical Analysis, #5
From Everand
Sega Saturn Architecture: Architecture of Consoles: A Practical Analysis, #5
Rodrigo Copetti
No ratings yet
Mega Drive Architecture: Architecture of Consoles: A Practical Analysis, #3
From Everand
Mega Drive Architecture: Architecture of Consoles: A Practical Analysis, #3
Rodrigo Copetti
No ratings yet
Dreamcast Architecture: Architecture of Consoles: A Practical Analysis, #9
From Everand
Dreamcast Architecture: Architecture of Consoles: A Practical Analysis, #9
Rodrigo Copetti
No ratings yet

GPU Architecture and Function: Michael Foster and Ian Frasch

Uploaded by

GPU Architecture and Function: Michael Foster and Ian Frasch

Uploaded by

GPU Architecture and

Vertices Primitives Textures

Simple shaded 6-vertex

(programmable) Nvidia Geforce 6

Vertex Shader core Fragment/Pixel Shader core

- Frames with large

- Unequal task distribution leads to inefficient hardware usage

- Tasks are dynamically distributed among pixel shaders

Basic blocks in a modern GPU

Nvidia Geforce 8 (2006)

AMD Radeon R600 (2006)

- 16-cores with 32-way hyper-threading per core

G. Turk. (2000, Aug.). “The Stanford Bunny,” [Online]. Available: http://www.cc.gatech.edu/~turk/bunny/bunny.html.old01

“Drawing Polygons,” OpenGL. [Online]. Available: https://open.gl/drawing

K. Fatahalian. (2011). How a GPU Works. [Online]. Available: http://www.cs.cmu.edu/afs/cs/academic/class/15462-

J. Lawrence. (2012, Oct. 22). 3D Polygon Rendering Pipeline. [Online]. Available:

M. Christen. (2003). Tutorial 4: Varying Variables. [Online]. Available: http://www.clockworkcoders.com/oglsl/tutorial4.htm

“Utah teapot,” Wikipedia. [Online]. Available: http://en.wikipedia.org/wiki/Utah_teapot

A. Rege. (2008). An Introduction to Modern GPU Architecture. [Online]. Available:

You might also like