Mouse Ray Picking Explained
Brian Hook
http://www.bookofhook.com
April 5, 2005
1 Introduction
There comes a time in every 3D game where the user needs to click on
something in the scene. Maybe he needs to select a unit in an RTS, or open
a door in an RPG, or delete some geometry in a level editing tool. This
conceptually simple task is easy to screw up since there are so many little
steps that can go wrong.
The problem, simply stated, is this: given the mouse’s position in window
coordinates, how can I determine what object in the scene the user has selected
with a mouse click?
One method is to generate a ray using the mouse’s location and then in-
tersect it with the world geometry, finding the object nearest to the viewer.
Alternatively we can determine the actual 3-D location that the user has
clicked on by sampling the depth buffer (giving us (x, y, z) in viewport
space) and performing an inverse transformation. Technically there is a
third approach, using a selection or object ID buffer, but this has numerous
limitations that make it impractical for widespread use.
This article describes using the inverse transformation to derive world
space coordinates from the mouse’s position on screen.
Before we worry about the inverse transformation, we need to estab-
lish how the standard forward transformation works in a typical graphics
pipeline.
2 The View Transformation
The standard view transformation pipeline takes a point in model space
and transforms it all the way to viewport space [1]. It does this by trans-
forming the original point through a series of coordinate systems:
Model
↓
World
↓
View
↓
Clip
↓
Normalized Device
↓
Viewport

The modelview matrix M takes a model-space point p to view coordinates v,
and the projection matrix P takes v to homogeneous clip coordinates c:

M p = v    (1)

P v = c    (2)
[1] Viewport coordinates are also sometimes known as window coordinates or,
for systems without a window system, screen coordinates.
After clipping is performed the perspective divide transforms the
homogeneous coordinate c back into the Cartesian point n in normalized
device space. Normalized device coordinates are left-handed, where n_w = 1,
and are contained within the canonical view frustum from (−1, −1, −1) to
(+1, +1, +1).
n = c / c_w    (3)
Finally there is the viewport scale and translation, V , which transforms
n into the final viewport window coordinates w. Another axis inversion
occurs here; this time +Y goes down instead of up [2]. Viewport depth val-
ues are calculated by rescaling NDC Z coordinates from the range (−1, 1)
to (0, 1), with 0 at the near clip plane and 1 at the far clip plane. It’s impor-
tant to note that any user specified depth bias may impact our calculations
later.
V n = w    (4)
Using this pipeline we can take a model space point, apply a series of
transformations, and get a viewport coordinate.
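As a concrete sketch, here is the forward chain of Equations 1 through 4 in C. The row-major float[16] matrix layout and the helper names are my own assumptions, not anything prescribed by this article:

```c
#include <assert.h>
#include <math.h>

typedef struct { float x, y, z, w; } Vec4;

/* Multiply a row-major 4x4 matrix by a column vector. */
static Vec4 mat_mul_vec(const float m[16], Vec4 v) {
    Vec4 r;
    r.x = m[0]*v.x  + m[1]*v.y  + m[2]*v.z  + m[3]*v.w;
    r.y = m[4]*v.x  + m[5]*v.y  + m[6]*v.z  + m[7]*v.w;
    r.z = m[8]*v.x  + m[9]*v.y  + m[10]*v.z + m[11]*v.w;
    r.w = m[12]*v.x + m[13]*v.y + m[14]*v.z + m[15]*v.w;
    return r;
}

/* Model -> view -> clip -> NDC -> viewport, per Equations 1-4. */
static Vec4 project_point(const float modelview[16], const float proj[16],
                          float vp_w, float vp_h, Vec4 p) {
    Vec4 v = mat_mul_vec(modelview, p);           /* Eq. 1: M p = v   */
    Vec4 c = mat_mul_vec(proj, v);                /* Eq. 2: P v = c   */
    Vec4 n = { c.x/c.w, c.y/c.w, c.z/c.w, 1.0f }; /* Eq. 3: n = c/c_w */
    Vec4 w;                                       /* Eq. 4: V n = w   */
    w.x = (n.x + 1.0f) * 0.5f * vp_w;
    w.y = (1.0f - n.y) * 0.5f * vp_h;  /* +Y flips to point down  */
    w.z = (n.z + 1.0f) * 0.5f;         /* depth remapped to [0,1] */
    w.w = 1.0f;
    return w;
}
```

A point two units down the -Z axis, seen through an identity modelview and a simple perspective projection, lands in the middle of an 800x600 viewport, which is a handy sanity check for the whole forward pipeline.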
Our goal is to transform the mouse position in viewport coordinates
all the way back to world space. Since we’re not rendering a model, model
space and world space are the same.
That’s a lot of steps, and it’s easy to screw up, and if you screw up just
a little that’s enough to blow everything apart.
[2] Some less common window systems may place the origin at another
location, such as the bottom left of the window, so this isn't always true.
3.1 Viewport → NDC → Clip
The first step is to transform the viewport coordinates into clip coordi-
nates. The viewport transformation V takes a normalized device coor-
dinate n and transforms it into a viewport coordinate. Given viewport
width w and height h, our viewport coordinate v is:

V n = v = ( (n_x + 1)/2 · w,  (1 − n_y)/2 · h,  (n_z + 1)/2 )    (5)

Inverting this gives n back from a viewport coordinate:

n = V^-1 v = ( 2 v_x / w − 1,  1 − 2 v_y / h,  2 v_z − 1 )    (6)
Recall from Equation 2 that the projection matrix P takes view coordinates
to clip coordinates:

P v = c    (7)

To get back to view coordinates we need to invert P.
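Undoing the viewport mapping of Equation 5 is simple arithmetic. A minimal sketch, assuming the viewport dimensions are passed in as vp_w/vp_h and that depth is a [0, 1] value sampled from the depth buffer:

```c
#include <assert.h>
#include <math.h>

typedef struct { float x, y, z; } Vec3;

/* Map a window coordinate (mouse x, y plus a sampled depth) back to
 * normalized device coordinates by undoing the viewport scale and bias. */
static Vec3 window_to_ndc(float win_x, float win_y, float depth,
                          float vp_w, float vp_h) {
    Vec3 n;
    n.x = 2.0f * win_x / vp_w - 1.0f; /* undo ((n_x + 1)/2) * w          */
    n.y = 1.0f - 2.0f * win_y / vp_h; /* undo ((1 - n_y)/2) * h (Y flip) */
    n.z = 2.0f * depth - 1.0f;        /* undo (n_z + 1)/2                */
    return n;
}
```

Note the Y flip happens here too: a mouse y of 0 (top of the window) maps to n_y = +1.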
This is trickier than it sounds – we can avoid computing a true 4x4 matrix inverse if we just
construct the inverse projection matrix at the same time we build the pro-
jection matrix.
A typical OpenGL perspective projection matrix P takes the form:
        | a 0 0 0 |
P =     | 0 b 0 0 |    (8)
        | 0 0 c d |
        | 0 0 e 0 |
The specific coefficient values depend on the nature of the perspective
projection matrix (for more information I recommend you look at the man
pages for gluPerspective). These coefficients should scale and bias
v_x, v_y, and v_z into clip space while assigning −v_z to c_w.
To transform from view coordinates v to clip coordinates c:

P v = c = ( a v_x,  b v_y,  c v_z + d v_w,  e v_z )    (9)
So solving for v we get:

v = ( c_x / a,  c_y / b,  c_w / e,  c_z / d − (c · c_w)/(d · e) )    (10)

which is exactly what the inverse projection matrix must produce:

          | 1/a   0    0     0       |
P^-1 =    |  0   1/b   0     0       |    (11)
          |  0    0    0    1/e      |
          |  0    0   1/d  −c/(d·e)  |

P^-1 c = v    (12)
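Since we know where each coefficient of Equation 8 lands, both the projection and its inverse can be filled in directly, no general 4x4 inversion required. A sketch, again assuming row-major float[16] storage:

```c
#include <assert.h>
#include <math.h>

typedef struct { float x, y, z, w; } Vec4;

static Vec4 mat_mul_vec(const float m[16], Vec4 v) {
    Vec4 r;
    r.x = m[0]*v.x  + m[1]*v.y  + m[2]*v.z  + m[3]*v.w;
    r.y = m[4]*v.x  + m[5]*v.y  + m[6]*v.z  + m[7]*v.w;
    r.z = m[8]*v.x  + m[9]*v.y  + m[10]*v.z + m[11]*v.w;
    r.w = m[12]*v.x + m[13]*v.y + m[14]*v.z + m[15]*v.w;
    return r;
}

/* Equation 8: the perspective projection from its five coefficients. */
static void make_proj(float a, float b, float c, float d, float e,
                      float P[16]) {
    const float m[16] = { a,0,0,0,  0,b,0,0,  0,0,c,d,  0,0,e,0 };
    for (int i = 0; i < 16; ++i) P[i] = m[i];
}

/* The matching analytic inverse, built at the same time we build P. */
static void make_proj_inverse(float a, float b, float c, float d, float e,
                              float Pinv[16]) {
    const float m[16] = { 1.0f/a,0,0,0,  0,1.0f/b,0,0,
                          0,0,0,1.0f/e,  0,0,1.0f/d,-c/(d*e) };
    for (int i = 0; i < 16; ++i) Pinv[i] = m[i];
}
```

Applying P and then Pinv to any view-space point should reproduce it exactly, which makes this easy to unit test.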
There’s no guarantee that v_w will be 1, so we’ll want to rescale v
appropriately:

v' = v / v_w    (13)
Assuming the modelview matrix M is a rigid transformation (a rotation R
plus a translation t), its inverse can be built directly from the
transposed rotation and the rotated translation t':

R^T t = t'    (15)

          | R^T_11  R^T_12  R^T_13  −t'_x |
M^-1 =    | R^T_21  R^T_22  R^T_23  −t'_y |    (16)
          | R^T_31  R^T_32  R^T_33  −t'_z |
          |   0       0       0       1   |
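Equations 15 and 16 translate to a few lines of C. This sketch assumes the modelview really is rigid, a pure rotation plus translation (a glScale in the stack would break the transpose trick), and row-major float[16] storage:

```c
#include <assert.h>
#include <math.h>

typedef struct { float x, y, z, w; } Vec4;

static Vec4 mat_mul_vec(const float m[16], Vec4 v) {
    Vec4 r;
    r.x = m[0]*v.x  + m[1]*v.y  + m[2]*v.z  + m[3]*v.w;
    r.y = m[4]*v.x  + m[5]*v.y  + m[6]*v.z  + m[7]*v.w;
    r.z = m[8]*v.x  + m[9]*v.y  + m[10]*v.z + m[11]*v.w;
    r.w = m[12]*v.x + m[13]*v.y + m[14]*v.z + m[15]*v.w;
    return r;
}

/* Invert a rigid modelview: transpose the 3x3 rotation, then negate the
 * rotated translation t' = R^T t, per Equations 15 and 16. */
static void rigid_inverse(const float M[16], float Minv[16]) {
    for (int r = 0; r < 3; ++r)
        for (int c = 0; c < 3; ++c)
            Minv[r*4 + c] = M[c*4 + r];        /* R^T            */
    for (int r = 0; r < 3; ++r)                /* -t' = -(R^T t) */
        Minv[r*4 + 3] = -(M[0*4 + r] * M[3] +
                          M[1*4 + r] * M[7] +
                          M[2*4 + r] * M[11]);
    Minv[12] = Minv[13] = Minv[14] = 0.0f;
    Minv[15] = 1.0f;
}
```

Round-tripping a point through M and then Minv should give the point back, which is the property worth asserting in a debug build.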
If you’re specifying the modelview matrix directly, for example by us-
ing glLoadMatrix, then you already have it lying around and you can
build the inverse as described in Equations 15 and 16. If, on the other hand, the
modelview matrix is built dynamically using something like gluLookAt
or a sequence of glTranslate, glRotate, and glScale calls, you can
use glGetFloatv to retrieve the current modelview matrix.
Now that we have the inverse modelview matrix we can use it to trans-
form our view coordinate v into world space, giving us w.
M^-1 v = w    (17)
If the depth value under the mouse was used to construct the original
viewport coordinate, then w should correspond to the point in 3-space
where the user clicked. If the depth value was not read then we have
an arbitrary point in space with which we can construct a ray from the
viewer’s position, a:
r(t) = a + t(w − a)    (18)
However, there’s a trick we can use to skip Equation 18 altogether. Setting
v_w to 0 in Equation 17, right before the inverse modelview transformation,
removes any translation components. This means we’ll be taking
a ray in view coordinates and getting a ray back in world coordinates. Of
course this is only relevant if we’re trying to compute a pick ray instead of
back projecting an actual point in space.
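The whole v_w = 0 trick can be sketched as follows, assuming the inverse projection and inverse modelview matrices are already available (row-major float[16]). Because we only want a direction, the unknown c_w scale factor between NDC and clip space can be ignored:

```c
#include <assert.h>
#include <math.h>

typedef struct { float x, y, z, w; } Vec4;

static Vec4 mat_mul_vec(const float m[16], Vec4 v) {
    Vec4 r;
    r.x = m[0]*v.x  + m[1]*v.y  + m[2]*v.z  + m[3]*v.w;
    r.y = m[4]*v.x  + m[5]*v.y  + m[6]*v.z  + m[7]*v.w;
    r.z = m[8]*v.x  + m[9]*v.y  + m[10]*v.z + m[11]*v.w;
    r.w = m[12]*v.x + m[13]*v.y + m[14]*v.z + m[15]*v.w;
    return r;
}

/* Build a world-space pick-ray direction from a mouse position. The NDC
 * point is placed on the near plane (n_z = -1); after the inverse
 * projection we zero v_w so the inverse modelview applies only its
 * rotation, returning a direction rather than a point. */
static Vec4 pick_ray_dir(const float Pinv[16], const float Minv[16],
                         float mx, float my, float vp_w, float vp_h) {
    Vec4 n = { 2.0f*mx/vp_w - 1.0f, 1.0f - 2.0f*my/vp_h, -1.0f, 1.0f };
    Vec4 v = mat_mul_vec(Pinv, n); /* clip -> view; c_w scale is irrelevant */
    v.w = 0.0f;                    /* kill translation in the next step     */
    return mat_mul_vec(Minv, v);   /* view -> world direction               */
}
```

The returned direction is not unit length; normalize it before handing it to intersection code that expects a normalized ray.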
4 Picking
We should now have one of two things: either an actual point in world
space corresponding to the location of the mouse click, or a world space
ray representing the direction of the mouse click.
If we have an actual point we can search against all geometry in the
world and see which piece it’s closest to. If we have the ray, then we’ll need
to perform an intersection test between the ray and the world geometry
and find the geometry closest to the near Z clip plane. Either method
should be reasonably simple to implement.
5 Conclusion
Picking objects in a 3D scene using a mouse is a common task, but there
are very few papers that describe pragmatic approaches to accomplishing
this. Hopefully this paper helps someone trying to muddle through this
on their own, God knows I could have used it a few weeks ago.
6 Greets
A shout out and greets to my boys Casey Muratori and Nichola Vining
for helping me sort through this shit and hopefully not sounding like a
dumbass. Yo.