CS231A Course Notes 1: Camera Models
Kenji Hata and Silvio Savarese
1 Introduction
The camera is one of the most essential tools in computer vision. It is the
mechanism by which we can record the world around us and use its output -
photographs - for various applications. Therefore, one question we must ask
in introductory computer vision is: how do we model a camera?
2 Pinhole cameras
Let’s design a simple camera system – a system that can record an image
of an object or scene in the 3D world. This camera system can be designed
by placing a barrier with a small aperture between the 3D object and a
photographic film or sensor. As Figure 1 shows, each point on the 3D object
emits multiple rays of light outwards. Without a barrier in place, every point
on the film will be influenced by light rays emitted from every point on the
3D object. Due to the barrier, only one (or a few) of these rays of light passes
through the aperture and hits the film. Therefore, we can establish a one-
to-one mapping between points on the 3D object and the film. The result is
that the film gets exposed by an “image” of the 3D object by means of this
mapping. This simple model is known as the pinhole camera model.
Figure 2: The pinhole camera geometry, showing the pinhole O, the camera axes i, j, k, the focal length f, the image plane Π′, a 3D point P, its projection P′, and the projected pinhole C′.
A 3D point P is projected onto the image plane Π′, resulting in the point P′; similarly, the pinhole itself can be projected onto the image plane, giving a new point C′.
Here, we can define a coordinate system (i, j, k) centered at the pinhole O such that the axis k is perpendicular to the image plane and points toward it. This coordinate system is often known as the camera reference system or camera coordinate system. The line defined by C′ and O is called the optical axis of the camera system.
Recall that point P′ is derived from the projection of 3D point P on the image plane Π′. Therefore, if we derive the relationship between 3D point P and image plane point P′, we can understand how the 3D world imprints itself upon the image taken by a pinhole camera. Notice that triangle P′C′O is similar to the triangle formed by P, O and (0, 0, z). Therefore, using the law of similar triangles we find that:
$$P' = \begin{bmatrix} x' \\ y' \end{bmatrix} = \begin{bmatrix} f\,\frac{x}{z} \\[4pt] f\,\frac{y}{z} \end{bmatrix} \qquad (1)$$

¹Throughout the course notes, let the prime superscript (e.g. P′) indicate that this point is a projected or complementary point to the non-superscripted version. For example, P′ is the projected version of P.
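As a quick numerical illustration of Equation 1, here is a minimal sketch in Python/NumPy (the function name and sample values are ours, not part of the notes) that projects a camera-frame point onto the image plane of a pinhole camera with focal length f.

```python
import numpy as np

def pinhole_project(P, f):
    """Project a 3D point P = (x, y, z), given in camera coordinates,
    onto the image plane of a pinhole camera with focal length f (Eq. 1)."""
    x, y, z = P
    return np.array([f * x / z, f * y / z])

# Example: a point 2 units in front of the camera, f = 0.05 in the same units.
print(pinhole_project(np.array([0.4, 0.1, 2.0]), f=0.05))  # -> [0.01  0.0025]
```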
Notice that one large assumption we make in this pinhole model is that
the aperture is a single point. In most real world scenarios, however, we
cannot assume the aperture can be infinitely small. Thus, what is the effect
of varying aperture size?
Figure 3: The effects of aperture size on the image. As the aperture size
decreases, the image gets sharper, but darker.
As the aperture size increases, the number of light rays that pass
through the barrier consequently increases. With more light rays passing
through, each point on the film may be affected by light rays from
multiple points in 3D space, blurring the image. Although we may be in-
clined to try to make the aperture as small as possible, recall that a smaller
aperture size allows fewer light rays to pass through, resulting in crisper but
darker images. Therefore, we arrive at the fundamental problem presented by
the pinhole formulation: can we develop cameras that take crisp and bright
images?
Figure 4: A setup of a simple lens model. Notice how the rays of the top
point on the tree converge nicely on the film. However, a point at a different
distance away from the lens results in rays not converging perfectly on the
film.
A lens is able to focus multiple light rays emitted from a point in the scene onto a single point in the image plane. Therefore, the problem of most light rays being blocked by a small aperture is removed (Figure 4). However, please note that this property does not hold for all 3D points, but only for some specific point P. Take another point Q which is closer to or farther from the image plane than P. The corresponding projection into the image will be blurred or out of focus. Thus, lenses have a specific distance for which objects are “in focus”. This property is also related to a photography and computer graphics concept known as depth of field, which is the effective range at which cameras can take clear photos.
Figure 5: Lenses focus light rays parallel to the optical axis into the fo-
cal point. Furthermore, this setup illustrates the paraxial refraction model,
which helps us find the relationship between points in the image plane and
the 3D world in cameras with lenses.
Camera lenses have another interesting property: they focus all light rays
traveling parallel to the optical axis to one point known as the focal point
(Figure 5). The distance between the focal point and the center of the lens
is commonly referred to as the focal length f . Furthermore, light rays
passing through the center of the lens are not deviated. We thus can arrive
at a similar construction to the pinhole model that relates a point P in 3D
space with its corresponding point P′ in the image plane.
$$P' = \begin{bmatrix} x' \\ y' \end{bmatrix} = \begin{bmatrix} z'\,\frac{x}{z} \\[4pt] z'\,\frac{y}{z} \end{bmatrix} \qquad (2)$$
The derivation for this model is outside the scope of the class. However,
please notice that in the pinhole model z′ = f, while in this lens-based model,
z′ = f + z₀. Additionally, since this derivation takes advantage of the paraxial
or “thin lens” assumption, it is called the paraxial refraction model.
Because the paraxial refraction model approximates using the thin lens
assumption, a number of aberrations can occur. The most common one is
referred to as radial distortion, which causes the image magnification to
decrease or increase as a function of the distance to the optical axis. We
classify the radial distortion as pincushion distortion when the magnification increases and barrel distortion when the magnification decreases.
Radial distortion is caused by the fact that different portions of the lens have
differing focal lengths.
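The notes do not give an explicit formula for radial distortion, but a common low-order polynomial model (an assumption here, not something derived in the text) scales each image point by a factor that depends on its squared distance from the optical axis; a positive coefficient makes the magnification grow with radius (pincushion-like), while a negative one makes it shrink (barrel-like). A rough sketch:

```python
import numpy as np

def radial_distort(xy, k1, k2=0.0):
    """Apply a simple polynomial radial distortion model (an illustrative
    assumption, not derived in these notes): each normalized image point is
    scaled by (1 + k1*r^2 + k2*r^4), where r is its distance from the optical
    axis. k1 > 0 behaves like pincushion distortion, k1 < 0 like barrel."""
    xy = np.asarray(xy, dtype=float)
    r2 = np.sum(xy**2, axis=-1, keepdims=True)
    return xy * (1.0 + k1 * r2 + k2 * r2**2)

print(radial_distort([[0.5, 0.5]], k1=-0.2))  # barrel: point pulled toward the axis
```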
As discussed earlier, a point P in 3D space can be mapped (or projected)
into a 2D point P′ in the image plane Π′. This R³ → R² mapping is referred
to as a projective transformation. This projection of 3D points into the
image plane does not directly correspond to what we see in actual digital
images for several reasons. First, points in the digital images are, in general,
in a different reference system than those in the image plane. Second, digital
images are divided into discrete pixels, whereas points in the image plane are
continuous. Finally, the physical sensors can introduce non-linearity such as
distortion to the mapping. To account for these differences, we will introduce
a number of additional transformations that allow us to map any point from
the 3D world to pixel coordinates.
$$P' = \begin{bmatrix} x' \\ y' \end{bmatrix} = \begin{bmatrix} f k\,\frac{x}{z} + c_x \\[4pt] f l\,\frac{y}{z} + c_y \end{bmatrix} = \begin{bmatrix} \alpha\,\frac{x}{z} + c_x \\[4pt] \beta\,\frac{y}{z} + c_y \end{bmatrix} \qquad (4)$$
Is there a better way to represent this projection from P → P′? If this
projection is a linear transformation, then it can be represented as a product
of a matrix and the input vector (in this case, it would be P). However, from
Equation 4, we see that this projection P → P′ is not linear, as the operation
divides by one of the input parameters (namely z). Still, representing this
projection as a matrix-vector product would be useful for future derivations.
Therefore, can we represent our transformation as a matrix-vector product
despite its nonlinearity? Homogeneous coordinates are the solution.
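As a small illustration of the homogeneous-coordinate convention (the helper names below are ours, not from the notes): a Euclidean point gains an appended 1, and a homogeneous point is mapped back by dividing by its last coordinate.

```python
import numpy as np

def to_homogeneous(p):
    """Append a 1 to a Euclidean point to get its homogeneous version."""
    return np.append(p, 1.0)

def from_homogeneous(p_h):
    """Divide by the last coordinate to recover the Euclidean point."""
    return p_h[:-1] / p_h[-1]

P = np.array([1.0, 2.0, 4.0])                        # 3D point (x, y, z)
print(to_homogeneous(P))                             # [1. 2. 4. 1.]
print(from_homogeneous(np.array([8.0, 4.0, 2.0])))   # image point [4. 2.]
```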
From this point on, assume that we will work in homogeneous coordinates,
unless stated otherwise. We will drop the h index, so any point P or P′ can
be assumed to be in homogeneous coordinates. As seen from Equation 5,
we can represent the relationship between a point in 3D space and its image
coordinates by a matrix-vector relationship:
$$P' = \begin{bmatrix} x' \\ y' \\ z' \end{bmatrix} = \begin{bmatrix} \alpha & 0 & c_x & 0 \\ 0 & \beta & c_y & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} x \\ y \\ z \\ 1 \end{bmatrix} = \begin{bmatrix} \alpha & 0 & c_x & 0 \\ 0 & \beta & c_y & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix} P = MP \qquad (6)$$
We can decompose this transformation a bit further into
$$P' = MP = \begin{bmatrix} \alpha & 0 & c_x \\ 0 & \beta & c_y \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} I & 0 \end{bmatrix} P = K \begin{bmatrix} I & 0 \end{bmatrix} P \qquad (7)$$
Most methods that we introduce in this class ignore distortion effects; therefore, our camera matrix K has 5 degrees of freedom: 2 for focal length, 2 for offset, and 1 for skewness. These parameters are collectively known as the intrinsic parameters, as they are unique and inherent to a given camera and relate to essential properties of the camera, such as its manufacturing.
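As a hedged sketch of Equations 6–7 in Python/NumPy (function names and sample intrinsic values are illustrative, not from the notes): build K, form M = K[I 0], and project a homogeneous camera-frame point. The skew term, the fifth intrinsic degree of freedom, would occupy the (0, 1) entry of K; Equation 7 writes the zero-skew case.

```python
import numpy as np

def intrinsic_matrix(alpha, beta, cx, cy, skew=0.0):
    """Build the camera matrix K of Equation 7. The skew term (the fifth
    intrinsic degree of freedom) sits in the (0, 1) entry; it is zero in the
    form written in Equation 7."""
    return np.array([[alpha, skew, cx],
                     [0.0,   beta, cy],
                     [0.0,   0.0,  1.0]])

def project(K, P_h):
    """Project a homogeneous camera-frame point with M = K [I 0] (Eq. 7)."""
    M = K @ np.hstack([np.eye(3), np.zeros((3, 1))])
    p = M @ P_h
    return p[:2] / p[2]

K = intrinsic_matrix(alpha=800, beta=800, cx=320, cy=240)
print(project(K, np.array([0.5, 0.25, 2.0, 1.0])))  # -> [520. 340.]
```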
Substituting this in equation (7) and simplifying gives
$$P' = K \begin{bmatrix} R & T \end{bmatrix} P_w = M P_w \qquad (10)$$
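Continuing the sketch, Equation 10 composes the intrinsics with the extrinsics. A minimal example follows (illustrative values; taking R = I and a translation along the optical axis is an assumption made only for the example):

```python
import numpy as np

def projection_matrix(K, R, T):
    """Form M = K [R T] of Equation 10, mapping homogeneous world points
    to homogeneous image points."""
    return K @ np.hstack([R, T.reshape(3, 1)])

def project_world_point(M, Pw):
    """Project a Euclidean 3D world point and dehomogenize the result."""
    p = M @ np.append(Pw, 1.0)
    return p[:2] / p[2]

K = np.array([[800, 0, 320], [0, 800, 240], [0, 0, 1.0]])
R = np.eye(3)                    # camera axes aligned with the world frame
T = np.array([0.0, 0.0, 2.0])    # world origin 2 units in front of the camera
M = projection_matrix(K, R, T)
print(project_world_point(M, np.array([0.0, 0.0, 0.0])))  # -> [320. 240.]
```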
5 Camera Calibration
Precisely knowing the transformation from the real, 3D world into digital
images requires prior knowledge of many of the camera's intrinsic parameters.
If given an arbitrary camera, we may or may not have access to these
parameters. We do, however, have access to the images the camera takes.
Therefore, can we find a way to deduce them from images? This problem of
estimating the extrinsic and intrinsic camera parameters is known as camera
calibration.
A calibration rig usually consists of a simple pattern (e.g. a checkerboard) with known dimensions. Furthermore, the rig defines our world reference frame with origin O_w and axes i_w, j_w, k_w. From the rig's known pattern, we have known points in the world reference frame P_1, ..., P_n. Finding these points in the image we take from the camera gives corresponding points in the image p_1, ..., p_n.
We set up a linear system of equations from n correspondences such
that for each correspondence P_i, p_i and camera matrix M whose rows are
m_1, m_2, m_3:
$$p_i = \begin{bmatrix} u_i \\ v_i \end{bmatrix} = M P_i = \begin{bmatrix} \dfrac{m_1 P_i}{m_3 P_i} \\[8pt] \dfrac{m_2 P_i}{m_3 P_i} \end{bmatrix} \qquad (11)$$
As we see from the above equation, each correspondence gives us two
equations and, consequently, two constraints for solving the unknown pa-
rameters contained in m. From before, we know that the camera matrix
has 11 unknown parameters. This means that we need at least 6 correspon-
dences to solve this. However, in the real world, we often use more, as our
measurements are often noisy. To explicitly see this, we can derive a pair of
equations that relate ui and vi with Pi .
ui (m3 Pi ) − m1 Pi = 0
vi (m3 Pi ) − m2 Pi = 0
Given n of these corresponding points, the entire linear system of equa-
tions becomes
u1 (m3 P1 )−m1 P1 = 0
v1 (m3 P1 )−m2 P1 = 0
..
.
un (m3 Pn )−m1 Pn = 0
vn (m3 Pn )−m2 Pn = 0
If there were some other m that were a nonzero solution, then km would also
be a solution for all k ∈ R. Therefore, to constrain our solution, we complete
the following minimization:
$$\begin{aligned} \underset{m}{\text{minimize}} \quad & \|Pm\|^2 \\ \text{subject to} \quad & \|m\|^2 = 1 \end{aligned} \qquad (13)$$
To solve this minimization problem, we simply use singular value decomposition.
If we let P = UDVᵀ, then the solution to the above minimization is
to set m equal to the last column of V. The derivation for this solution is
outside the scope of this class; you may refer to Section 5.3 of Hartley &
Zisserman (pages 592–593) for more details.
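A compact sketch of this calibration procedure in Python/NumPy, assuming a non-degenerate point configuration (the function name `calibrate_dlt` is ours): stack the two constraints per correspondence into a 2n × 12 matrix and take the right singular vector associated with the smallest singular value.

```python
import numpy as np

def calibrate_dlt(Pw, puv):
    """Estimate the 3x4 camera matrix M (up to scale) from n >= 6
    correspondences. Pw: (n, 3) world points, puv: (n, 2) image points."""
    rows = []
    for (X, Y, Z), (u, v) in zip(Pw, puv):
        Pi = np.array([X, Y, Z, 1.0])
        rows.append(np.hstack([-Pi, np.zeros(4), u * Pi]))  # u(m3 Pi) - m1 Pi = 0
        rows.append(np.hstack([np.zeros(4), -Pi, v * Pi]))  # v(m3 Pi) - m2 Pi = 0
    A = np.vstack(rows)                  # shape (2n, 12)
    _, _, Vt = np.linalg.svd(A)
    m = Vt[-1]                           # unit-norm minimizer of ||A m||
    return m.reshape(3, 4)
```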
After reformatting the vector m into the matrix M , we now want to
explicitly solve for the extrinsic and intrinsic parameters. Our SVD-solved M
is known only up to scale, which means that the true values of the camera
matrix are some scalar multiple of M:
$$\rho M = \begin{bmatrix} \alpha r_1^T - \alpha \cot\theta\, r_2^T + c_x r_3^T & \alpha t_x - \alpha \cot\theta\, t_y + c_x t_z \\[6pt] \dfrac{\beta}{\sin\theta}\, r_2^T + c_y r_3^T & \dfrac{\beta}{\sin\theta}\, t_y + c_y t_z \\[6pt] r_3^T & t_z \end{bmatrix} \qquad (14)$$
Here, r_1^T, r_2^T, and r_3^T are the three rows of R. Dividing by the scaling parameter gives
$$M = \frac{1}{\rho}\begin{bmatrix} \alpha r_1^T - \alpha \cot\theta\, r_2^T + c_x r_3^T & \alpha t_x - \alpha \cot\theta\, t_y + c_x t_z \\[6pt] \dfrac{\beta}{\sin\theta}\, r_2^T + c_y r_3^T & \dfrac{\beta}{\sin\theta}\, t_y + c_y t_z \\[6pt] r_3^T & t_z \end{bmatrix} = \begin{bmatrix} A & b \end{bmatrix} = \begin{bmatrix} a_1^T & b_1 \\ a_2^T & b_2 \\ a_3^T & b_3 \end{bmatrix}$$
Solving for the intrinsics gives
$$\begin{aligned} \rho &= \pm\frac{1}{\|a_3\|} \\ c_x &= \rho^2 (a_1 \cdot a_3) \\ c_y &= \rho^2 (a_2 \cdot a_3) \\ \theta &= \cos^{-1}\!\left( -\frac{(a_1 \times a_3)\cdot(a_2 \times a_3)}{\|a_1 \times a_3\|\cdot\|a_2 \times a_3\|} \right) \\ \alpha &= \rho^2 \|a_1 \times a_3\| \sin\theta \\ \beta &= \rho^2 \|a_2 \times a_3\| \sin\theta \end{aligned} \qquad (15)$$
The extrinsics are
$$\begin{aligned} r_1 &= \frac{a_2 \times a_3}{\|a_2 \times a_3\|} \\ r_2 &= r_3 \times r_1 \\ r_3 &= \rho a_3 \\ T &= \rho K^{-1} b \end{aligned} \qquad (16)$$
We leave the derivations as a class exercise or you can refer to Section 1.3.1
of the Forsyth & Ponce textbook.
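The following sketch implements Equations 15 and 16 in Python/NumPy (our code, not the textbook's; the positive sign of ρ is chosen arbitrarily here, whereas in practice it is picked so the scene lies in front of the camera, i.e. t_z > 0):

```python
import numpy as np

def decompose_projection(M):
    """Recover intrinsics and extrinsics from a 3x4 camera matrix M, known
    up to scale, following Equations 15 and 16. M = [A | b] with rows a1..a3."""
    A, b = M[:, :3], M[:, 3]
    a1, a2, a3 = A
    rho = 1.0 / np.linalg.norm(a3)        # Eq. 15 allows either sign of rho
    cx = rho**2 * a1.dot(a3)
    cy = rho**2 * a2.dot(a3)
    c13, c23 = np.cross(a1, a3), np.cross(a2, a3)
    theta = np.arccos(-c13.dot(c23) / (np.linalg.norm(c13) * np.linalg.norm(c23)))
    alpha = rho**2 * np.linalg.norm(c13) * np.sin(theta)
    beta = rho**2 * np.linalg.norm(c23) * np.sin(theta)
    # Extrinsics (Eq. 16)
    r1 = c23 / np.linalg.norm(c23)
    r3 = rho * a3
    r2 = np.cross(r3, r1)
    R = np.vstack([r1, r2, r3])
    # Intrinsic matrix with skew, following the structure of Equation 14.
    K = np.array([[alpha, -alpha / np.tan(theta), cx],
                  [0.0,    beta / np.sin(theta),  cy],
                  [0.0,    0.0,                   1.0]])
    T = rho * np.linalg.solve(K, b)       # T = rho * K^{-1} b
    return K, R, T
```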
With the calibration procedure complete, we warn against degenerate
cases. Not all sets of n correspondences will work. For example, if the
points P_i lie on the same plane, then the system cannot be solved. These
unsolvable configurations of points are known as degenerate configurations.
More generally, degenerate configurations have points that lie on the
intersection curve of two quadric surfaces. Although this is outside the
scope of the class, you can find more information in Section 1.3 of the
Forsyth & Ponce textbook.
$$u_i\, q_3 P_i = q_1 P_i$$
$$v_i\, q_3 P_i = q_2 P_i$$
This system, however, is no longer linear, and we require the use of non-
linear optimization techniques, which are covered in Section 22.2 of Forsyth
& Ponce. We can simplify the nonlinear optimization of the calibration prob-
lem if we make certain assumptions. In radial distortion, we note that the
ratio between two coordinates ui and vi is not affected. We can compute this
ratio as
$$\frac{u_i}{v_i} = \frac{m_1 P_i / m_3 P_i}{m_2 P_i / m_3 P_i} = \frac{m_1 P_i}{m_2 P_i} \qquad (18)$$
From Equation 18, each correspondence gives one constraint in the following system
of linear equations:
$$\begin{aligned} v_1 (m_1 P_1) - u_1 (m_2 P_1) &= 0 \\ &\;\;\vdots \\ v_n (m_1 P_n) - u_n (m_2 P_n) &= 0 \end{aligned}$$
Similar to before, this gives a matrix-vector product that we can solve via SVD:
$$Ln = \begin{bmatrix} v_1 P_1^T & -u_1 P_1^T \\ \vdots & \vdots \\ v_n P_n^T & -u_n P_n^T \end{bmatrix} \begin{bmatrix} m_1^T \\ m_2^T \end{bmatrix} \qquad (19)$$
Once m_1 and m_2 are estimated, m_3 can be expressed as a nonlinear function
of m_1, m_2, and λ. This requires solving a nonlinear optimization problem
that is much simpler than the original one.
$$R_y(\beta) = \begin{bmatrix} \cos\beta & 0 & \sin\beta \\ 0 & 1 & 0 \\ -\sin\beta & 0 & \cos\beta \end{bmatrix}$$
$$R_z(\gamma) = \begin{bmatrix} \cos\gamma & -\sin\gamma & 0 \\ \sin\gamma & \cos\gamma & 0 \\ 0 & 0 & 1 \end{bmatrix}$$
Due to the convention of matrix multiplication, the rotation achieved by
first rotating around the z-axis, then y-axis, then x-axis is given by the matrix
product Rx Ry Rz .
Translations, or displacements, are used to describe the movement in a
certain direction. In 3D space, we define a translation vector t with 3 values:
the displacements in each of the 3 axes, often denoted as tx , ty , tz . Thus,
given some point P which is translated to some other point P 0 by t, we can
write it as:
$$P' = P + t = \begin{bmatrix} P_x + t_x \\ P_y + t_y \\ P_z + t_z \end{bmatrix}$$
In matrix form, translations can be written using homogeneous coordi-
nates. If we construct a translation matrix as
$$T = \begin{bmatrix} 1 & 0 & 0 & t_x \\ 0 & 1 & 0 & t_y \\ 0 & 0 & 1 & t_z \\ 0 & 0 & 0 & 1 \end{bmatrix}$$
then we see that P′ = T P is equivalent to P′ = P + t.
If we want to combine translation with our rotation matrix multiplication,
we can again use homogeneous coordinates to our advantage. If we want to
rotate a vector v by R and then translate it by t, we can write the resulting
vector v′ as:
$$\begin{bmatrix} v' \\ 1 \end{bmatrix} = \begin{bmatrix} R & t \\ 0 & 1 \end{bmatrix} \begin{bmatrix} v \\ 1 \end{bmatrix}$$
Finally, if we want to scale the vector in certain directions by some amount
Sx , Sy , Sz , we can construct a scaling matrix
$$S = \begin{bmatrix} S_x & 0 & 0 \\ 0 & S_y & 0 \\ 0 & 0 & S_z \end{bmatrix}$$
Therefore, if we want to scale a vector, then rotate, then translate, our
final transformation matrix would be:
$$T = \begin{bmatrix} RS & t \\ 0 & 1 \end{bmatrix}$$
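A small sketch that assembles this scale-then-rotate-then-translate matrix and applies it to a homogeneous point (the scale, rotation, and translation values below are made up for illustration):

```python
import numpy as np

def compose_srt(S_diag, R, t):
    """Build the 4x4 homogeneous matrix that scales, then rotates, then
    translates: T = [[R S, t], [0, 1]]."""
    S = np.diag(S_diag)
    T = np.eye(4)
    T[:3, :3] = R @ S
    T[:3, 3] = t
    return T

R = np.array([[0, -1, 0], [1, 0, 0], [0, 0, 1.0]])   # 90-degree turn about z
T = compose_srt([2.0, 2.0, 2.0], R, t=np.array([1.0, 0.0, 0.0]))
v = np.array([1.0, 0.0, 0.0, 1.0])                    # homogeneous point
# Scale -> (2,0,0), rotate -> (0,2,0), translate -> (1,2,0):
print(T @ v)                                          # [1. 2. 0. 1.]
```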
Note that all of these types of transformations are examples of affine
transformations. Recall that projective transformations occur when the final
row of T is not $\begin{bmatrix} 0 & 0 & 0 & 1 \end{bmatrix}$.
Figure 9: The weak perspective model: projection onto the image plane
As we see, the last row of M is $\begin{bmatrix} 0 & 0 & 0 & 1 \end{bmatrix}$ in the weak perspective model,
compared to $\begin{bmatrix} v & 1 \end{bmatrix}$ in the normal camera model. We do not prove this result
and leave it to you as an exercise. The simplification is clearly demonstrated
when mapping the 3D points to the image plane.
$$P' = MP = \begin{bmatrix} m_1 \\ m_2 \\ m_3 \end{bmatrix} P = \begin{bmatrix} m_1 P \\ m_2 P \\ 1 \end{bmatrix} \qquad (20)$$
Thus, we see that the image plane point ultimately becomes a magnification
of the original 3D point, irrespective of depth. The nonlinearity of the projec-
tive transformation disappears, making the weak perspective transformation
a mere magnifier.
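A tiny illustration of Equation 20 (the magnification and offset values are made up): because the last row of the weak perspective M is [0 0 0 1], the projected coordinates come out of a single affine map with no division by depth.

```python
import numpy as np

# A weak perspective camera matrix: the last row is [0 0 0 1], so projection
# is affine and no per-point division by the depth z is needed (Equation 20).
M_weak = np.array([[100.0, 0.0, 0.0, 320.0],
                   [0.0, 100.0, 0.0, 240.0],
                   [0.0, 0.0, 0.0, 1.0]])

P = np.array([0.1, -0.2, 5.0, 1.0])   # homogeneous world point
p = M_weak @ P
print(p[:2] / p[2])                    # depth never enters: [330. 220.]
```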
In the orthographic projection model, light rays are perpendicular to the retinal plane. As a result, this model ignores depth altogether. Therefore,
$$x' = x$$
$$y' = y$$
Orthographic projection models are often used for architecture and industrial
design.
Overall, weak perspective models result in much simpler math, at the
cost of being somewhat imprecise. However, they often yield results that are
very accurate when the object is small and distant from the camera.