

CSE463
Computer Vision: Fundamentals and Applications
Lecture 2
Image Formation and Filters

Geometry of Image Formation


The geometry of image formation studies the process by which 3D objects in the world are
captured and represented on a 2D image plane. This field is foundational in understanding how
cameras perceive depth, scale, and spatial relationships in a scene.

Perspective Projection

In perspective projection, objects appear smaller as they move farther from the camera, and lines that are parallel in the 3D world converge in the 2D image, typically towards a "vanishing point." This principle explains why nearby objects appear large while distant objects appear small, and it is crucial for a realistic representation of depth in images.
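As a quick numeric illustration (using the standard pinhole relation rather than a formula from the note itself): a point at depth Z projects to image coordinate x = f * X / Z, where f is the focal length. Doubling the depth to 2Z halves the projected coordinate, which is exactly why an object twice as far away appears half as large.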

Camera Image Formation



Camera image formation refers to the process of capturing a 3D scene from the world and
projecting it onto a 2D image plane, which is a critical aspect of understanding how images are
formed in computer vision and photogrammetry. This process involves several physical
principles and geometrical concepts that ensure a 3D scene is represented correctly on a 2D
plane.

Pinhole Camera Model

The simplest model of image formation is the pinhole camera model, which provides a
conceptual framework for understanding how light from a scene is captured through a small
aperture (the "pinhole") and projected onto an image plane (the camera sensor).

Pinhole Camera Model Components:

● Scene (3D world): A real-world scene consisting of objects in three-dimensional space.


● Camera: The device that captures the scene, consisting of a lens and an image plane.
● Pinhole: A small aperture through which light passes.
● Image plane: A 2D surface (typically a digital sensor or film) where the scene is
projected.

How It Works:

1. Light rays from the 3D objects in the scene pass through the pinhole and hit the image
plane.
2. Each light ray corresponds to a specific point in the scene and is projected onto a point
on the image plane.
3. The resulting image on the image plane is inverted: objects higher in the scene appear lower on the image plane, and objects on the scene's left appear on the image's right.
4. The size of the image depends on the distance between the scene, the pinhole, and the
image plane.

The pinhole camera model is a simple approximation, but it provides the basis for more
sophisticated camera models that include lens effects like distortion and focus.
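To make the geometry concrete, here is a minimal sketch of the ideal pinhole projection x = f * X / Z, y = f * Y / Z (my own illustration; the function name and values are invented for the example):

```python
import numpy as np

def pinhole_project(point_3d, focal_length):
    """Project a 3D point (X, Y, Z) through an ideal pinhole at the
    origin, looking down the +Z axis, onto the image plane."""
    X, Y, Z = point_3d
    if Z <= 0:
        raise ValueError("point must be in front of the camera (Z > 0)")
    x = focal_length * X / Z   # image coordinates shrink as depth Z grows
    y = focal_length * Y / Z
    return np.array([x, y])

# The same object twice as far away projects to half the size:
print(pinhole_project((1.0, 2.0, 4.0), focal_length=0.05))  # [0.0125  0.025 ]
print(pinhole_project((1.0, 2.0, 8.0), focal_length=0.05))  # [0.00625 0.0125]
```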

Camera Calibration Parameters

For accurate image formation and interpretation, a camera's internal and external properties
must be understood. These properties are captured in the intrinsic and extrinsic parameters of
the camera.

Intrinsic Parameters (Camera Intrinsics):

These are the internal properties of the camera that affect how it captures the scene.

● Focal Length: The distance between the camera's lens and the image plane. It
determines the magnification and the field of view (FOV).
● Principal Point: The point on the image plane where the optical axis intersects (usually
near the center of the image).
● Pixel Aspect Ratio: The ratio of the width to the height of a pixel in the camera sensor.
This parameter is used to account for non-square pixels.
● Skew: A measure of non-orthogonality of the image axes (often assumed to be zero in
most cameras).

These parameters are typically represented in a camera matrix K, which is used to transform 3D coordinates into 2D image coordinates.
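As an illustration, here is a minimal sketch of building K (a sketch only; the parameter values are made up, not calibration data from the note):

```python
import numpy as np

def intrinsic_matrix(fx, fy, cx, cy, skew=0.0):
    """Build the 3x3 intrinsic matrix K.
    fx, fy : focal length in pixel units along x and y
             (different when pixels are non-square)
    cx, cy : principal point, where the optical axis meets the image plane
    skew   : non-orthogonality of the image axes, usually 0
    """
    return np.array([[fx,  skew, cx],
                     [0.0, fy,   cy],
                     [0.0, 0.0,  1.0]])

K = intrinsic_matrix(fx=800, fy=800, cx=320, cy=240)
```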

Extrinsic Parameters (Camera Extrinsics):

These parameters describe the position and orientation of the camera in the world.

● Rotation Matrix (R): A 3x3 matrix that describes the camera’s orientation in 3D space.
● Translation Vector (T): A 3x1 vector that describes the camera’s position in 3D space
relative to the world coordinate system.

Extrinsic parameters define how the camera is positioned relative to the world and are critical for
reconstructing 3D scenes from images.

Projection Models

The projection from 3D space onto 2D space can be modeled using different projection
techniques, such as perspective projection and orthographic projection.

Perspective Projection:

● The most common model for real-world cameras; it is responsible for objects appearing smaller as they get farther from the camera, and for parallel lines converging at a vanishing point.
● In perspective projection, light rays converge towards a single point (the camera's focal
point or pinhole).
● The transformation from 3D world coordinates (X, Y, Z) to 2D image coordinates (x,y) is
a nonlinear operation and involves both intrinsic and extrinsic parameters.

The mathematical formulation for perspective projection (in homogeneous coordinates, with scale factor s) is as follows:

s [x, y, 1]^T = K [R | T] [X, Y, Z, 1]^T

Where:

● [R | T] is the extrinsic matrix (rotation and translation),
● K is the intrinsic camera matrix,
● (X, Y, Z) are the 3D coordinates of a point in the world,
● (x, y) are the corresponding 2D image coordinates.
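A compact sketch of this full pipeline (again with illustrative, made-up values for K, R, and T):

```python
import numpy as np

def project_point(K, R, T, point_world):
    """Perspective projection: s*[x, y, 1]^T = K [R|T] [X, Y, Z, 1]^T."""
    X_h = np.append(point_world, 1.0)      # homogeneous 3D point
    RT = np.hstack([R, T.reshape(3, 1)])   # 3x4 extrinsic matrix [R | T]
    p = K @ RT @ X_h                       # 3-vector s*[x, y, 1]
    return p[:2] / p[2]                    # divide out the scale s

K = np.array([[800, 0, 320],
              [0, 800, 240],
              [0,   0,   1]], dtype=float)
R = np.eye(3)       # camera aligned with the world axes
T = np.zeros(3)     # camera at the world origin
print(project_point(K, R, T, np.array([0.5, 0.25, 2.0])))  # [520. 340.]
```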

Orthographic Projection:

● Assumes parallel projection where objects appear the same size regardless of their
distance from the camera.
● It is often used for technical drawings or engineering applications but not for real-world
photography, as it doesn’t capture depth perception.

In orthographic projection, the transformation from 3D world coordinates to 2D coordinates is linear, and the depth dimension is simply dropped: (x, y) = (X, Y).
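A one-line contrast with the pinhole sketch above (my own illustration): orthographic projection just drops Z, while perspective divides by it.

```python
import numpy as np

def orthographic_project(point_3d):
    """Orthographic projection: keep (X, Y), ignore the depth Z."""
    return np.asarray(point_3d, dtype=float)[:2]

# Unlike the pinhole example, moving the point farther away
# (increasing Z) does not change its projected position:
print(orthographic_project([1.0, 2.0, 4.0]))  # [1. 2.]
print(orthographic_project([1.0, 2.0, 8.0]))  # [1. 2.]
```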

Image Formation:

● Image formation models describe the physics of how images are formed on the camera sensor.
● Light and Aperture: Light enters the camera through the aperture, which controls the amount of light hitting the sensor. The aperture and lens focus the incoming light, creating an image on the sensor.
● Focal Length and Depth of Field: The focal length determines the magnification of the image, while the depth of field affects the range of distances at which objects appear sharply in focus. Adjusting these parameters changes the scene's perspective and focus.

Image Filtering (2D Convolution)


Image filtering is a technique applied to images to enhance or preprocess them for analysis by modifying pixel values in systematic ways. Depending on the filter type and parameters, filters can remove noise, enhance edges, or sharpen an image. Filters are also known as kernels and typically have odd square shapes such as 1x1, 3x3, 5x5, 7x7, and so on.

Types of Filters:

Linear Filters:

● Gaussian Filter: A smoothing filter used to reduce noise by averaging pixel values in a
local region, creating a blurring effect. It’s widely used as a preprocessing step in
computer vision tasks.
● Box Filter: An averaging filter that replaces each pixel with the mean of its surrounding pixels, which causes blurring. A 3x3 box filter is given as follows:

1/9 ×  1 1 1
       1 1 1
       1 1 1

● Sobel Filter: An edge-detection filter that calculates the gradient of image intensity, highlighting regions with rapid intensity change, which correspond to edges (the standard kernels are shown below).
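For reference, the standard 3x3 Sobel kernels for horizontal (Gx) and vertical (Gy) gradients are as follows (standard textbook values, stated here for completeness rather than taken from the note):

Gx =  -1  0  1        Gy =  -1 -2 -1
      -2  0  2               0  0  0
      -1  0  1               1  2  1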

Non-Linear Filters:

● Median Filter: A noise-reduction filter that replaces each pixel with the median value of its neighboring pixels. It is effective at removing "salt-and-pepper" noise without blurring edges (see the sketch after this list).
● Bilateral Filter: This filter smooths the image while preserving edges by combining both spatial and intensity information, making it useful for preserving details in high-frequency areas.
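To show how a non-linear filter differs from convolution, here is a minimal median filter sketch (my own illustration; production code would typically use a library routine such as scipy.ndimage.median_filter):

```python
import numpy as np

def median_filter(image, k=3):
    """Replace each pixel with the median of its k x k neighborhood.
    Borders are handled by edge-padding; no weighted sum is involved,
    which is what makes this filter non-linear."""
    pad = k // 2
    padded = np.pad(image, pad, mode="edge")
    out = np.empty_like(image)
    H, W = image.shape
    for i in range(H):
        for j in range(W):
            out[i, j] = np.median(padded[i:i + k, j:j + k])
    return out

noisy = np.array([[10, 10, 10],
                  [10, 255, 10],   # a single "salt" pixel
                  [10, 10, 10]])
print(median_filter(noisy))        # the outlier is gone: all entries are 10
```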

Sliding of a Filter/Kernel Example

Eg 1 for a 3x3 kernel, with explanation:


https://www.songho.ca/dsp/convolution/convolution2d_example.html

Eg 2: https://www.youtube.com/watch?v=yb2tPt0QVPY

Eg 3:
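Below is a minimal NumPy sketch of the sliding operation (my own illustration, independent of the two linked examples):

```python
import numpy as np

def convolve2d(image, kernel, stride=1):
    """Slide `kernel` over `image` with no padding ("valid" mode),
    computing the weighted sum of each covered patch."""
    H, W = image.shape
    Kh, Kw = kernel.shape
    out_h = (H - Kh) // stride + 1
    out_w = (W - Kw) // stride + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = image[i*stride:i*stride + Kh, j*stride:j*stride + Kw]
            out[i, j] = np.sum(patch * kernel)
    return out

img = np.arange(1, 26, dtype=float).reshape(5, 5)
box = np.ones((3, 3)) / 9.0
print(convolve2d(img, box).shape)  # (3, 3)
```

Strictly speaking, this computes cross-correlation (the kernel is not flipped); for symmetric kernels such as the box and Gaussian filters the two operations coincide, and the worked example later in this note uses the same convention.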

Applications of Filters:

Filters are widely used in image processing tasks such as:

1. noise reduction (Gaussian, Median filters)


2. edge detection (Sobel filter)
3. image sharpening or smoothing

Filtering helps prepare images for higher-level tasks by enhancing specific features or reducing
irrelevant data.

Exercises
1. What is the pinhole camera model, and how does it explain the projection of a 3D world
onto a 2D image plane?
2. Describe the difference between intrinsic and extrinsic camera parameters. Why are
both necessary for accurate image projection?
3. What is perspective projection, and how does it affect the appearance of objects as they
move farther from the camera?
4. Compare and contrast orthographic projection with perspective projection. In what
scenarios might each be preferred?
5. In terms of image formation, explain how light enters through the aperture and is focused
onto the image sensor. What role does the lens play in this process?
6. What is the purpose of using a Gaussian filter in image processing? How does it work to
reduce noise in an image?
7. Explain the difference between linear and non-linear filters, and provide examples of
each. How do these filters affect images?
8. Write down the matrix representation of a 5x5 box filter and apply it to the image given below.


9.

10. The image on the left is a noisy image. What filter can be used to restore it to its original form?

Image Filtering: Mathematical Examples

Output Size Calculation

The output size (H_out, W_out) for a convolution is given by:

H_out = floor((H_in - K_h) / S) + 1
W_out = floor((W_in - K_w) / S) + 1

Input Size (H_in, W_in): The height and width of the original input image (before filtering).

Filter Size (K_h, K_w): The height and width of the filter (kernel) being applied to the image.

Stride (S): The number of pixels the filter moves (or "slides") horizontally or vertically in each step.

Assuming an image size of 10x10 and a filter size of 3x3 with a stride of 2:

Output Size = ((10 - 3) / 2) + 1 = 4.5

Since we can't have fractional pixels, we floor the value: Output Size = 4 x 4.
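The same calculation as a tiny helper (a sketch; the parameter names follow the definitions above):

```python
import math

def conv_output_size(h_in, w_in, k_h, k_w, stride=1, padding=0):
    """Output height and width of a convolution: floor((N - F + 2P) / S) + 1."""
    h_out = math.floor((h_in - k_h + 2 * padding) / stride) + 1
    w_out = math.floor((w_in - k_w + 2 * padding) / stride) + 1
    return h_out, w_out

print(conv_output_size(10, 10, 3, 3, stride=2))  # (4, 4)
print(conv_output_size(5, 5, 3, 3, stride=1))    # (3, 3)
```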

Gaussian Filter

● The Gaussian kernel is defined mathematically as:

G(x, y) = (1 / (2πσ²)) * exp(-(x² + y²) / (2σ²))

➔ σ is the standard deviation controlling the extent of smoothing.

➔ For example, with σ = 1, a 3x3 kernel might look like the following (the common integer approximation, normalized by 1/16, which is also the kernel used in the worked example below):

1/16 ×  1 2 1
        2 4 2
        1 2 1
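A sketch of building a normalized Gaussian kernel directly from the formula above (my own helper; note that with size=3 and sigma=1 the sampled values are close to, but not exactly, the 1/16 integer approximation):

```python
import numpy as np

def gaussian_kernel(size=3, sigma=1.0):
    """Sample G(x, y) on a size x size grid centered at (0, 0),
    then normalize so the weights sum to 1."""
    half = size // 2
    ax = np.arange(-half, half + 1)
    xx, yy = np.meshgrid(ax, ax)
    kernel = np.exp(-(xx**2 + yy**2) / (2 * sigma**2))
    return kernel / kernel.sum()   # normalization absorbs the 1/(2*pi*sigma^2) factor

print(np.round(gaussian_kernel(3, 1.0), 3))
# [[0.075 0.124 0.075]
#  [0.124 0.204 0.124]
#  [0.075 0.124 0.075]]
```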

● Imagine you have an image of size 5x5 and a filter of size 3x3 with a stride of 1.

Sample Image Matrix (the 5x5 matrix consistent with the worked arithmetic below):

 1  2  3  4  5
 6  7  8  9 10
11 12 13 14 15
16 17 18 19 20
21 22 23 24 25

Gaussian Kernel:

1/16 ×  1 2 1
        2 4 2
        1 2 1

Output Image Size:

Simplified formula for a square image: ((N - F + 2P)/S) + 1, where N = 5 (input), F = 3 (filter), P = 0 (padding), S = 1 (stride).

So, Output size = ((5 - 3 + 0)/1) + 1 = 3, i.e., the output is 3x3.

Steps for Each Position:

We slide the kernel across the image, calculate the weighted sum for each 3×3 patch, and
normalize the result by dividing by 16.

(a) For Position (0, 0):

● Extract the sub-image:

 1  2  3
 6  7  8
11 12 13

● Perform element-wise multiplication with the kernel weights, sum, and normalize:

(1/16) * (1*1 + 2*2 + 3*1 + 6*2 + 7*4 + 8*2 + 11*1 + 12*2 + 13*1) = 112/16 = 7

● Put the result, 7, in the (0, 0) position of the output.

(b) For Position (0, 1): Slide the filter one column to the right (stride = 1).

● Extract the sub-image:

 2  3  4
 7  8  9
12 13 14

● Repeat the weighted sum and normalization steps:

(1/16) * (2*1 + 3*2 + 4*1 + 7*2 + 8*4 + 9*2 + 12*1 + 13*2 + 14*1) = 128/16 = 8

Continue this for all positions.

Put the results in the filtered image matrix:

7 8 ?
? ? ?
? ? ?
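As a check, here is a sketch that runs the whole example end-to-end (the 5x5 image and the 1/16 kernel are the ones reconstructed above):

```python
import numpy as np

image = np.arange(1, 26, dtype=float).reshape(5, 5)  # the 5x5 sample matrix 1..25
kernel = np.array([[1, 2, 1],
                   [2, 4, 2],
                   [1, 2, 1]], dtype=float) / 16     # 3x3 Gaussian approximation

out = np.zeros((3, 3))
for i in range(3):                                   # slide with stride 1, no padding
    for j in range(3):
        out[i, j] = np.sum(image[i:i + 3, j:j + 3] * kernel)

print(out)
# [[ 7.  8.  9.]
#  [12. 13. 14.]
#  [17. 18. 19.]]
```

Because the sample image is a linear ramp and the kernel is symmetric and sums to 1, each output value equals the center pixel of its patch, which makes the hand computation easy to verify.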
