0% found this document useful (0 votes)
18 views14 pages

IT5409 Ch1 Intro New Template

Uploaded by

Mai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views14 pages

IT5409 Ch1 Intro New Template

Uploaded by

Mai
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

2/20/24

Computer Vision
Course
School of Informations and
Communication Technology

1
2/20/24

Computer Vision
Chapter 1: Introduction

Computer Vision Group


School of Information and Communications Technology

Contents

• What is computer vision?


• Concepts and definitions
• Level of vision (low, middle and
high)
• Related fields
• Application areas

CV Group – School of Information and Communication and Technology


4
4

2
2/20/24

What is computer vision?

Human vision
Image or video Sensing (device) Interpreting
Interpretations

Computer vision
From CS131 course “computer vision”,
Prof. Fei-Fei Li, Stanford 'Vision'Lab

The goal of computer vision

• To bridge the gap between pixels and “meaning”


Source: S. Narasimhan

What we see What a computer sees

26-Sep-17 6
6

3
2/20/24

What kind of information can we extract from an image?

• Metric 3D information
• Semantic information

Vision as measurement device

Structure from motion Reconstruction from


Internet photo collections

NASA Mars Rover

Pollefeys et al. Goesele et al.

4
2/20/24

Vision as a source of semantic information

sky amusement park Objects


Activities
Scenes
Locations
The Wicked
Cedar Point Text / writing
Twister
Faces
Ferris Gestures
wheel Motions
ride ride Emotions…
12 E
Slide credit: Kristen Grauman

Lake Erie water


tree ride
tree people waiting in line

people sitting on ride


carousel
tree umbrellas
deck maxair
bench tree pedestrians

What is computer vision?

• Computer vision
• Is an interdisciplinary scientific field that deals with how computers
can be made to gain high-level understanding from digital images or
videos.
• From the perspective of engineering, it seeks to automate tasks that
the human visual system can do.
• Computer vision tasks include
• methods for acquiring, processing, analyzing and understanding digital
images,
• and extraction of high-dimensional data from the real world in order
to produce numerical or symbolic information, e.g., in the forms of
decisions. (Wikipedia).

10

10

5
2/20/24

What is computer vision?

• The two definitions of CV can be defined as a scientific field


that extracts information out of digital images.

• Another way to define CV is through its applications.


• Computer vision is building algorithms that can understand the
content of images and use it for other application

What kind of scene?


Where are the cars?
How far is the building?

11

11

Levels of vision

• Low-level Vision: Image Formation, Acquisition,


Image Processing
• Image formation studies the forward process of
producing images and videos
• Image acquisition:
• A digital image is produced by several image sensors.
• Depending on the type of sensor, the resulting image
data is an ordinary 2D image, a 3D volume, or an
image sequence.
• Image processing focuses on 2D image data
processing using point operators such as contrast
enhancement, filtering (local operations), noise
reduction, image transforms. Image processing is
considered as pre-processing that is usually
necessary to process the image data for CV
applications.
• Work with image as a matrix
• Input: image è output: image

12

12

6
2/20/24

Levels of vision

• Middle-level Vision: Feature, Image matching


• Feature extraction: Image features at various levels of
complexity are extracted from the image data. Examples
of such features: Edges, ridges, lines, texture, shape …
• Image matching
• Image segmentation

13

13

3D urban modeling

Bing maps, Google Streetview


Source: S. Seitz

14

14

7
2/20/24

Levels of vision

• High-level Vision: High-level vision is to infer the semantics,


for example, object recognition and scene understanding.
• Several application topics:
• Object recognition (classification), Identification
• Detection
• Motion analysis
• Scene reconstruction; 3D reconstruction
• Image-based rendering ….

15

15

Related fields

Computer vision at the intersection of multiple scientific fields

16

16

8
2/20/24

Related fields

Source: From EECS 432-Advanced Computer Vision, Northwestern University

17

17

Related fields

• Machine Learning: “The field of study that gives computers the


ability to learn without being explicitly programmed.” – Arthur
Samuel
• Artificial intelligence and computer vision share other topics such
as pattern recognition and learning techniques.
• Computer vision - Deep learning: Artificial Neural Networks with
many layers (CNN: Convolutional Neural Network )

18

18

9
2/20/24

Applications areas

Robotics Application
• Localization-determine robot location automatically
• Navigation
• Obstacles avoidance
• Assembly peg − in − hole, welding, painting
• Manipulation e. g. PUMA robot manipulator
• Human Robot Interaction HRI: Intelligent robotics to
interact with and serve people

19

19

Applications areas

Security Application
• Biometrics iris, fingerprint, face recognition
• Surveillance-detecting certain suspicious activities or
behaviors
• …

Face recognition systems


Fingerprint scanners
Source: from S. Seitz

20

20

10
2/20/24

Examples - Face Detection

Source: from S. Seitz

21

21

Examples of Computer Vision

Slide from Vicente Ordonez

22

22

11
2/20/24

Applications areas

Medicine Application
• Classification and detection e. g.
• 2D/3D segmentation
• 3D human organ reconstruction MRI or ultrasound
• Vision-guided robotics surgery
• …

Slide from Jason Lawrence


23

23

Applications areas

Industrial Automation Application


• Industrial inspection defect detection
• Barcode and package label reading
• Object sorting
• Document understanding e. g. OCR
• …
Transportation Application
• Autonomous vehicle
• Safety, e.g., driver vigilance monitoring
• …

24

24

12
2/20/24

Facebook's suggestion
• Understand the image

25
25

Examples- Toys and Robots

26

26

13
2/20/24

Examples - Optical character recognition (OCR)

License plate readers


Digit recognition,(AT&T labs) http://en.wikipedia.org/wiki/Automatic_number_pl
ate_recognition

Source: from S. Seitz

27

27

THANK YOU !

CV Group – School of Information and Communication and Technology 28

28

14

You might also like