Fundamentals of Computer Vision
Fundamentals of Computer Vision
• Research interests:
– Computer/machine vision
– Digital Image Processing
– computer graphics
1
9/5/2018
• Office hours:
Mon & Wed at 10:00 am to 11:00 noon
or by appointment, M-014 Ground floor
• Textbook:
Computer Imaging Digital Image Processing and Analysis
Nov. 2010
• Class material:
SLATE
This week
2
9/5/2018
• Vision is useful
• Vision is interesting
• Vision is difficult
– Half of primate cerebral cortex is devoted to visual
processing
– Achieving human-level visual perception is
probably “AI-complete”
3
9/5/2018
Artificial Intelligence
Computer Vision
Image Processing
4
9/5/2018
5
9/5/2018
Smile detection?
6
9/5/2018
LaneHawk by Evoluti-onRobotics
“A smart camera is flush-mounted in the checkout lane, continuously
watching for items. When an item is detected and recognized, the cashier
verifies the quantity of items that were found under the basket, and
continues to close the transaction. The item can remain under the basket,
and with LaneHawk,you are assured to get paid for it… “
7
9/5/2018
Snaptell
http://download.cnet.com/ios/snaptell/3260-20_4-6312649-1.html
8
9/5/2018
9
9/5/2018
Sports
Smart cars
• Mobileye
– Vision systems currently in high-end BMW, GM, Volvo models
– By 2010: 70% of car manufacturers.
– Video demo
10
9/5/2018
Smart cars
• Mobileye
– Vision systems currently in high-end BMW, GM, Volvo models
– By 2010: 70% of car manufacturers.
– Video demo
11
9/5/2018
Vision in space
NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.
Medical imaging
12
9/5/2018
http://www.cs.ubc.ca/spider/lowe/vision.h
tml
13
9/5/2018
14
9/5/2018
Vision as Structure
Real-time stereo
measurement
from motion
device
Multi-view stereo for
community photo collections
• Small Reason:
– For measurement, other sensors are often better (in
DARPA Grand Challenge, vision was barely used!)
– For navigation, you still need to learn!
• Big Reason:
– The goals of computer vision (what + where) are
in terms of what humans care about.
15
9/5/2018
Object categorization
sky
building
flag
face
banner
wall
street lamp
bus bus
16
9/5/2018
slanted
non-rigid moving
object
vertical
17
9/5/2018
Challenges: illumination
18
9/5/2018
Challenges: scale
Challenges: deformation
19
9/5/2018
Challenges: occlusion
20
9/5/2018
21
9/5/2018
Challenges or opportunities?
• Images are confusing, but they also reveal the structure of
the world through numerous cues
• Our job is to interpret the cues! (e.g. Texture for ICR query)
22
9/5/2018
23
9/5/2018
Source: J. Koenderink
24
9/5/2018
25
9/5/2018
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a
particular 2D picture
Bottom line
• Perception is an inherently ambiguous problem
– Many different 3D scenes could have given rise to a particular 2D
picture
• Possible solutions
– Bring in more constraints (more images)
– Use prior knowledge about the structure of the world
• Need both exact measurements and statistical inference!
Image source: F. Durand
26
9/5/2018
I. Early vision
• Basic image formation and processing
* =
Linear filtering
Edge detection
Cameras and sensors
Light and color
Alignment
27
9/5/2018
IV. Recognition
28
9/5/2018
V. Advanced Topics
• Some in class other in term projects…
Course requirements
• Philosophy: computer vision is best experienced if hands-on
• Final project:
– Putting several pieces together
– List of options will be posted in the next few weeks (some great
ideas can be find on web)
– Expect to commit to a project idea by the end of Aug/September
• Participation: ?%
– Ask questions •Mid & Final Terms: 80%
– Answer questions
– Give me feedback: I’m learning too!
29
9/5/2018
Collaboration policy
30