Lecture 01 Introduction To Computer Vision PDF
Lecture 01 Introduction To Computer Vision PDF
COMPUTER VISION
LECTURE 1:
INTRODUCTION TO COMPUTER VISION
www.theICTlab.org
Copyright © Charalambos Poullis
Recent work
Semantic Segmentation from Remote Sensor Data and the Exploitation of Latent Learning for Classification of Auxiliary
Tasks, B. Chaterjee, C. Poullis, CRV 2019
Delineation of Road Networks Using Deep Residual Neural Networks and Iterative Hough Transform, P. Xu, C. Poullis, ISVC
2019
Inattentional Blindness for Redirected Walking Using Dynamic Foveated Rendering, Under Review 2020
What we see
Source: S. Narasimhan
The goal of computer vision
• To extract “meaning” from pixels
Semantic information
What kind of information can be
extracted from an image?
European
City
Outdoor scene
…
Semantic information
What kind of information can be
extracted from an image?
tree tree
roof
sky chimney
building
building
window
door
trashcan car car
person
Outdoor scene
ground City European
…
Semantic information
What kind of information can be
extracted from an image?
tree tree
roof
sky chimney
building
building
window
door
trashcan car car
person
Outdoor scene
ground City European
…
L. G. Roberts,
Machine Perception of Three Dime
nsional Solids
, Ph.D. thesis, MIT Department of
Electrical Engineering, 1963.
Successes of computer vision to date
Optical character recognition (OCR)
Digit recognition
yann.lecun.com
Sudoku grabber
http://sudokugrab.blogspot.com/
https://www.youtube.com/watch?v=oI
mMJ6p6mKE
Automatic check processing Source: S. Seitz, N. Snavely
Biometrics
Source: S. Seitz
Biometrics
Source: S. Seitz
Face detection
Source: S. Seitz
Smile detection
http://www.apple.com/ilife/iphoto/
Visual search: Google search by image
Google self-driving cars
https://www.youtube.com/watch?v=bDOnn0-4Nq8
YouTube Video
Occipital: 3D structure sensor for iPad
http://www.kickstarter.com/projects/occi
pital/structure-sensor-capture-the-world
-in-3d
http://structure.io/#home-about-vid
3D reconstruction from photo collections
Source: S. Seitz
Vision in space
NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.
http://www.cs.ubc.ca/spider/lowe/vision.html
Robotics
Computer Vision
Machine
Scope of Human Computer
Interaction
Learning COMP 425/6341
Image Processing Medical Imaging
Graphics
Feature Matching
Computational Recognition
Neuroscience
Photography
Optics
Computer Vision and Nearby Fields
• Computer Graphics: Models to Images
• Comp. Photography: Images to Images
• Computer Vision: Images to Models
Course Website
http://www.poullis.org/courses/2020/Winter/COMP425-6341
http://szeliski.org/Book/
Copyright © Charalambos Poullis
Labs
• Start on Tuesday, 14th January 2020
• Tutorials in OpenCV
• OpenCV API
R. Szeliski, A. Efros,
S. Lazebnik,
D. Forsyth, B. Freeman, F. Durand,
J. Ponce, M. Pollefeys, L. Fei-Fei,
J. Koenderink, D. Lowe, A. Torralba,
S. Seitz, K. Grauman, R. Fergus,
J. Hays
Copyright © Charalambos Poullis
Assignments and Project
• #1: Demosaicing
• #2: Feature Detection and Matching
• Project: Image Stitching
Demosaicing
• Implement the conversion of the Bayer pixel pattern to an
RGB representation where each pixel has red, green and blue
color channels
Feature Detection and Matching
• Implement a feature detection and matching algorithm
• Feed feature matches to a structure-from-motion system