01 Introduction
01 Introduction
https://github.com/mint-lab/3dv_tutorial
Introduction to 3D Vision
1312 pages
An Invitation
Introduction to 3D Vision
: A Tutorial for Everyone
▪ Computer vision is an interdisciplinary field that deals with how computers can be made to gain
high-level understanding from digital images or videos.
▪ From the perspective of engineering, it seeks to automate tasks that the human visual system can
do.[1][2][3]
▪ "Computer vision is concerned with the automatic extraction, analysis and understanding of useful
information from a single image or a sequence of images.
▪ It involves the development of a theoretical and algorithmic basis to achieve automatic visual
understanding."[9]
Reference: Wikipedia 3
What is Computer Vision?
▪ Computer vision is an interdisciplinary field that deals with how computers can be made to gain
high-level understanding from digital images or videos.
▪ From the perspective of engineering, it seeks to automate tasks that the human visual system can
do.[1][2][3]
▪ "Computer vision is concerned with the automatic extraction, analysis and understanding of useful
information from a single image or a sequence of images.
▪ It involves the development of a theoretical and algorithmic basis to achieve automatic visual
understanding."[9]
Reference: Wikipedia 4
What is Computer Vision?
Image Understanding
Computer
Graphics
Shape
Computer Face
Location
Vision
Human
Image
Processing
(transformed) Image/Signal
5
What is Computer Vision? Computer Vision
Image Understanding
Computer
Graphics
Shape
Computer Face
Location
Vision
Human
Image
Processing
(transformed) Image/Signal
6
What is Computer Vision?
Computer Vision
7
What is 3D Vision?
Visual Geometry
Multiple View Geometry
Geometric Vision
Computer Vision
9
What is 3D Vision?
▪ Reference books
10
What is 3D Vision?
Reference: Snavely et al., Photo Tourism: Exploring Photo Collections in 3D, SIGGRAPH, 2006 12
Applications) 3D Reconstruction
Reference: Im et al., High Quality Structure from Small Motion for Rolling Shutter Cameras, ICCV, 2015
Reference: Hedman et al., Casual 3D Photography, SIGGRAPH Asia, 2017 14
Applications) Real-time Visual SLAM
▪ ORB-SLAM (2014)
Reference: Mur-Artal et al., ORB-SLAM: A Versatile and Accurate Monocular SLAM System, T-RO, 2015 15
Applications) Augmented Reality
Reference: Klein and Murray, Parallel Tracking and Mapping for Small AR Workspaces, ISMAR, 2007 16
Applications) Virtual Reality
Image: TechSpot 17
Applications) Mixed Reality
Image: SlashGear 18
Note) Slides and example codes are available:
Summary https://github.com/mint-lab/3dv_tutorial
Next Topics
▪ Single-view Geometry
▪ Two-view Geometry
▪ Solving Equations
▪ Finding Correspondence
▪ Multiple-view Geometry
▪ Bayesian Filtering
▪ Visual SLAM and Odometry
19