0% found this document useful (0 votes)

105 views40 pages

Computer Vision - 01 Introduction

The document outlines a course on Computer Vision, detailing grading criteria, textbooks, and essential topics such as image processing, feature detection, segmentation, and object recognition. It covers various algorithms and applications in the field, including optical character recognition, medical imaging, and automotive safety. The course aims to provide a comprehensive understanding of both theoretical and practical aspects of computer vision.

Uploaded by

MohammadUsmanShabbir

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

105 views40 pages

Computer Vision - 01 Introduction

Uploaded by

MohammadUsmanShabbir

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Computer Vision

(240069)

Introduction

Muhammad Tariq Mahmood

tariq@[Link]
School of Computer Science and Engineering
Korea University of Technology and Education
1
Computer Vision (240069)
 Grading
 2 Assignments/Presentations (40%)
 Term Project (paper + presentation) (30%)
 Final exam (30%)

 Textbook
 Computer Vision: Algorithms and Applications by R. Szeliski
([Link]
 Additional References
 Computer vision: models, learning and inference by Simon J.D. Prince
([Link]
 Computer Vision: A Modern Approach by D. Forsyth and J. Ponce
 Multiple View Geometry by R. Hartly and A. Zisserman

2
What is Computer Vision?
 Vision
 It is the process of discovering what is present in the world and where
it is by looking.

 Computer Vision
 It is the study of analysis of pictures and videos in order to achieve
results similar to those as by men.

3
What is Computer Vision?

Pattern Machine
Recognition Learning

Mathematics
Image Computer & Probability
Processing Vision

Computer Optics
Graphics

4
Contents

Computer Vision Algorithms and Applications by Richard Szeliski Springer

5
Image Formation
 Geometric primitives
 2D transformations
 3D transformations
 3D rotations
 3D to 2D projections
 Lens distortions
 Photometric image formation
 Lighting
 Reflectance and shading
 Optics
 The digital camera
 Sampling and aliasing
 Color
 Compression

6
Image Processing
 Point operators
 Pixel transforms
 Color transforms
 Compositing and matting
 Histogram equalization
 Application: Tonal adjustment
 Linear filtering
 Separable filtering
 Examples of linear filtering
 Band-pass and steerable filters
 More neighborhood operators
 Non-linear filtering
 Morphology
 Distance transforms
 Connected components

7
Feature Detection and Matching
 Points and patches
 Feature detectors
 Feature descriptors
 Feature matching
 Feature tracking
 Application: Performance-driven animation
 Edges
 Edge detection
 Edge linking
 Application: Edge editing and enhancement
 Lines
 Successive approximation
 Hough transforms
 Vanishing points
 Application: Rectangle detection

8
Segmentation
 Active contours
 Snakes
 Dynamic snakes and
CONDENSATION
 Scissors
 Level Sets
 Application: Contour tracking and
rotoscoping
 Split and merge
 Watershed
 Region splitting (divisive clustering)
 Region merging (agglomerative
clustering)
 Graph-based segmentation
 Probabilistic aggregation
 Mean shift and mode finding
 K-means and mixtures of Gaussians
 Mean shift
 Normalized cuts
 Graph cuts and energy-based methods
 Application: Medical image
segmentation
9
Segmentation
 Active contours
 Snakes
 Dynamic snakes and
CONDENSATION
 Scissors
 Level Sets
 Application: Contour tracking and
rotoscoping
 Split and merge
 Watershed
 Region splitting (divisive clustering)
 Region merging (agglomerative
clustering)
 Graph-based segmentation
 Probabilistic aggregation
 Mean shift and mode finding
 K-means and mixtures of Gaussians
 Mean shift
 Normalized cuts
 Graph cuts and energy-based methods
 Application: Medical image
segmentation
10
Feature-based alignment
 2D and 3D feature-based alignment
 2D alignment using least squares
 Application: Panography
 Iterative algorithms
 Robust least squares and RANSAC
 3D alignment
 Pose estimation
 Linear algorithms
 Iterative algorithms
 Application: Augmented reality
 Geometric intrinsic calibration
 Calibration patterns
 Vanishing points
 Application: Single view metrology
 Rotational motion
 Radial distortion

11
Structure from motion
 Triangulation
 Two-frame structure from motion
 Projective (uncalibrated)
reconstruction
 Self-calibration
 Application: View morphing
 Factorization
 Perspective and projective
factorization
 Application: Sparse 3D model
extraction
 Bundle adjustment
 Exploiting sparsity
 Application: Match move and
augmented reality
 Uncertainty and ambiguities
 Application: Reconstruction from
Internet photos
 Constrained structure and motion
 Line-based techniques
 Plane-based techniques

12
Dense motion estimation
 Translational alignment
 Hierarchical motion estimation
 Fourier-based alignment
 Incremental refinement
 Parametric motion
 Application: Video stabilization
 Learned motion models
 Spline-based motion
 Application: Medical image
registration
 Optical flow
 Multi-frame motion estimation
 Application: Video denoising
 Application: De-interlacing
 Layered motion
 Application: Frame
interpolation
 Transparent layers and
reflections
13
Image Stitching
 Motion models
 Planar perspective motion
 Application: Whiteboard and
document scanning
 Rotational panoramas
 Gap closing
 Application: Video summarization
and compression
 Cylindrical and spherical coordinates
 Global alignment
 Bundle adjustment
 Parallax removal
 Recognizing panoramas
 Direct vs. feature-based alignment
 Compositing
 Choosing a compositing surface
 Pixel selection and weighting (de-
ghosting)
 Application: Photomontage
 Blending

14
Computational Photography
 Photometric calibration
 Radiometric response function
 Noise level estimation
 Vignetting
 Optical blur (spatial response)
estimation
 High dynamic range imaging
 Tone mapping
 Application: Flash photography
 Super-resolution and blur removal
 Color image demosaicing
 Application: Colorization
 Image matting and compositing
 Blue screen matting
 Natural image matting
 Optimization-based matting
 Smoke, shadow, and flash matting
 Video matting
 Texture analysis and synthesis
 Application: Hole filling and
inpainting
 Application: Non-photorealistic
rendering 15
Stereo correspondence
 Epipolar geometry
 Rectification
 Plane sweep
 Sparse correspondence
 3D curves and profiles
 Dense correspondence
 Similarity measures
 Local methods
 Sub-pixel estimation and uncertainty
 Application: Stereo-based head
tracking
 Global optimization
 Dynamic programming
 Segmentation-based techniques
 Application: Z-keying and
background replacement
 Multi-view stereo
 Volumetric and 3D surface
reconstruction
 Shape from silhouettes

16
3D reconstruction
 Shape from X
 Shape from shading and photometric stereo
 Shape from texture
 Shape from focus
 Active rangefinding
 Range data merging
 Application: Digital heritage
 Surface representations
 Surface interpolation
 Surface simplification
 Geometry images
 Point-based representations
 Volumetric representations
 Implicit surfaces and level sets
 Model-based reconstruction
 Architecture
 Heads and faces
 Application: Facial animation
 Whole body modeling and tracking
 Recovering texture maps and albedos
 Estimating BRDFs
 Application: 3D photography

17
Image-based rendering
 View interpolation
 View-dependent texture maps
 Application: Photo Tourism
 Layered depth images
 Impostors, sprites, and layers
 Light fields and Lumigraphs
 Unstructured Lumigraph
 Surface light fields
 Application: Concentric mosaics
 Environment mattes
 Higher-dimensional light fields
 The modeling to rendering
continuum
 Video-based rendering
 Video-based animation
 Video textures
 Application: Animating pictures
 3D Video
 Application: Video-based
walkthroughs

18
 Object detection
Recognition
 Face detection
 Pedestrian detection
 Face recognition
 Eigenfaces
 Active appearance and 3D shape models
 Application: Personal photo collections
 Instance recognition
 Geometric alignment
 Large databases
 Application: Location recognition
 Category recognition
 Bag of words
 Part-based models
 Recognition with segmentation
 Application: Intelligent photo editing
 Context and scene understanding
 Learning and large image collections
 Application: Image search
 Recognition databases and test sets

19
Computer Vision Levels
 Low-level vision (early vision)
 Image formation
 Edge detection & image filtering
 Optical flow
 Segmentation
 Shape matching
 Stereopsis
 Mid-level vision
 Object tracking
 Human motion analysis
 High-level vision
 Object recognition
 Event detection
 Scene & video understanding
20
Computer Vision Algorithms
 Structure from motion algorithms can reconstruct a sparse 3D
point model of a large complex scene from hundreds of partially
overlapping photographs(Snavely, Seitz, and Szeliski 2006)c
2006 ACM.

21
Computer Vision Algorithms
 Stereo matching algorithms can build a detailed 3D model of a
building from hundreds of differently exposed photographs
taken from the Internet (Goesele, Snavely, Curless et al. 2007) c
2007 IEEE.

22
Computer Vision Algorithms
 Person tracking algorithms can track a person walking in front
of a cluttered background (Sidenbladh, Black, and Fleet 2000) c
2000 Springer

23
Computer Vision Algorithms
 Face detection algorithms, coupled with color-based clothing
and hair detection algorithms, can locate and recognize the
individuals in this image (Sivic, Zitnick, and Szeliski 2006) c
2006 Springer.

24
Computer Vision Applications
 Optical character recognition (OCR): reading handwritten
postal codes on letters (Figure 1.4a) and automatic number plate
recognition (ANPR)

25
Computer Vision Applications
 Machine inspection: rapid parts inspection for quality assurance
using stereo vision with specialized illumination to measure
tolerances on aircraft wings or auto body parts or looking for
defects in steel castings using X-ray vision;

26
Computer Vision Applications
 Retail: object recognition for automated checkout

27
Computer Vision Applications
 Medical imaging: registering pre-operative and intra-operative
imagery or performing long-term studies of people’s brain
morphology as they

28
Computer Vision Applications
 Automotive safety: detecting unexpected obstacles such as
pedestrians on the street, under conditions where active vision
techniques such as radar or lidar do not work well

29
Computer Vision Applications
 Surveillance: monitoring for intruders, analyzing highway
traffic, and monitoring pools for drowning victims;

30
Computer Vision Application
 Fingerprint recognition and biometrics: for automatic access
authentication as well

 3D model building (photogrammetry): fully automated

construction of 3D models from aerial photographs

31
Computer Vision Applications

 image stitching: merging different views (Szeliski and Shum

1997) c 1997 ACM

32
Computer Vision Applications

 Exposure bracketing: merging different exposures

33
Computer Vision Applications

 Morphing:blending between two photographs (Gomes, Darsa,

Costa et al. 1999) c 1999 Morgan Kaufmann

34
Computer Vision Applications

 Turning a collection of photographs into a 3D model (Sinha,

Steedly, Szeliski et al. 2008) c 2008 ACM

35
A Brief History

36
A Brief History

37
A Brief History

38
A Brief History

39
A Brief History

Unit 4 Computer Vision Lecture Notes 1 4 Compress
No ratings yet
Unit 4 Computer Vision Lecture Notes 1 4 Compress
138 pages
Computer Vision 1731163352
No ratings yet
Computer Vision 1731163352
153 pages
COMP3411 Week 7 - Computer Vision
No ratings yet
COMP3411 Week 7 - Computer Vision
58 pages
Computer Vision Algorithms and Applications 2nd Edition Richard Szeliski Full Access
No ratings yet
Computer Vision Algorithms and Applications 2nd Edition Richard Szeliski Full Access
163 pages
Computer Vision Introduction
No ratings yet
Computer Vision Introduction
42 pages
Module 1
No ratings yet
Module 1
18 pages
1
No ratings yet
1
22 pages
Machine Vision: Chapter Index
No ratings yet
Machine Vision: Chapter Index
1 page
CS436 CS5310 EE513 L01 Introduction
No ratings yet
CS436 CS5310 EE513 L01 Introduction
54 pages
CO Machine Vision
No ratings yet
CO Machine Vision
3 pages
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
No ratings yet
Computer Vision CS-6350: Prof. Sukhendu Das Deptt. of Computer Science and Engg., IIT Madras, Chennai - 600036
48 pages
UNIT-I - Introduction To Computer Vision
No ratings yet
UNIT-I - Introduction To Computer Vision
45 pages
Unit 1 - MV - 10212EC159
No ratings yet
Unit 1 - MV - 10212EC159
71 pages
Lecture 1 AI Summary
No ratings yet
Lecture 1 AI Summary
31 pages
CO1 Notes
No ratings yet
CO1 Notes
105 pages
Image Processing Research Papers Bibliography
No ratings yet
Image Processing Research Papers Bibliography
81 pages
Unit 1
No ratings yet
Unit 1
200 pages
Computer Vision for Coders
No ratings yet
Computer Vision for Coders
152 pages
01 Lecture No. 1
No ratings yet
01 Lecture No. 1
52 pages
Intro to Computer Vision Course
No ratings yet
Intro to Computer Vision Course
76 pages
Computer Vision I
No ratings yet
Computer Vision I
61 pages
AI For Computer Vision
No ratings yet
AI For Computer Vision
6 pages
Digital Image Processing: Instructor: Namrata Vaswani
No ratings yet
Digital Image Processing: Instructor: Namrata Vaswani
27 pages
Computer Vision Workshop Overview
No ratings yet
Computer Vision Workshop Overview
21 pages
Int345 Computer Vision
No ratings yet
Int345 Computer Vision
2 pages
CV #1 Course Introduction-1
No ratings yet
CV #1 Course Introduction-1
61 pages
Computer VISION - 1
No ratings yet
Computer VISION - 1
21 pages
Computer Vision Sample
No ratings yet
Computer Vision Sample
57 pages
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
No ratings yet
Cv2021-Lec1-Introduction 1600 PDF - Gdrive.vip
61 pages
Lec 01 CompVision N DIP Intro
No ratings yet
Lec 01 CompVision N DIP Intro
91 pages
Unit 1
No ratings yet
Unit 1
186 pages
CS5330 F22 Lectures
No ratings yet
CS5330 F22 Lectures
116 pages
Digital Image Processing Overview
No ratings yet
Digital Image Processing Overview
20 pages
Computer Vision 2011
100% (1)
Computer Vision 2011
103 pages
Module 1 Chapter1
No ratings yet
Module 1 Chapter1
6 pages
Opencv 2 Refman
No ratings yet
Opencv 2 Refman
643 pages
Computer Vision Al 701
No ratings yet
Computer Vision Al 701
50 pages
Computer Vision: Techniques & Uses
No ratings yet
Computer Vision: Techniques & Uses
46 pages
Computer Vision Basics Course Overview
No ratings yet
Computer Vision Basics Course Overview
65 pages
Introduction To Machine Vision
No ratings yet
Introduction To Machine Vision
15 pages
Opencv 2 Refman
No ratings yet
Opencv 2 Refman
913 pages
Summary of Computer Vision Cyril Stanissh
No ratings yet
Summary of Computer Vision Cyril Stanissh
13 pages
CV Notes
No ratings yet
CV Notes
333 pages
Opencv 2 Refman
No ratings yet
Opencv 2 Refman
929 pages
Lecture-1 CV
No ratings yet
Lecture-1 CV
18 pages
Computer Vision 2 Marks Answers
No ratings yet
Computer Vision 2 Marks Answers
4 pages
Machine Vision System Overview
No ratings yet
Machine Vision System Overview
48 pages
Intro to Computer Vision & IP
No ratings yet
Intro to Computer Vision & IP
48 pages
Lecture 1 S
No ratings yet
Lecture 1 S
23 pages
Manual de Referencia Opencv
No ratings yet
Manual de Referencia Opencv
915 pages
Service Manual LCD Television File No.: Z5Lw Reference No
No ratings yet
Service Manual LCD Television File No.: Z5Lw Reference No
48 pages
Switches, Relays, and Transistors
No ratings yet
Switches, Relays, and Transistors
1 page
The Rape of The Lock
No ratings yet
The Rape of The Lock
6 pages
Udx1726q-N10 - E11f03p87
No ratings yet
Udx1726q-N10 - E11f03p87
4 pages
(Ebook) The Age of Migration: Lnternational Population Movements in The Modern World by Stephen Castles Hein de Haas Mark J. Miller ISBN 9780230355774, 0230355773 PDF Download
No ratings yet
(Ebook) The Age of Migration: Lnternational Population Movements in The Modern World by Stephen Castles Hein de Haas Mark J. Miller ISBN 9780230355774, 0230355773 PDF Download
133 pages
VA7000 Product Brief V4.0
No ratings yet
VA7000 Product Brief V4.0
4 pages
The Art of WolfWalkers - Text
100% (1)
The Art of WolfWalkers - Text
229 pages
1 F M04 BLIT0108 06 LG CH03
No ratings yet
1 F M04 BLIT0108 06 LG CH03
3 pages
Consolidated Guidelines For Prevention and Treatment of HIV in Uganda 2016
No ratings yet
Consolidated Guidelines For Prevention and Treatment of HIV in Uganda 2016
152 pages
Brochure Hyundai Aero
100% (3)
Brochure Hyundai Aero
10 pages
Acesita Stainless Steel Specs
No ratings yet
Acesita Stainless Steel Specs
16 pages
Global Battery Management Systems Market
No ratings yet
Global Battery Management Systems Market
19 pages
Holland Brewing History 900-1900
100% (3)
Holland Brewing History 900-1900
451 pages
Restoration of Carnegie Hall's Acoustics
No ratings yet
Restoration of Carnegie Hall's Acoustics
188 pages
krc1 Computer Unit KUKA en
No ratings yet
krc1 Computer Unit KUKA en
10 pages
Updated Attendees List IHST
No ratings yet
Updated Attendees List IHST
19 pages
To Study The Braking System
No ratings yet
To Study The Braking System
8 pages
Risk Assessment for Offshore Systems
No ratings yet
Risk Assessment for Offshore Systems
63 pages
Doctor Who: Titanic in Space
No ratings yet
Doctor Who: Titanic in Space
90 pages
Epi Reviewer
No ratings yet
Epi Reviewer
4 pages
Igcse Ict Storage Devices and Media 0417
No ratings yet
Igcse Ict Storage Devices and Media 0417
3 pages
FIORDA Case Study Corrosion Attack On Primary Reformer Tubes
100% (1)
FIORDA Case Study Corrosion Attack On Primary Reformer Tubes
7 pages
Management of Penile Fracture
No ratings yet
Management of Penile Fracture
4 pages
Secondary Electron Spectra in Dielectrics
No ratings yet
Secondary Electron Spectra in Dielectrics
9 pages
4 Sinopec TULUX T600F 5W-40 Diesel Engine Oil Full Syn
No ratings yet
4 Sinopec TULUX T600F 5W-40 Diesel Engine Oil Full Syn
3 pages
24 PG Interactive Notebook - Geography - Volume 1
No ratings yet
24 PG Interactive Notebook - Geography - Volume 1
24 pages
Grade 7 English Exam
No ratings yet
Grade 7 English Exam
6 pages
Kaplan Turbine: Design, Benefits, Uses
No ratings yet
Kaplan Turbine: Design, Benefits, Uses
8 pages
Paper Cup Machine (2024-06-21 19 - 55 - 26)
No ratings yet
Paper Cup Machine (2024-06-21 19 - 55 - 26)
3 pages

Computer Vision - 01 Introduction

Uploaded by

Computer Vision - 01 Introduction

Uploaded by

Computer Vision

Muhammad Tariq Mahmood

Computer Vision Algorithms and Applications by Richard Szeliski Springer

 3D model building (photogrammetry): fully automated

 image stitching: merging different views (Szeliski and Shum

 Exposure bracketing: merging different exposures

 Morphing:blending between two photographs (Gomes, Darsa,

 Turning a collection of photographs into a 3D model (Sinha,

You might also like