
1 Intro Visión Artificial

Uploaded by

ZABDIEL DAVID BLANCO AMAYA


Computer Vision

Eduardo Avendaño Fernández

Hello!
I am Eduardo Avendaño Fernández.

I am here because I like DSP and Computer Vision.
You can find me at: @eduardoavendanofernandez

Introduction to Artificial Vision
Let’s start with the first set of slides.

Magic Leap, Oculus, HoloLens, etc.

Every image tells a story

◎ Goal of computer vision: perceive the “story” behind the picture
◎ Compute properties of the world
○ 3D shape
○ Names of people or objects
○ What happened?

The goal of computer vision

But humans can tell a lot about a scene from a little information…

Source: “80 million tiny images” by Torralba, et al.

Can computers match human perception?
Credits: Fei-Fei, Fergus & Torralba

[Figure: street scene with labeled regions: Sky, Building, Flag, Street Lamp, Banner, Wall, Bus, Cars]

The Goal of Computer Vision

◎ Forensics

Source: Nayar and Nishino, “Eyes for Relighting”

Improve photos (“Computational Photography”)

Super-resolution (source: 2d3)

Low-light photography
(credit: Hasinoff et al., SIGGRAPH ASIA 2016)

Inpainting / image completion (image credit: Hays and Efros)

Depth of field on cell phone camera (source: Google Research Blog)

Why study computer vision?
◎ Billions of images/videos captured per day

◎ Huge number of potential applications

◎ The next slides show the current state of the art

Optical character recognition (OCR)

Digit recognition, AT&T labs (1990’s): http://yann.lecun.com/exdb/lenet/
License plate readers: http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
Sudoku grabber: http://sudokugrab.blogspot.com/
Automatic check processing

Face analysis and recognition

Vision-based biometrics

◎ Who is she?
“How the Afghan Girl was Identified by Her Iris Patterns” (read the story)

Source: S. Seitz

Login without a password

Fingerprint scanners on many new smartphones and other devices

Face unlock on Apple iPhone X
See also http://www.sensiblevision.com/

Special effects: shape capture

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Source: S. Seitz

Special effects: motion capture

Pirates of the Caribbean, Industrial Light and Magic

Source: S. Seitz

Which face is real?

https://www.whichfaceisreal.com/

Image synthesis

Zhu et al., Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks, ICCV 2017

Smart Cars

◎ Mobileye
◎ Tesla Autopilot
◎ Safety features
○ Pedestrian collision warning
○ Forward collision warning
○ Lane departure warning
○ Headway monitoring and warning

Vision-based interaction: Xbox Kinect

Computer vision research in biology

http://www.vision.caltech.edu/visipedia/
http://leafsnap.com/

Applications: 3D Scanning

Scanning Michelangelo’s “The David”

• The Digital Michelangelo Project
- http://graphics.stanford.edu/projects/mich/
• UW Prof. Brian Curless, collaborator
• 2 BILLION polygons, accuracy to .29mm

Self-driving cars

◎ Waymo

Robotics

Amazon Prime Air
Amazon Scout
Amazon Picking Challenge: http://www.robocup2016.org/en/events/amazon-picking-challenge/
NASA’s Mars Curiosity Rover: https://en.wikipedia.org/wiki/Curiosity_(rover)

Medical imaging

3D imaging (MRI, CT)

Skin cancer classification with deep learning: https://cs.stanford.edu/people/esteva/nature/

Virtual & Augmented Reality

6DoF head tracking
Hand & body tracking
3D scene understanding
3D-360 video capture

Current state of the art

Many examples are less than 5 years old
Active research area (ML, DL…)
Startups (robotics, autonomous vehicles, medical imaging, construction, inspection, VR/AR)

Why computer vision matters

Safety, Health, Security, Comfort, Fun, Access

Ridiculously brief history of computer vision

◎ 1966: Minsky assigns computer vision as an undergrad summer project

◎ 1960’s: interpretation of synthetic worlds
◎ 1970’s: some progress on interpreting selected images
◎ 1980’s: ANNs come and go; shift toward geometry and increased mathematical rigor
◎ 1990’s: face recognition; statistical analysis in vogue (Michael Jackson)
◎ 2000’s: broader recognition; large annotated datasets available; video processing starts
◎ 2010’s: Deep learning with ConvNets
◎ 2020’s: Widespread autonomous vehicles?
◎ 2030’s: robot uprising?

Why is computer vision difficult?

Viewpoint variation
Illumination
Credit: Flickr user michaelpaul
Scale
Motion (Source: S. Lazebnik)
Intra-class variation
Background clutter
Occlusion

Bottom line

◎ Perception is an inherently ambiguous problem
○ Many different 3D scenes could have given rise to a given 2D image
◎ We often must use prior knowledge about the world’s structure

Artist Julian Beever with his anamorphic Coke bottle
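
The ambiguity can be illustrated with a few lines of Python. This is only a sketch under an idealized pinhole camera model, which the slides do not specify; the function name `project`, the `focal_length` parameter, and the sample points are hypothetical and chosen purely for illustration. It shows that two different 3D points lying on the same viewing ray land on the same 2D image coordinates, so depth cannot be recovered from a single image without prior knowledge.

```python
def project(point_3d, focal_length=1.0):
    """Pinhole projection of a 3D point (X, Y, Z), given in camera
    coordinates, onto the image plane: (f * X / Z, f * Y / Z)."""
    x, y, z = point_3d
    if z <= 0:
        raise ValueError("point must be in front of the camera (Z > 0)")
    return (focal_length * x / z, focal_length * y / z)


# Two different 3D points on the same ray through the camera center.
near = (1.0, 2.0, 4.0)
far = tuple(3.0 * c for c in near)  # three times farther away

# Both points project to the same image location.
print(project(near))  # (0.25, 0.5)
print(project(far))   # (0.25, 0.5)
```

An anamorphic drawing exploits exactly this effect: a flat pattern is arranged so that, from one viewpoint, it produces the same image as a 3D object would.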

Course overview

1. Low-level vision
○ image processing, edge detection, feature detection, cameras, image formation
2. Geometry and algorithms
○ projective geometry, stereo, structure from motion, optimization
3. Recognition
○ face detection / recognition, category recognition, segmentation

1. Low-level vision: Basic image processing and image formation

[Figure: image * kernel = filtered image]
Filtering, edge detection
Image formation
Feature extraction
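
As a rough illustration of filtering and edge detection, the sketch below convolves a tiny synthetic image with a Sobel kernel in plain Python. The slides do not name a specific filter, so the Sobel kernel, the helper name `convolve2d`, and the test image are assumptions made for this example, not the course's own code.

```python
def convolve2d(image, kernel):
    """'Valid' 2D convolution of a list-of-lists image with a small kernel."""
    kh, kw = len(kernel), len(kernel[0])
    # True convolution flips the kernel in both directions.
    flipped = [row[::-1] for row in kernel[::-1]]
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for i in range(out_h):
        row = []
        for j in range(out_w):
            acc = 0
            for di in range(kh):
                for dj in range(kw):
                    acc += image[i + di][j + dj] * flipped[di][dj]
            row.append(acc)
        out.append(row)
    return out


# Horizontal-gradient Sobel kernel (assumed here as the edge filter).
SOBEL_X = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]

# 4x6 test image: dark left half, bright right half, so one vertical edge.
image = [[0, 0, 0, 10, 10, 10] for _ in range(4)]

response = convolve2d(image, SOBEL_X)
for row in response:
    print(row)
# [0, -40, -40, 0]
# [0, -40, -40, 0]
```

The response is zero in the flat regions and large in magnitude only where the window straddles the edge. In practice a library routine would normally replace the explicit loops.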
Geometry

Image credit: IDS Imaging

Projective geometry
Stereo vision
Multi-view stereo
Structure from motion

Recognition
“dog”

Image classification
Object detection

Convolutional Neural Networks

Academic Integrity

◎ Assignments will be done solo or in pairs
◎ Exams: theoretical and practical, in weeks 3, 6, 9 and 15
◎ Late submissions on Moodle will be penalized (first 6 hours: -0.5 units; 12 hours: -1 unit; 24 hours: -2 units)

What is Computer Vision?

◎ Input: images or video

◎ Output: description of the world
○ Many levels of description

Low-Level or “Early” Vision

Considers local properties of an image

“There’s an edge!”

Mid-Level Vision

Grouping and segmentation

“There’s an object and a background!”
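
A very simple way to make the object/background split concrete is intensity thresholding. The sketch below is a toy example, assuming a bright object on a dark background; the function name `segment`, the threshold value, and the pixel data are hypothetical, and real segmentation methods are considerably more involved.

```python
def segment(image, threshold):
    """Label each pixel 1 (object) if it is brighter than the threshold,
    otherwise 0 (background)."""
    return [[1 if px > threshold else 0 for px in row] for row in image]


# Toy grayscale image: a bright 2x2 object on a dark, slightly noisy background.
image = [[10, 12, 11, 10],
         [11, 200, 210, 12],
         [10, 205, 198, 11],
         [12, 10, 11, 10]]

mask = segment(image, threshold=100)
object_pixels = sum(map(sum, mask))

for row in mask:
    print(row)
print("object pixels:", object_pixels)  # object pixels: 4
```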

High-Level Vision

Recognition

“It’s a chair!”

Vision and Other Fields

[Diagram: Computer Vision at the center, connected to Cognitive Psychology, Signal Processing, Computer Graphics, Pattern Analysis, and Metrology]

Questions?

[email protected]
