DIP L01 Introduction

Uploaded by

adeeltanolimodernite

CSE-408 (Digital Image Processing)

Instructor: Dr. Muhammad Abeer Irfan

DCSE
Have you ever used image processing and computer vision?
Where? How?
Have you ever used computer vision?
How? Where?
Reconstruction? Recognition? (Re)organization?

Think-Pair-Share
Laptop: Biometrics auto-login (face recognition, 3D), OCR
Smartphones: QR codes, computational photography (Android Lens Blur, iPhone
Portrait Mode), panorama construction (Google Photo Spheres), face detection,
expression detection (smile), Snapchat filters (face tracking), FaceID (iPhone), Night
Sight (Pixel), iPhone 12 Pro (LiDAR)
Web: Image search, Google photos (face recognition, object recognition, scene
recognition, geolocalization from vision), Facebook (image captioning), Google maps
aerial imaging (image stitching), YouTube (content categorization)
VR/AR: Outside-in tracking (HTC VIVE), inside-out tracking (simultaneous localization
and mapping, HoloLens), object occlusion (dense depth estimation)
Motion: Kinect, full body tracking of skeleton, gesture recognition, virtual try-on
Medical imaging: CAT / MRI reconstruction, assisted diagnosis, automatic pathology,
connectomics, endoscopic surgery
Industry: Vision-based robotics (marker-based), machine-assisted router (jig),
automated post, ANPR (number plates), surveillance, drones, shopping
Transportation: Assisted driving (everything), face tracking/iris dilation for
drunkenness, drowsiness, automated distribution (all modes)
Media: Visual effects for film, TV (reconstruction), virtual sports replay
(reconstruction), semantics-based auto edits (reconstruction, recognition)
Optical character recognition (OCR)
Technology to convert images of text into text
If you have a scanner, it probably came with OCR software
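
As a rough illustration of what "converting images of text into text" means, the sketch below classifies tiny binary glyph images by comparing them pixel by pixel against stored templates and picking the closest one. This is a toy example under stated assumptions, not how any scanner's OCR software works: the 3x5 bitmaps, the `TEMPLATES` table, and the function names are all hypothetical, and real OCR systems add segmentation and far more robust learned classifiers.

```python
# Toy OCR sketch: classify a small binary glyph by nearest-template matching.
# TEMPLATES, hamming(), recognize() and read_text() are hypothetical names
# introduced for illustration only.

TEMPLATES = {
    "0": ["###",
          "#.#",
          "#.#",
          "#.#",
          "###"],
    "1": [".#.",
          "##.",
          ".#.",
          ".#.",
          "###"],
    "7": ["###",
          "..#",
          ".#.",
          ".#.",
          ".#."],
}


def hamming(a, b):
    """Count pixels that differ between two equally sized glyph bitmaps."""
    return sum(pa != pb
               for row_a, row_b in zip(a, b)
               for pa, pb in zip(row_a, row_b))


def recognize(glyph):
    """Return the label of the template that differs from `glyph` in the fewest pixels."""
    return min(TEMPLATES, key=lambda label: hamming(glyph, TEMPLATES[label]))


def read_text(glyphs):
    """Convert a sequence of glyph bitmaps into a string, one character per glyph."""
    return "".join(recognize(g) for g in glyphs)


# A noisy "7": one pixel differs from the stored template.
noisy_seven = ["###",
               "..#",
               ".#.",
               "##.",
               ".#."]

if __name__ == "__main__":
    print(read_text([TEMPLATES["1"], TEMPLATES["0"], noisy_seven]))
```

The noisy glyph is still read as "7" because it differs from that template in only one pixel, while it differs from the other templates in many more.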

Live camera translation

Mail digit recognition, AT&T labs

http://www.research.att.com/~yann/

License plate readers

http://en.wikipedia.org/wiki/Automatic_number_plate_recognition
JH
Face detection

• Almost all digital cameras detect faces

• Snapchat face filters
Facial login without a password…

Liang et al. 2014
Vision-based biometrics

“How the Afghan Girl was Identified by Her Iris Patterns”

Read the story (Wikipedia)

JH
Smile detection

Sony Cyber-shot® T70 Digital Still Camera

JH
Video call eye gaze correction
Kuster et al., SIGGRAPH Asia 2012
– https://cgl.ethz.ch/publications/papers/paperKus12.php

Apple FaceTime
Attention Correction
Object recognition (in mobile phones)
e.g., Google Lens
Object recognition (in supermarkets)
How does it work? Think-Pair-Share
Source: Vivek Ramanujan
3D from images

Building Rome in a Day: Agarwal et al. 2009

Human shape capture
Special effects: motion capture
Interactive Games
Object Recognition: http://www.youtube.com/watch?feature=iv&v=fQ59dXOo63o
Mario: http://www.youtube.com/watch?v=8CTJL5lUjHg
3D: http://www.youtube.com/watch?v=7QrnwoO1-8A
Robot: http://www.youtube.com/watch?v=w8BmgtMKFbY

JH
Sports

Virtual pitch markings
Free viewpoint video

Sportvision first down line [Canon 2017]

Nice explanation on www.howstuffworks.com

JH
Medical imaging

Image guided surgery

3D imaging
Grimson et al., MIT
MRI, CT

JH
AutoCars - Uber bought CMU’s lab (2015)
Then sold it (2020)
http://www.robocup.org/
Mobile robots
Saxena et al. 2008
STAIR at Stanford

Skydio 2 drone
6x fisheye cameras for obstacle avoidance
Onboard NVIDIA GPU
Vision in space

NASA's Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.

Vision systems (JPL) used for several tasks

• Panorama stitching
• 3D terrain modeling
• Obstacle detection, position tracking
• For more, read “Computer Vision on Mars” by Matthies et al.
JH
NASA Perseverance lander and rover
Landed 18th February 2021
Humanoid Robots

https://blog.bostondynamics.com/flipping-the-script-with-atlas, Boston Dynamics (2021)

Augmented Reality and Virtual Reality

MS HoloLens, Oculus, Magic Leap, ARCore / ARKit
Real-time monocular depth estimation and camera tracking
Real-time 3D hand pose estimation

Oculus (Quest)
Niantic
AI for Physical Interaction

Boston Dynamics (2017)
