Computer_Vision_1_introduction

The document provides an overview of computer vision, emphasizing its ability to analyze and interpret images and videos using AI algorithms, which aims to replicate human vision capabilities. It discusses the differences between human and computer vision, the processing levels involved in computer vision (low, mid, and high), and various applications such as object detection, facial recognition, and scene reconstruction. Additionally, it highlights the importance of feature extraction and matching in enhancing computer vision performance.


Computer Vision

Introduction
Computer Vision in general
• Analysis and understanding of single or multiple images
• Use single or multiple cameras, apply pre-processing
and then apply pattern recognition or AI algorithms for decision making
• Computer vision - Image/video as inputs and output is interpretation
• Image processing - input is image and output is also an image
Computer Vision

• Objective is to see objects like humans and possibly even better


• Amount of data generated is tremendous - more than 3 billion images/day are
shared online
• Data can be used to train deep learning models to make computer vision better
• Requires high level understanding of digital images or videos
• Computer vision is a subfield of artificial intelligence that increasingly relies on deep learning
• Algorithms make computers see and interpret the scene/ images
Image
• Humans can
• observe in a few seconds
• process and take intelligent decisions
• perform tasks effortlessly and effectively
• For computers, this is neither fast nor easy
• Computer vision enables computers to see the world in the same way humans do
Features of Human Vision (stereo vision, shape)

• The two eyes measure slightly different distances to the same point
• Using this difference, depth is calculated
• Stereo vision is required for depth calculation

Shape similarity
Features of Human Vision (texture, color)
Different texture patterns give different shape perceptions; objects can also be identified based on color
Features of Human Vision (object recognition)
Recognize a friend in a photograph taken many years ago

• Humans can recognize a face under different illumination, viewpoints, expressions, etc.
• There is no limit on how many faces we can store in our brain for future recognition
Features of Human Vision (object identification)

• Human vision can infer the context and key information
• Computer vision is a more difficult task than human vision


Features of Human Vision (object identification)

Humans and computer vision both identify objects

• Humans do this easily; for computer vision, it is a challenge
• The algorithm has to be changed to identify each new object

Human vision is more powerful than computer vision

• What humans do effortlessly remains a challenge for computer vision
• The algorithm must be changed for each new identification task
Human Vision
• Human vision can provide
• Depth perception
• Relative position/ occlusion
• Shading of objects
• Sharpness of edges of objects
• Size and shape of objects
• Structure of object
• Limitations of Human Vision
• limited memory
• limited to visible spectrum
• illusion
•…
Limitations of Human Vision

• Humans are able to establish the context, but complete observation takes time
• Computer vision can interpret and make complete observations within a short time
Limitations of Human Vision

Difference in distance leads to difference in perception


Limitations of Human Vision

• The sizes of the orange circles appear to be different even though they are equal
• Humans interpolate objects from their surroundings
Computer Vision vs Image Processing
• Image processing is image-to-image transformation
• Typical image processing operations include
• image compression
• image restoration
• image enhancement
• Most computer vision algorithms work on images that have already been pre-processed to
improve image quality
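As a concrete image-to-image example, contrast stretching is a typical enhancement step. The sketch below is illustrative only (NumPy, with a made-up `contrast_stretch` helper), not from the slides:

```python
import numpy as np

def contrast_stretch(img, out_min=0, out_max=255):
    """Linearly rescale pixel intensities to the full output range.
    An image-to-image transformation: the input is an image and so is the output."""
    lo, hi = int(img.min()), int(img.max())
    if hi == lo:                      # flat image: nothing to stretch
        return np.full_like(img, out_min)
    scaled = (img.astype(np.float64) - lo) / (hi - lo)
    return np.round(scaled * (out_max - out_min) + out_min).astype(np.uint8)

# A dim 2x2 grayscale patch (values 50..100) stretched to the full 0..255 range
patch = np.array([[50, 60], [80, 100]], dtype=np.uint8)
print(contrast_stretch(patch))
```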
Applications
• Object
• Classification: Broad category of object in image
• Identification: Type of a given object in image
• Detection: Check whether object exists in image
• Landmark Detection: Identify key points of the objects
• Segmentation: Identify pixels belonging to objects
• Recognition: Existence and location of objects
• Video motion analysis to estimate the velocity of objects in a video, or of the camera itself
• Scene reconstruction to create a 3D model of a scene captured in the form of
images or video
Specific Applications

Most of the applications are based on features of image


Features of image

• Features of a region of an image are used to represent the region
• Edges are features
• Corners are more localized and can be used to generate features
• Corner points are called key points
• Features are generated at key points
• Feature matching relates the features of a similar region of one image with those of
another image
Feature Matching
• Feature matching is used for object identification
• Steps for feature matching
• Detection of keypoints:
• Harris Corner Detection, SIFT, and SURF
• Local descriptors:
• Region surrounding each keypoint is captured and local descriptors are obtained
• Feature matching:
• Derive features from local descriptors
• Match features in the corresponding images
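The matching step above can be sketched as brute-force nearest-neighbour search over descriptors with Lowe's ratio test. This is a minimal NumPy illustration with toy 2-D descriptors; real systems use high-dimensional SIFT/SURF descriptors and optimized matchers:

```python
import numpy as np

def match_descriptors(desc_a, desc_b, ratio=0.75):
    """Brute-force nearest-neighbour matching with Lowe's ratio test.
    desc_a, desc_b: (n, d) arrays of local descriptors (e.g. from SIFT)."""
    matches = []
    for i, d in enumerate(desc_a):
        dists = np.linalg.norm(desc_b - d, axis=1)   # distance to every candidate
        order = np.argsort(dists)
        best, second = order[0], order[1]
        # Keep the match only if it is clearly better than the runner-up
        if dists[best] < ratio * dists[second]:
            matches.append((i, int(best)))
    return matches

# Two toy 2-D descriptors per image; each should match its nearest counterpart
desc_a = np.array([[0.0, 0.0], [10.0, 10.0]])
desc_b = np.array([[0.5, 0.0], [9.0, 10.0], [100.0, 100.0]])
print(match_descriptors(desc_a, desc_b))  # [(0, 0), (1, 1)]
```

The ratio test rejects ambiguous matches: if the second-best candidate is nearly as close as the best one, the match is discarded.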
Scene Reconstruction
• Digital 3D reconstruction of an object from a photograph
Video Motion Analysis
• The study of moving objects and their trajectories
• Motion analysis is a combination of
• Object detection
• Tracking
• Segmentation
• Estimation of movement
• Human motion analysis is used in areas like
• Sports
• Intelligent video analytics etc
• Manufacturing
• Count and track microorganisms like bacteria and viruses
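A minimal sketch of the "estimation of movement" step, assuming the detection and tracking steps above have already produced a list of per-frame object centroids (the function and values here are illustrative, not from the slides):

```python
def estimate_velocity(centroids, fps):
    """Average velocity of a tracked object from its per-frame centroids.
    centroids: [(x, y), ...] one entry per frame; returns (vx, vy) in px/s."""
    (x0, y0), (x1, y1) = centroids[0], centroids[-1]
    dt = (len(centroids) - 1) / fps             # elapsed time in seconds
    return ((x1 - x0) / dt, (y1 - y0) / dt)

# Object moves 30 px to the right over 3 frame intervals at 30 fps
track = [(0, 0), (10, 0), (20, 0), (30, 0)]
vx, vy = estimate_velocity(track, fps=30)
print(vx, vy)
```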
Real-world computer vision applications
• Self-driving cars (allows self-driving cars to safely steer through streets and
highways)
• Facial recognition (match images of people’s faces to their identities)
• Augmented reality (mix virtual objects with real-world images)
• Medical imaging (scan X-rays, MRIs, and ultrasounds to detect health problems)
• Quality inspection (spot defective products on the assembly line and prevent them from
shipping to customers)
• Intelligent Video Analytics
• Manufacturing and Construction
• OCR
• Retail
• Banks use it to verify customers’ identities before conducting large transactions
Levels of processing for computer Vision

• Low Level Processing


• Mid Level Processing
• High Level Processing
Low Level Processing
• Image enhancement
• Edge detection, corner detection, filtering, and morphology are applied

Texture to determine repetitive pattern


Edge Detection
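The edge-detection step can be sketched with hand-rolled 3x3 Sobel kernels. This is a minimal NumPy illustration of the idea; a real pipeline would use an optimized library routine:

```python
import numpy as np

def sobel_magnitude(img):
    """Gradient magnitude from 3x3 Sobel kernels (valid region only)."""
    kx = np.array([[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]], dtype=np.float64)
    ky = kx.T                                   # vertical-gradient kernel
    h, w = img.shape
    gx = np.zeros((h - 2, w - 2))
    gy = np.zeros((h - 2, w - 2))
    for i in range(h - 2):
        for j in range(w - 2):
            win = img[i:i + 3, j:j + 3].astype(np.float64)
            gx[i, j] = np.sum(win * kx)         # horizontal intensity change
            gy[i, j] = np.sum(win * ky)         # vertical intensity change
    return np.hypot(gx, gy)

# A vertical step edge: left half dark, right half bright
img = np.zeros((5, 6))
img[:, 3:] = 255
mag = sobel_magnitude(img)                      # strong response along the edge
```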
Low Level Processing

Low Level Features

Lines, corners, salient points


Low Level Processing
Image Matching

Image Stitching
Low level features/ vision

Boundary detection: variation in texture information determines the shape of objects


Low level features/ vision

Stereo images are used to obtain depth information

• Images from the left and right cameras are used to determine a disparity map, which gives
depth information
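For a rectified stereo pair, disparity converts to depth via Z = f * B / d, where f is the focal length in pixels and B the baseline between the cameras. A small sketch (the focal length and baseline values below are made up for illustration):

```python
def depth_from_disparity(disparity_px, focal_px, baseline_m):
    """Triangulate depth as Z = f * B / d for a rectified stereo pair.
    disparity_px: horizontal shift of the same point between the left and
    right images; a larger disparity means a closer surface."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px

# Hypothetical rig: 700 px focal length, 0.1 m baseline between the cameras
print(depth_from_disparity(35, 700, 0.1))  # a point 2.0 m away
```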
Low level features/ vision
Structure from motion

• For a moving object, a surface closer to the camera moves faster than a farther surface
• This helps in inferring shape/depth information
Mid Level Processing
• Segmentation by breaking images/ videos into useful pieces followed by interpretation
• Find video sequences that correspond to one scenario
• Keep track of moving object

Find correspondence between frames through a


sequence of video frames

Track object using background


subtraction
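Background subtraction, the simplest of the tracking tools mentioned above, can be sketched as thresholded frame differencing against a static background model (an illustrative NumPy sketch, not a production tracker):

```python
import numpy as np

def moving_mask(frame, background, threshold=25):
    """Background subtraction: flag pixels that differ from a static
    background model by more than `threshold` as moving."""
    diff = np.abs(frame.astype(np.int16) - background.astype(np.int16))
    return diff > threshold

background = np.zeros((4, 4), dtype=np.uint8)   # static dark scene
frame = background.copy()
frame[1:3, 1:3] = 200                           # a bright object appears
mask = moving_mask(frame, background)
print(int(mask.sum()))  # 4 pixels flagged as moving
```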
Mid Level Processing
Object tracking: grouping objects which have similar optical flow

• Detect the area of interest and predict where the object will be in the next frame
Mid Level Processing

• K-means clustering (k=7)
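The k-means segmentation idea can be sketched on raw pixel colors with a minimal Lloyd's-algorithm loop (`kmeans_colors` is an illustrative helper; k=2 here rather than the k=7 of the slide, to keep the toy data small):

```python
import numpy as np

def kmeans_colors(pixels, k, iters=10, seed=0):
    """Lloyd's k-means on pixel colors: assign every pixel to its nearest
    centroid, recompute the centroids, and repeat."""
    rng = np.random.default_rng(seed)
    centroids = pixels[rng.choice(len(pixels), k, replace=False)].astype(float)
    for _ in range(iters):
        # distance from every pixel to every centroid
        d = np.linalg.norm(pixels[:, None, :] - centroids[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        for c in range(k):
            if np.any(labels == c):             # avoid emptying a cluster
                centroids[c] = pixels[labels == c].mean(axis=0)
    return labels, centroids

# Two clearly separated color groups; k=2 should recover them
pixels = np.array([[0, 0, 0], [10, 0, 0], [250, 250, 250], [240, 250, 255]], float)
labels, _ = kmeans_colors(pixels, k=2)
```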


High Level Processing (Image Understanding)
• Generated from low-level features
• Contain more complicated details about image/video
• Reconstruct, interpret and understand a 3D scene from its 2D images in terms of
the properties of the structures present in the scene
• Ex: Convolutional neural networks (CNNs), Recurrent Neural Networks (RNNs) for
learning high-level features
High Level Processing (Image Understanding)
Theme understanding

• Does it have people?
• Is it a marketplace, a football ground, or a garden?
• Identify the location of the bicycle
• What objects are present in the image?
• Draw a bounding box around each label
• Classify the object as a building (easy): is it a house or a shop?
High Level Processing (Image Understanding)
Example: Event recognition
• A video has a scene of clapping and a person cutting a cake
• If it is a birthday party, the birthday boy is cutting the cake
Visual recognition:
• Classifying images/ videos, localize objects
• Classify human activities

• Identify action
• Predict next action
Difference between Low and High Level Processing
• Low-level features are characteristics extracted from an image, such as colors, edges, and
textures
• High-level features are extracted from low-level features and denote more meaningful
concepts

Low level: lines, corners, edges. High level: faces with expressions


Difference between Low and High Level Processing
Content
• Low level: related to the raw pixel data of the image; more sensitive to noise and changes in the image
• High level: more robust; a higher level of understanding of the image content
Scale
• Low level: typically retrieved at a local scale; vulnerable to small modifications of the picture, like lighting or orientation
• High level: frequently retrieved at a global scale; takes the whole image/video into account and is more robust
Resources
• Low level: feature extraction usually takes fewer system resources than high-level feature extraction
• High level: requires more advanced machine learning methods
Task specificity
• Low level: frequently task-specific, appropriate for a certain set of activities
• High level: frequently more generic and suitable for a broader range of jobs
Useful for
• Low level: image segmentation, object detection, and feature matching
• High level: image classification, object recognition, and scene understanding