Image Features and Categorization: Computer Vision Jia-Bin Huang, Virginia Tech

Image categorization involves mapping images to categories or labels based on their visual content. This can be done by first extracting image features that encode visual properties like color, texture, shapes, etc. These features are then fed into a classifier which learns during a training phase to map features to categories/labels. The trained classifier can then take features from new, unlabeled images and predict their categories. Common applications include object, scene, and fine-grained recognition, as well as semantic segmentation.

Image Features and Categorization
Computer Vision
Jia-Bin Huang, Virginia Tech
Administrative stuff

• Final project proposal

• Due 11:55 PM on Mon, Oct 29
• Find group members on Piazza.
• Submission via Canvas

• HW 4
• Due 11:55pm on Wed, Oct 31

• Demo of modern interactive image segmentation

Review: Interpreting Intensity
• Light and color
–What an image records
• Filtering in spatial domain
• Filtering = weighted sum of neighboring pixels
• Smoothing, sharpening, measuring texture
• Filtering in frequency domain
• Filtering = change frequency of the input image
• Denoising, sampling, image compression
• Image pyramid and template matching
• Filtering = a way to find a template
• Image pyramids for coarse-to-fine search and multi-scale detection
• Edge detection
• Canny edge = smooth -> derivative -> thin -> threshold -> link
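
As a quick refresher on "filtering = weighted sum of neighboring pixels", here is a minimal sketch of spatial filtering with a 3x3 box (smoothing) kernel. The function name `filter2d`, the zero padding, and the toy impulse image are illustrative assumptions, not part of the original slides.

```python
import numpy as np

def filter2d(image, kernel):
    """Correlate a 2D image with an odd-sized kernel, using zero padding."""
    kh, kw = kernel.shape
    ph, pw = kh // 2, kw // 2
    padded = np.pad(image, ((ph, ph), (pw, pw)), mode="constant")
    out = np.zeros(image.shape, dtype=float)
    for r in range(image.shape[0]):
        for c in range(image.shape[1]):
            # Each output pixel is a weighted sum of its neighborhood.
            window = padded[r:r + kh, c:c + kw]
            out[r, c] = np.sum(window * kernel)
    return out

# Toy example (assumption): a single bright pixel smoothed by a box filter.
box = np.full((3, 3), 1.0 / 9.0)
image = np.zeros((5, 5))
image[2, 2] = 9.0
smoothed = filter2d(image, box)
```

The box kernel spreads the impulse evenly over its 3x3 neighborhood; swapping in a different kernel would give sharpening or a texture response instead.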
Review: Correspondence and Alignment
• Interest points
• Find distinct and repeatable points in images
• Harris-> corners, DoG -> blobs
• SIFT -> feature descriptor
• Feature tracking and optical flow
• Find motion of a keypoint/pixel over time
• Lucas-Kanade:
• brightness consistency, small motion, spatial coherence
• Handle large motion:
• iterative update + pyramid search
• Fitting and alignment
• Find the transformation parameters that best align matched points
• Object instance recognition
• Keypoint-based object instance recognition and search
Review: Perspective and 3D Geometry
• Projective geometry and camera models
• What’s the mapping between image and world coordinates? $x = K [R \; t] X$
• Single view metrology and camera calibration
• How can we measure the size of 3D objects in an image?
• How can we estimate the camera parameters?
• Photo stitching
• What’s the mapping from two images taken without camera translation?
• Epipolar Geometry and Stereo Vision
• What’s the mapping from two images taken with camera translation?
• Structure from motion
• How can we recover 3D points from multiple images?
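
The image-to-world mapping x = K [R t] X from the review above can be sketched in a few lines. The intrinsics, the identity rotation, and the sample points are made-up values for illustration only, and `project` is a hypothetical helper name.

```python
import numpy as np

def project(K, R, t, X_world):
    """Project Nx3 world points to Nx2 pixel coordinates via x = K [R t] X."""
    ones = np.ones((X_world.shape[0], 1))
    X_h = np.hstack([X_world, ones])            # homogeneous world points, Nx4
    P = K @ np.hstack([R, t.reshape(3, 1)])     # 3x4 projection matrix
    x_h = (P @ X_h.T).T                         # homogeneous image points, Nx3
    return x_h[:, :2] / x_h[:, 2:3]             # divide by the third coordinate

# Assumed camera: focal length 500 px, principal point (320, 240), no rotation.
K = np.array([[500.0, 0.0, 320.0],
              [0.0, 500.0, 240.0],
              [0.0, 0.0, 1.0]])
R = np.eye(3)
t = np.zeros(3)
points = np.array([[0.0, 0.0, 2.0], [1.0, -0.5, 4.0]])
pixels = project(K, R, t, points)
```

A point on the optical axis lands on the principal point, which is a handy sanity check when setting up a camera model.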
Review: Grouping and Segmentation
• Grouping and Segmentation
• How do we group pixels into meaningful regions?
• Uses of segmentation: efficiency, better features, object region proposals, or the segmented object itself

• EM Algorithm, Mixture of Gaussians

• How do we deal with missing data?
• Maximum likelihood estimation
• Probabilistic inference
• Expectation-Maximization algorithm

• MRFs and Graph Cut

• How do we encode pixel dependencies?
• Markov Random Fields
• Graph Cuts
Recognition and Learning
• Image Features and Categorization

• Foundations of Deep Learning

• Convolutional Neural Networks

• Object Detection

• Part and Pixel Labeling

• Action Recognition

• Vision and Language

Today: Image features and categorization

• General concepts of categorization

• Why? What? How?

• Image features
• Color, texture, gradient, shape, interest points
• Histograms, feature encoding, and pooling
• CNN as feature

• Image and region categorization

What do you see in this image?

[Figure: photo with labeled regions: Trees, Bear, Camera, Man, Rabbit, Grass, Forest]

Describe, predict, or interact with the object based on visual cues
• Is it dangerous?
• Is it alive?
• How fast does it run?
• Is it soft?
• Does it have a tail?
• Can I poke with it?
• Can I put stuff in it?
Why do we care about categories?
• From an object’s category, we can make predictions about its future behavior, beyond what is immediately perceived.
• Pointers to knowledge
• Help to understand individual cases not previously encountered
• Communication
Theory of categorization

How do we determine if something is a member of a particular category?

• Definitional approach

• Prototype approach

• Exemplar approach
Definitional approach: classical view of categories
• Plato & Aristotle
• Categories are defined by a list of properties shared by all elements in a category
• Category membership is binary
• Every member in the category is equal

[Figures: The Categories (Aristotle); Aristotle by Francesco Hayez]

Slide Credit: A. A. Efros

Prototype or sum of exemplars?
• Prototype model: category judgments are made by comparing a new exemplar to the prototype.
• Exemplars model: category judgments are made by comparing a new exemplar to all the old exemplars of a category, or to the exemplar that is the most appropriate.
Slide Credit: Torralba
Levels of categorization [Rosch 70s]
Definition of Basic Level:
• Similar shape: Basic level categories are the highest-level category for which their members have similar shapes.
• Similar motor interactions: … for which people interact with its members using similar motor sequences.
• Common attributes: … there are a significant number of attributes in common between pairs of members.

[Figure: category hierarchy. Superordinate levels: animal, quadruped. Basic level: dog, cat, cow. Subordinate level: German shepherd, Doberman, … “Fido” …]
[Figure: similarity plotted for the Sub, Basic, and Superordinate levels]
Rosch et al., Principles of Categorization, 1978
Image categorization

• Cat vs Dog
Image categorization
• Object recognition

Caltech 101 Average Object Images

Image categorization

• Fine-grained recognition

Visipedia Project
Image categorization
• Place recognition

Places Database [Zhou et al. NIPS 2014]

Image categorization
• Visual font recognition

[Chen et al. CVPR 2014]

Image categorization
• Dating historical photos

1940 1953 1966 1977

[Palermo et al. ECCV 2012]

Image categorization
• Image style recognition

[Karayev et al. BMVC 2014]

Region categorization
• Layout prediction

Assign regions to orientation

Geometric context [Hoiem et al. IJCV 2007]

Assign regions to depth

Make3D [Saxena et al. PAMI 2008]
Region categorization
• Semantic segmentation from RGBD images

[Silberman et al. ECCV 2012]

Region categorization
• Material recognition

[Bell et al. CVPR 2015]

Training phase
Training Images -> Image Features -> Classifier Training (with Training Labels) -> Trained Classifier
Testing phase
Training: Training Images -> Image Features -> Classifier Training (with Training Labels) -> Trained Classifier
Testing: Test Image -> Image Features -> Trained Classifier -> Prediction (e.g., Outdoor)
• Image features: map images to feature space
[Figure: x and o samples plotted in a 2D feature space with axes x1 and x2]
• Classifiers: map feature space to label space
[Figure: two panels of the x and o samples in the x1-x2 feature space]
Different types of classification
• Exemplar-based: transfer category labels from examples with most similar features
• What similarity function? What parameters?
• Linear classifier: confidence in positive label is a weighted sum of features
• What are the weights?
• Non-linear classifier: predictions based on more complex function of features
• What form does the classifier take? Parameters?
• Generative classifier: assign to the label that best explains the features (makes features most likely)
• What is the probability function and its parameters?
Note: You can always fully design the classifier by hand, but usually this is too difficult. Typical solution: learn from training examples.
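
To make two of the classifier types above concrete, here is a minimal sketch of an exemplar-based (1-nearest-neighbor) classifier and a linear classifier on 2D features. The toy data, the Euclidean similarity, and the hand-picked weights are all assumptions for illustration; in practice the weights would be learned from training examples.

```python
import numpy as np

def nearest_neighbor_predict(train_x, train_y, test_x):
    """Exemplar-based: copy the label of the closest training example."""
    d2 = ((test_x[:, None, :] - train_x[None, :, :]) ** 2).sum(axis=2)
    return train_y[np.argmin(d2, axis=1)]

def linear_predict(weights, bias, test_x):
    """Linear: the score is a weighted sum of features; positive means label 1."""
    scores = test_x @ weights + bias
    return (scores > 0).astype(int)

# Toy training set (assumption): label 0 near the origin, label 1 near (3.5, 3).
train_x = np.array([[0.0, 0.0], [0.0, 1.0], [3.0, 3.0], [4.0, 3.0]])
train_y = np.array([0, 0, 1, 1])
test_x = np.array([[0.5, 0.5], [3.5, 2.5]])

nn_labels = nearest_neighbor_predict(train_x, train_y, test_x)
# Hand-designed weights (assumption): decision boundary x1 + x2 = 3.5.
lin_labels = linear_predict(np.array([1.0, 1.0]), -3.5, test_x)
```

Both classifiers agree on this easy data; they differ in what has to be chosen: a similarity function for the exemplar-based one, a weight vector for the linear one.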
Testing phase
Training: Training Images -> Image Features -> Classifier Training (with Training Labels) -> Trained Classifier
Testing: Test Image -> Image Features -> Trained Classifier -> Prediction (e.g., Outdoor)
Q: What are good features for…
• recognizing a beach?
Q: What are good features for…
• recognizing cloth fabric?
Q: What are good features for…
• recognizing a mug?
What are the right features?

Depend on what you want to know!

• Object: shape
• Local shape info, shading, shadows, texture
• Scene : geometric layout
• linear perspective, gradients, line segments
• Material properties: albedo, feel, hardness
• Color, texture
• Action: motion
• Optical flow, tracked points
General principles of representation
• Coverage
• Ensure that all relevant info is captured

• Concision
• Minimize number of features without sacrificing coverage

• Directness
• Ideal features are independently useful for prediction
Image representations

• Templates
• Intensity, gradients, etc.

• Histograms
• Color, texture, SIFT descriptors, etc.
[Figure: image, intensity template, gradient template]

• Average of features
Image representations: histograms

Global histogram
- Represent distribution of features
• Color, texture, depth, …
[Figure: example image labeled Space Shuttle Cargo Bay]
Images from Dave Kauchak
Image representations: histograms
• Data samples in 2D
[Figure: samples scattered over Feature 1 and Feature 2]
Image representations: histograms
• Probability or count of data in each bin
• Marginal histogram on feature 1
[Figure: bins along the Feature 1 axis]
Image representations: histograms
• Marginal histogram on feature 2
[Figure: bins along the Feature 2 axis]
Image representations: histograms
• Joint histogram
[Figure: 2D grid of bins over Feature 1 and Feature 2]
Modeling multi-dimensional data
[Figure: joint and marginal binning of samples over Feature 1 and Feature 2]

Joint histogram
• Requires lots of data
• Loss of resolution to avoid empty bins

Marginal histogram
• Requires independent features
• More data/bin than joint histogram
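
The joint-versus-marginal trade-off can be checked numerically. This is a small sketch on synthetic uniform samples (an assumption; real features would come from images), using NumPy's `histogram` and `histogram2d`.

```python
import numpy as np

rng = np.random.default_rng(0)
samples = rng.random((200, 2))      # 200 samples, two features in [0, 1)
bins = 4
edges = np.linspace(0.0, 1.0, bins + 1)

# Joint histogram: one cell per (feature 1 bin, feature 2 bin) pair.
joint, _, _ = np.histogram2d(samples[:, 0], samples[:, 1], bins=[edges, edges])
# Marginal histograms: one set of bins per feature.
marginal_1, _ = np.histogram(samples[:, 0], bins=edges)
marginal_2, _ = np.histogram(samples[:, 1], bins=edges)

# The joint histogram spreads the same samples over bins**2 cells,
# so each cell holds fewer samples on average than a marginal bin.
avg_per_joint_bin = joint.sum() / joint.size
avg_per_marginal_bin = marginal_1.sum() / bins
```

Summing the joint histogram along one axis recovers the corresponding marginal, but the reverse is only possible when the features are independent.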
Modeling multi-dimensional data
• Clustering
• Use the same cluster centers for all images
[Figure: the Feature 1 vs. Feature 2 plane partitioned into bins around cluster centers]
Computing histogram distance
• Histogram intersection

$$\mathrm{histint}(h_i, h_j) = 1 - \sum_{m=1}^{K} \min\big(h_i(m), h_j(m)\big)$$

• Chi-squared Histogram matching distance

$$\chi^2(h_i, h_j) = \frac{1}{2} \sum_{m=1}^{K} \frac{[h_i(m) - h_j(m)]^2}{h_i(m) + h_j(m)}$$

• Earth mover’s distance

(Cross-bin similarity measure)
• minimal cost paid to transform one distribution into the other

[Rubner et al. The Earth Mover's Distance as a Metric for Image Retrieval, IJCV 2000]
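
The three distances above can be sketched as follows. The histograms are assumed to be normalized to sum to 1. The `eps` guard against empty bins and the 1D shortcut for the earth mover's distance (valid for equal-mass 1D histograms with unit spacing between adjacent bins) are simplifying assumptions, not the general formulation from the cited paper.

```python
import numpy as np

def histogram_intersection_distance(h_i, h_j):
    """1 - sum_m min(h_i(m), h_j(m)); assumes both histograms sum to 1."""
    return 1.0 - np.minimum(h_i, h_j).sum()

def chi_squared_distance(h_i, h_j, eps=1e-12):
    """0.5 * sum_m (h_i(m) - h_j(m))^2 / (h_i(m) + h_j(m)); eps guards empty bins."""
    num = (h_i - h_j) ** 2
    den = h_i + h_j + eps
    return 0.5 * (num / den).sum()

def emd_1d(h_i, h_j):
    """1D earth mover's distance: sum of absolute cumulative-sum differences."""
    return np.abs(np.cumsum(h_i) - np.cumsum(h_j)).sum()

# Toy histograms (assumption): all mass in one bin, moved near or far.
a = np.array([1.0, 0.0, 0.0, 0.0])
near = np.array([0.0, 1.0, 0.0, 0.0])
far = np.array([0.0, 0.0, 0.0, 1.0])

bin_wise = (histogram_intersection_distance(a, near),
            histogram_intersection_distance(a, far))
cross_bin = (emd_1d(a, near), emd_1d(a, far))
```

The bin-wise measures treat `near` and `far` as equally different from `a`, while the cross-bin measure grows with how far the mass has to move.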
Histograms: implementation issues
• Quantization
• Grids: fast but applicable only with few dimensions
• Clustering: slower but can quantize data in higher dimensions

• Few bins: need less data, coarser representation
• Many bins: need more data, finer representation

• Matching
• Histogram intersection or Euclidean may be faster
• Chi-squared often works better
• Earth mover’s distance works well when nearby bins represent similar values
What kind of things do we compute histograms of?

• Color

L*a*b* color space, HSV color space

• Texture (filter banks or HOG over regions)
What kind of things do we compute histograms of?
• Histograms of descriptors

SIFT – [Lowe IJCV 2004]

• “Bag of visual words”

Analogy to documents
Of all the sensory impressions proceeding to the brain, the visual experiences are the dominant ones. Our perception of the world around us is based essentially on the messages that reach the brain from our eyes. For a long time it was thought that the retinal image was transmitted point by point to visual centers in the brain; the cerebral cortex was a movie screen, so to speak, upon which the image in the eye was projected. Through the discoveries of Hubel and Wiesel we now know that behind the origin of the visual perception in the brain there is a considerably more complicated course of events. By following the visual impulses along their path to the various cell layers of the optical cortex, Hubel and Wiesel have been able to demonstrate that the message about the image falling on the retina undergoes a step-wise analysis in a system of nerve cells stored in columns. In this system each cell has its specific function and is responsible for a specific detail in the pattern of the retinal image.
Keywords: sensory, brain, visual, perception, retinal, cerebral cortex, eye, cell, optical nerve, image, Hubel, Wiesel

China is forecasting a trade surplus of $90bn (£51bn) to $100bn this year, a threefold increase on 2004's $32bn. The Commerce Ministry said the surplus would be created by a predicted 30% jump in exports to $750bn, compared with a 18% rise in imports to $660bn. The figures are likely to further annoy the US, which has long argued that China's exports are unfairly helped by a deliberately undervalued yuan. Beijing agrees the surplus is too high, but says the yuan is only one factor. Bank of China governor Zhou Xiaochuan said the country also needed to do more to boost domestic demand so more goods stayed within the country. China increased the value of the yuan against the dollar by 2.1% in July and permitted it to trade within a narrow band, but the US wants the yuan to be allowed to trade freely. However, Beijing has made it clear that it will take its time and tread carefully before allowing the yuan to rise further in value.
Keywords: China, trade, surplus, commerce, exports, imports, US, yuan, bank, domestic, foreign, increase, trade, value

ICCV 2005 short course, L. Fei-Fei

Bag of visual words

• Image patches

• BoW histogram

• Codewords
Image categorization with bag of words

Training
1. Extract keypoints and descriptors for all training images
2. Cluster descriptors
3. Quantize descriptors using cluster centers to get “visual words”
4. Represent each image by normalized counts of “visual words”
5. Train classifier on labeled examples using histogram values as features

Testing
6. Extract keypoints/descriptors and quantize into visual words
7. Compute visual word histogram
8. Compute label or confidence using classifier
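
Steps 2 to 4 and 6 to 7 above can be sketched with plain NumPy. Random 8-dimensional vectors stand in for real keypoint descriptors such as SIFT, and the small `kmeans`, `assign`, and `bow_histogram` helpers are hypothetical names written for this example; the classifier in steps 5 and 8 is left out.

```python
import numpy as np

def assign(descriptors, centers):
    """Quantize: index of the nearest cluster center for each descriptor."""
    d2 = ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.argmin(d2, axis=1)

def kmeans(data, k, iters=20, seed=0):
    """Plain k-means; returns k cluster centers (the visual vocabulary)."""
    rng = np.random.default_rng(seed)
    centers = data[rng.choice(len(data), size=k, replace=False)].copy()
    for _ in range(iters):
        labels = assign(data, centers)
        for j in range(k):
            members = data[labels == j]
            if len(members) > 0:          # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return centers

def bow_histogram(descriptors, centers):
    """Normalized counts of visual words for one image."""
    words = assign(descriptors, centers)
    counts = np.bincount(words, minlength=len(centers)).astype(float)
    return counts / counts.sum()

# Stand-in data (assumption): 6 training images with 50 descriptors each.
rng = np.random.default_rng(1)
train_descriptors = [rng.normal(size=(50, 8)) for _ in range(6)]
centers = kmeans(np.vstack(train_descriptors), k=5)
train_hists = np.array([bow_histogram(d, centers) for d in train_descriptors])
test_hist = bow_histogram(rng.normal(size=(40, 8)), centers)
```

The same cluster centers are reused at test time, so training and test histograms live in the same feature space and can be fed to any classifier.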
Bag of visual words image classification

[Chatfield et al. BMVC 2011]

Feature encoding
• Hard/soft assignment to clusters

• Histogram encoding
• Kernel codebook encoding
• Locality constrained encoding
• Fisher encoding

[Chatfield et al. BMVC 2011]
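
The difference between hard and soft assignment can be illustrated as follows. The soft version uses a Gaussian kernel on the distance to each center, in the spirit of kernel codebook encoding; the bandwidth `sigma` and the toy centers are assumptions, and the cited paper should be consulted for the exact encodings it evaluates.

```python
import numpy as np

def squared_distances(descriptors, centers):
    return ((descriptors[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)

def hard_assignment(descriptors, centers):
    """One-hot code: all weight goes to the nearest center."""
    d2 = squared_distances(descriptors, centers)
    codes = np.zeros_like(d2)
    codes[np.arange(len(descriptors)), np.argmin(d2, axis=1)] = 1.0
    return codes

def soft_assignment(descriptors, centers, sigma=1.0):
    """Gaussian-kernel weights to every center, normalized to sum to 1."""
    d2 = squared_distances(descriptors, centers)
    weights = np.exp(-d2 / (2.0 * sigma ** 2))
    return weights / weights.sum(axis=1, keepdims=True)

centers = np.array([[0.0, 0.0], [2.0, 0.0]])
# The second descriptor sits exactly halfway between the two centers.
descriptors = np.array([[0.1, 0.0], [1.0, 0.0]])
hard = hard_assignment(descriptors, centers)
soft = soft_assignment(descriptors, centers)
```

For the halfway descriptor, hard assignment has to pick one center arbitrarily, while soft assignment splits the weight evenly, which is the quantization ambiguity that soft encodings try to reduce.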
Fisher vector encoding
• Fit Gaussian Mixture Models

• Posterior probability

• First and second order differences to cluster k

[Perronnin et al. ECCV 2010]
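
For reference, a commonly used form of the three ingredients listed above is sketched below. It assumes a GMM with $K$ components, mixing weights $\pi_k$, means $\mu_k$, and diagonal standard deviations $\sigma_k$, fit to $D$-dimensional descriptors $x_1, \dots, x_N$; the symbols are chosen here for illustration, and the normalization constants should be checked against the cited paper.

```latex
% Posterior probability (soft assignment) of descriptor x_i to cluster k
q_{ik} = \frac{\pi_k \, \mathcal{N}(x_i ; \mu_k, \sigma_k^2)}
              {\sum_{j=1}^{K} \pi_j \, \mathcal{N}(x_i ; \mu_j, \sigma_j^2)}

% First-order differences to cluster k, for each dimension d = 1, ..., D
u_{kd} = \frac{1}{N \sqrt{\pi_k}} \sum_{i=1}^{N} q_{ik} \,
         \frac{x_{id} - \mu_{kd}}{\sigma_{kd}}

% Second-order differences to cluster k, for each dimension d = 1, ..., D
v_{kd} = \frac{1}{N \sqrt{2 \pi_k}} \sum_{i=1}^{N} q_{ik}
         \left[ \left( \frac{x_{id} - \mu_{kd}}{\sigma_{kd}} \right)^{2} - 1 \right]
```

Stacking $u_k$ and $v_k$ for all $K$ clusters gives a vector of length $2KD$, which is why Fisher encodings are much higher dimensional than a $K$-bin histogram.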

Performance comparisons

• Fisher vector encoding outperforms others

• Higher-order statistics help

[Chatfield et al. BMVC 2011]

But what about spatial layout?

All of these images have the same color histogram

Spatial pyramid

Compute histogram in each spatial bin

Spatial pyramid

High number of features – PCA to reduce dimensionality

[Lazebnik et al. CVPR 2006]
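
A minimal sketch of "compute histogram in each spatial bin": the input is assumed to be a 2D map of visual-word indices, and histograms over a 1x1 and a 2x2 grid are concatenated. The per-level weighting used in the cited paper is omitted here, and `spatial_pyramid` is a hypothetical helper name.

```python
import numpy as np

def spatial_pyramid(word_map, vocab_size, levels=2):
    """Concatenate visual-word histograms over a 1x1, 2x2, ... grid of cells."""
    h, w = word_map.shape
    feats = []
    for level in range(levels):
        cells = 2 ** level
        row_edges = np.linspace(0, h, cells + 1).astype(int)
        col_edges = np.linspace(0, w, cells + 1).astype(int)
        for r in range(cells):
            for c in range(cells):
                block = word_map[row_edges[r]:row_edges[r + 1],
                                 col_edges[c]:col_edges[c + 1]]
                feats.append(np.bincount(block.ravel(), minlength=vocab_size))
    return np.concatenate(feats).astype(float)

# Two toy 4x4 word maps (assumption) with identical global word counts
# but different spatial layouts.
left_right = np.array([[0, 0, 1, 1]] * 4)
top_bottom = np.array([[0, 0, 0, 0],
                       [0, 0, 0, 0],
                       [1, 1, 1, 1],
                       [1, 1, 1, 1]])
f_a = spatial_pyramid(left_right, vocab_size=2)
f_b = spatial_pyramid(top_bottom, vocab_size=2)
```

The first `vocab_size` entries (the global histogram) are identical for both layouts; only the finer level tells them apart. The feature length grows as `vocab_size * (1 + 4 + 16 + ...)`, which is why dimensionality reduction such as PCA is often applied afterwards.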

Pooling

• Average/max pooling
[Figure: pooled feature = avg/max over a region]
Source: Unsupervised Feature Learning and Deep Learning

• Second-order pooling [Joao et al. PAMI 2014]
[Figure: second-order features pooled by avg/max]
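
The pooling operations can be sketched on a small matrix of local features (one row per location in the region). The second-order version shown here averages outer products of the features; any further normalization used in the cited work is not reproduced, and the toy values are assumptions.

```python
import numpy as np

def average_pool(features):
    """Mean of each feature dimension over all locations."""
    return features.mean(axis=0)

def max_pool(features):
    """Maximum of each feature dimension over all locations."""
    return features.max(axis=0)

def second_order_average_pool(features):
    """Average of the outer products x x^T over all locations."""
    return features.T @ features / features.shape[0]

# Toy region (assumption): two locations with 2-D local features.
features = np.array([[1.0, 0.0],
                     [3.0, 2.0]])
avg = average_pool(features)
mx = max_pool(features)
so = second_order_average_pool(features)
```

First-order pooling returns a vector of the same length as one local feature, while second-order pooling returns a symmetric matrix that also captures how feature dimensions co-occur.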
2012 ImageNet 1K
(Fall 2012)
[Figure: bar chart of error for each competing team]
Shallow vs. deep learning
• Engineered vs. learned features
[Figure: shallow pipeline (Image -> Feature extraction -> Pooling -> Classifier -> Label) next to a deep network (Image -> stacked Convolution layers -> Dense layers -> Label)]
Gradient-Based Learning Applied to Document Recognition, LeCun, Bottou, Bengio and Haffner, Proc. of the IEEE, 1998

[Annotation between the two papers: GPUs + Data*]

Imagenet Classification with Deep Convolutional Neural Networks, Krizhevsky, Sutskever, and Hinton, NIPS 2012
* Rectified activations and dropout
Slide Credit: L. Zitnick
Convolutional activation features

[Donahue et al. ICML 2013]

CNN Features off-the-shelf: an Astounding Baseline for Recognition
[Razavian et al. 2014]
Region representation
• Segment the image into superpixels
• Use features to represent each image segment

Joseph Tighe and Svetlana Lazebnik

Region representation
• Color, texture, BoW
• Only computed within the local region

• Shape of regions

• Position in the image

Working with regions
• Spatial support is important – multiple segmentations

Geometric context [Hoiem et al. ICCV 2005]

• Spatial consistency – MRF smoothing
Things to remember

• Visual categorization helps transfer knowledge

• Image features
• Coverage, concision, directness
• Color, gradients, textures, motion, descriptors
• Histogram, feature encoding, and pooling
• CNN as features

• Image/region categorization
Next lecture –
Foundations of Deep Learning
