0% found this document useful (0 votes)

4 views42 pages

18. Visual Object Tracking

The document discusses visual object tracking, focusing on the objective of locating objects over time in video sequences. It outlines the formal definition, approaches (probabilistic and discriminative tracking), and challenges such as appearance variations and temporal drift. Additionally, it highlights the integration of CNNs for improved feature extraction in tracking tasks.

Uploaded by

Đặng Minh Hoàng

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views42 pages

18. Visual Object Tracking

Uploaded by

Đặng Minh Hoàng

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 42

Visual Object Tracking

Instructor: Seunghoon Hong

Visual object tracking
Objective: locating the object(s) over time in a video

Initial frame

Target Tracking over

Visual Tracking
Visual object tracking
Objective: locating the object(s) over time in a video
Formal deﬁnition: given an object state at the initial frame z0=(x0,y0,w0,h0),
identify z1:T={z1,z2,…,zT} over a video of length T.
Visual object tracking
Objective: locating the object(s) over time in a video
Formal deﬁnition: given an object state at the initial frame z0=(x0,y0,w0,h0),
identify z1:T={z1,z2,…,zT} over a video of length T.

In learning perspective:
● Classiﬁcation problem with a single object class (= target vs distractors)
● Labeled data is given at only the initial frame
● Optionally requires online learning to adapt the variations in a video
● Online learning is driven by a self-supervision (training data = tracking results)
Visual object tracking
Objective: locating the object(s) over time in a video
Formal deﬁnition: given an object state at the initial frame z0=(x0,y0,w0,h0),
identify z1:T={z1,z2,…,zT} over a video of length T.

Two sub-categories:
● Single target tracking
○ Tracking only one object in an video
○ Single-class classiﬁcation (target vs. distractors)
● Multi target tracking
○ Tracking multiple objects in a video
○ Multi-class classiﬁcation (target 1 vs. target 2 vs. target 3 vs. … vs. distractors)
Approaches in single object tracking
● Probabilistic tracking
○ Formulate the localization task as a sequential probabilistic inference problem
○ Given a probability of the initial target location, propagate it over the remaining frames
Approaches in single object tracking
● Probabilistic tracking
○ Formulate the localization task as a sequential probabilistic inference problem
○ Given a probability of the initial target location, propagate it over the remaining frames

● Discriminative tracking
○ Classify the object from the distractors at every frame
○ Can be considered as sequential binary object detection (class = target, background)
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Likelihood Prior
Posterior
the measurement of The belief of object state
the probability of
how likely the without observation
object state given
observation
an observation
coincide with the
given state
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Target template
Prior
1 The belief of object state
without observation

2 3 Where is the target

likely to exist?
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Target template
Likelihood
the measurement of
how likely the
observation
coincide with the
given state
Which region of
image look similar
to the target?
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Target template
Posterior
the probability of
object state given
an observation

Where is the object

in this frame?
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule

Sequential Bayesian ﬁltering

z1:T: object locations in frame 1 to T

x1:T: frames 1 to T
Probabilistic tracking
● Hidden Markov Model

● Markovian assumption
Probabilistic tracking
● Sequential Bayesian ﬁltering

Integration over all object locations!

Likelihood Prior

Likelihood Transition Posterior upto

model the previous frame
Probabilistic tracking
● Approximation by Monte Carlo sampling

where
Probabilistic tracking
● Particle ﬁltering (Sequential Markov-Chain Monte-Carlo)
○ Approximate the prior distribution using Markov-Chain Monte Carlo (MCMC) sampling
Probabilistic tracking pipeline
Frame t-1 Frame t

2. Move samples by
1. Extract samples transition model 3. Re-evaluate likelihood
proportional to using appearance model
previous posterior
Probabilistic tracking pipeline
Frame t

Tracking procedure (simpliﬁed):

1. Sample target states near the previous
target location
2. Evaluate the likelihood based on
appearance model

Example target
appearance model

3. Select the most probable sample as the

target at the current frame

4. Update the target appearance model

using the current tracking results
Attendance check
https://forms.gle/rGpXxLKZ4jbcArid8
Discriminative tracking pipeline
Quick overview: learning tracking-by-detection
● Objective: a ridge regression
Model parameters

Training Training data

labels
Quick overview: learning tracking-by-detection
● Objective: a ridge regression

How do we solve it?

Quick overview: learning tracking-by-detection
● Objective: a ridge regression

We should update this classiﬁer for every frames

(i.e. every time we perform tracking and
get positive/negative samples)

Can we make it faster?

Correlation ﬁltering
● We can make it extremely fast for certain positive/negative sets!
Negative samples
(translated samples)

+30 +15 -15 -30

Base sample
(tracking results)
Correlation ﬁltering
● Representing positive/negative images using circulant matrices

Consider base sample x as n-dimensional array

Circulant matrix

Positive sample

Negative samples
Correlation ﬁltering
● Any circulant matrices can be made diagonal by the Discrete Fourier Transform
(DFT)
DFT matrix
(constant,
independent to x)
DFT of base sample
Correlation ﬁltering
● Putting all together

Circulant matrix

Matrix inner-product

Plug into ridge

regression
Kernelized Correlation ﬁltering
● Easy to extend to kernelized version

ridge regression

ridge regression with

kernel

We can do fast
computation if kernel
matrix K is circulant matrix

Fortunately, it has been

shown that most useful
kernels are circulant[1]

[1] Henriques et al., High-Speed Tracking with Kernelized Correlation Filters, In TPAMI, 2015
Challenges
● Modeling severe appearance variations in a video

ﬁgure credit: Li et al., A survey of appearance models in visual object tracking

Modeling appearance for tracking
● Classic: hand-designed features
○ Color histogram
○ Intensity
○ Object Templates
○ Key-points (SIFT)
○ …
● Issue
○ All prone to overﬁtting
○ Cannot generalize to various appearances
Integrating CNN for appearance modeling
● Beneﬁts
○ Features from a pre-trained CNN can be robust against various appearance changes
○ Especially useful in tracking since we have only one target ground-truth in the initial frame
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
Discussions
● Limitations?
Better representation learning with videos
● MDNet: learn representation for tracking with a large amount of videos
Challenges in visual object tracking
● Temporal drift (i.e. error propagation through time)
○ Drift in posterior estimation: the error in posterior propagates through time
○ Drift in appearance model: if update the appearance model in temporal failure, the error will
propagate

● But why is it so prune to temporal drift?

Summary: Visual tracking
● Object localization in a video
● Probabilistic vs. discriminative tracking
● Modeling target appearance is important
○ Essential to evaluate the affinity of samples in both tracking frameworks
○ Should be able to handle a wide range of appearance variations
○ Should be able to generalize well from a single ground-truth at initial frame
● CNN for visual tracking
○ Applying a pre-trained CNN for feature extraction
○ Training CNN with many heterogeneous videos for tracking

2022 Visual Object Tracking A Survey
No ratings yet
2022 Visual Object Tracking A Survey
42 pages
TDW and WP May 2022 Editorial InterventionIsolation TM2460 3XL David S
No ratings yet
TDW and WP May 2022 Editorial InterventionIsolation TM2460 3XL David S
4 pages
Single Object Tracking A Survey of Methods Dataset
No ratings yet
Single Object Tracking A Survey of Methods Dataset
15 pages
Computer Vision Paper
No ratings yet
Computer Vision Paper
3 pages
Lecture 9.2 Motion & Video Analysis in Computer Vision 2025
No ratings yet
Lecture 9.2 Motion & Video Analysis in Computer Vision 2025
49 pages
Object Tracking Methods-A Review
No ratings yet
Object Tracking Methods-A Review
7 pages
Object Tracking Using Radial Basis Function Networks
No ratings yet
Object Tracking Using Radial Basis Function Networks
11 pages
Tempest 160314194757
No ratings yet
Tempest 160314194757
28 pages
Object Tracking Using Radial Basis Function Networks
No ratings yet
Object Tracking Using Radial Basis Function Networks
9 pages
Cviii 2024 Ws
No ratings yet
Cviii 2024 Ws
45 pages
1602 00763
No ratings yet
1602 00763
5 pages
A Review of Visual Moving Target Tracking
No ratings yet
A Review of Visual Moving Target Tracking
30 pages
25 Object Tracking
No ratings yet
25 Object Tracking
29 pages
Object Tracking
No ratings yet
Object Tracking
20 pages
Pedestrian Detection and Tracking
No ratings yet
Pedestrian Detection and Tracking
13 pages
Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation
No ratings yet
Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation
6 pages
UNIT 5
No ratings yet
UNIT 5
18 pages
CNNTracking TNN10 Human
No ratings yet
CNNTracking TNN10 Human
14 pages
Combined Major Project
No ratings yet
Combined Major Project
8 pages
CORT: Class-Oriented Real-Time Tracking For Embedded Systems
No ratings yet
CORT: Class-Oriented Real-Time Tracking For Embedded Systems
10 pages
Tag Draft Especializado
No ratings yet
Tag Draft Especializado
14 pages
5 Major Computervision Technique
No ratings yet
5 Major Computervision Technique
10 pages
CV_UNIT_5
No ratings yet
CV_UNIT_5
11 pages
Trackformer
No ratings yet
Trackformer
16 pages
Zhang 2020
No ratings yet
Zhang 2020
5 pages
A Detection-Based Multiple Object Tracking Method: Mei Han Amit Sethi Yihong Gong
No ratings yet
A Detection-Based Multiple Object Tracking Method: Mei Han Amit Sethi Yihong Gong
4 pages
s10489-023-04998-3 (1)
No ratings yet
s10489-023-04998-3 (1)
19 pages
1303.4803v1
No ratings yet
1303.4803v1
42 pages
Real Time Object Detection and Tracking Using Deep Learning and Opencv
No ratings yet
Real Time Object Detection and Tracking Using Deep Learning and Opencv
4 pages
19. CNN for object tracking (1)
No ratings yet
19. CNN for object tracking (1)
44 pages
Object Detection and Tracking in Video Sequences
No ratings yet
Object Detection and Tracking in Video Sequences
6 pages
Object PDF
No ratings yet
Object PDF
6 pages
Marathwada Mitra Mandal's College of Engineering Karvenagar, Pune 52
No ratings yet
Marathwada Mitra Mandal's College of Engineering Karvenagar, Pune 52
16 pages
Smart Cards
No ratings yet
Smart Cards
39 pages
Object Detection and Tracking in Video Sequences
No ratings yet
Object Detection and Tracking in Video Sequences
6 pages
Yilmaz
No ratings yet
Yilmaz
45 pages
11
No ratings yet
11
19 pages
Moving Object Tracking and Detection in Videos Using MATLAB: A Review
No ratings yet
Moving Object Tracking and Detection in Videos Using MATLAB: A Review
9 pages
Object Tracking
100% (1)
Object Tracking
22 pages
Video Object Tracking
No ratings yet
Video Object Tracking
30 pages
Self-Supervised Deep Correlation Tracking
No ratings yet
Self-Supervised Deep Correlation Tracking
10 pages
Ijaerv10n9spl 339
No ratings yet
Ijaerv10n9spl 339
9 pages
Detect To Track and Track To Detect
No ratings yet
Detect To Track and Track To Detect
10 pages
Cviii 2024 Ws Copy
No ratings yet
Cviii 2024 Ws Copy
98 pages
Real-Time People Tracking in A Camera Network: Wasit Limprasert, Andrew Wallace, and Greg Michaelson
No ratings yet
Real-Time People Tracking in A Camera Network: Wasit Limprasert, Andrew Wallace, and Greg Michaelson
9 pages
Guide Prof. P.J Engineer Co-Guide Prof. M.C Patel: Prepared by Parthiv Bharti P09 EC 916
No ratings yet
Guide Prof. P.J Engineer Co-Guide Prof. M.C Patel: Prepared by Parthiv Bharti P09 EC 916
31 pages
Prokaj Persistent Tracking For 2014 CVPR Paper
No ratings yet
Prokaj Persistent Tracking For 2014 CVPR Paper
8 pages
Ilchae Jung Real-Time MDNet ECCV 2018 Paper
No ratings yet
Ilchae Jung Real-Time MDNet ECCV 2018 Paper
16 pages
12 CS1AC16 Detection and Tracking
No ratings yet
12 CS1AC16 Detection and Tracking
4 pages
Computer Vision Based Moving Object Detection and Tracking: Suresh Kumar, Prof. Yatin Kumar Agarwal
No ratings yet
Computer Vision Based Moving Object Detection and Tracking: Suresh Kumar, Prof. Yatin Kumar Agarwal
6 pages
Moving Object Analysis Techniques in Videos - A Review: Ritika, Gianetan Singh Sekhon
No ratings yet
Moving Object Analysis Techniques in Videos - A Review: Ritika, Gianetan Singh Sekhon
6 pages
Moving Object Recognization, Tracking and Destruction
No ratings yet
Moving Object Recognization, Tracking and Destruction
45 pages
Final-Report Img Pro
No ratings yet
Final-Report Img Pro
15 pages
Object Tracking Techniques For Video Tracking: A Survey: Mansi Manocha, Parminder Kaur
No ratings yet
Object Tracking Techniques For Video Tracking: A Survey: Mansi Manocha, Parminder Kaur
5 pages
Object Detection
No ratings yet
Object Detection
4 pages
1910.09761
No ratings yet
1910.09761
25 pages
A Pedestrian Detection and Tracking System Based On Video Processing Technology
No ratings yet
A Pedestrian Detection and Tracking System Based On Video Processing Technology
6 pages
Adaptive Probabilistic Visual Tracking with Incremental Subspace Update 1st edition by David Ross, Jongwoo Lim, Ming Hsuan Yang ISBN 3540219835 9783540219835 download
100% (3)
Adaptive Probabilistic Visual Tracking with Incremental Subspace Update 1st edition by David Ross, Jongwoo Lim, Ming Hsuan Yang ISBN 3540219835 9783540219835 download
49 pages
Image Processing: Object Tracking With Color Detection
No ratings yet
Image Processing: Object Tracking With Color Detection
15 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet
Practical 2 .Ipynb - Colab (1) - Copy (1)
No ratings yet
Practical 2 .Ipynb - Colab (1) - Copy (1)
9 pages
Infineon - Power Supply ICs Overview BR 2017-SG-v01 00-EN
No ratings yet
Infineon - Power Supply ICs Overview BR 2017-SG-v01 00-EN
22 pages
A Practical Animal Detection and Collision Avoidance System Using Computer Vision Technique
No ratings yet
A Practical Animal Detection and Collision Avoidance System Using Computer Vision Technique
12 pages
century 21 keyboarding and information processing 9 edition Edition Jerry W. Robinson - The latest updated ebook is now available for download
100% (1)
century 21 keyboarding and information processing 9 edition Edition Jerry W. Robinson - The latest updated ebook is now available for download
55 pages
3PoleContactors Digital PDF
No ratings yet
3PoleContactors Digital PDF
17 pages
Design Data: Design of Simply Supported RCC Gap Slab of Span 7.647 M
No ratings yet
Design Data: Design of Simply Supported RCC Gap Slab of Span 7.647 M
31 pages
Von Neumann Architecture
No ratings yet
Von Neumann Architecture
8 pages
Physics Paper 2025
No ratings yet
Physics Paper 2025
12 pages
Chem Group 2
No ratings yet
Chem Group 2
32 pages
University of California College of Engineering Department of Electrical Engineering and Computer Sciences
No ratings yet
University of California College of Engineering Department of Electrical Engineering and Computer Sciences
3 pages
Syllabus For Food Technology (Xe: Section G) : Food Chemistry and Nutrition
No ratings yet
Syllabus For Food Technology (Xe: Section G) : Food Chemistry and Nutrition
2 pages
Karakter Goku
No ratings yet
Karakter Goku
8 pages
Accessing Root Canal Systems - Knowledge Base and Clinical Techniques
No ratings yet
Accessing Root Canal Systems - Knowledge Base and Clinical Techniques
19 pages
Thermo-Lag 440 Brochure PDF
No ratings yet
Thermo-Lag 440 Brochure PDF
2 pages
DIAEnergie WebAPI v1.4
No ratings yet
DIAEnergie WebAPI v1.4
9 pages
Siemens Flowrite Vf599 Series 3way Valve
No ratings yet
Siemens Flowrite Vf599 Series 3way Valve
2 pages
Yash Shah 201902241
No ratings yet
Yash Shah 201902241
3 pages
Kenny and Osborne Music Performance B
No ratings yet
Kenny and Osborne Music Performance B
10 pages
EMTH202-TEST 1-21 APRIL2021-with Marking Key
100% (1)
EMTH202-TEST 1-21 APRIL2021-with Marking Key
3 pages
Devanshu File C
No ratings yet
Devanshu File C
12 pages
CEMAT PresentationV71474389
0% (1)
CEMAT PresentationV71474389
112 pages
Cfa二级百题预测金程教育学员版题目
No ratings yet
Cfa二级百题预测金程教育学员版题目
392 pages
Levelling Elements Cat Excpert
No ratings yet
Levelling Elements Cat Excpert
8 pages
SFDFSGDFG
100% (2)
SFDFSGDFG
16 pages
Methods in Educational Research From Theory to Practice Second Edition Marguerite G. Lodico pdf download
No ratings yet
Methods in Educational Research From Theory to Practice Second Edition Marguerite G. Lodico pdf download
49 pages
Acumulador de Freno - Pruebas
No ratings yet
Acumulador de Freno - Pruebas
1 page
Part II Microscopic World I Notes
No ratings yet
Part II Microscopic World I Notes
32 pages
Laplace Transform
No ratings yet
Laplace Transform
54 pages
EC18 Errata - 2024 0221
No ratings yet
EC18 Errata - 2024 0221
18 pages

18. Visual Object Tracking

Uploaded by

18. Visual Object Tracking

Uploaded by

Visual Object Tracking

Instructor: Seunghoon Hong

Target Tracking over

2 3 Where is the target

Where is the object

Sequential Bayesian ﬁltering

z1:T: object locations in frame 1 to T

Integration over all object locations!

Likelihood Transition Posterior upto

Tracking procedure (simpliﬁed):

3. Select the most probable sample as the

4. Update the target appearance model

Training Training data

How do we solve it?

We should update this classiﬁer for every frames

Can we make it faster?

+30 +15 -15 -30

Consider base sample x as n-dimensional array

Plug into ridge

ridge regression with

Fortunately, it has been

ﬁgure credit: Li et al., A survey of appearance models in visual object tracking

● But why is it so prune to temporal drift?

You might also like