Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models

Li, Congcong; Kowdle, Adarsh; Saxena, Ashutosh; Chen, Tsuhan

arXiv:1110.5102 (cs)

[Submitted on 24 Oct 2011]

Abstract: Scene understanding includes many related sub-tasks, such as scene categorization, depth estimation, and object detection. Each of these sub-tasks is often notoriously hard, and state-of-the-art classifiers already exist for many of them. These classifiers operate on the same raw image and provide correlated outputs. It is desirable to have an algorithm that can capture such correlation without requiring any changes to the inner workings of any classifier.
We propose Feedback Enabled Cascaded Classification Models (FE-CCM), which jointly optimize all the sub-tasks while requiring only a 'black-box' interface to the original classifier for each sub-task. We use a two-layer cascade of classifiers, which are repeated instantiations of the original ones, with the output of the first layer fed into the second layer as input. Our training method involves a feedback step that allows later classifiers to tell earlier classifiers which error modes to focus on. We show that our method significantly improves performance in all the sub-tasks in the domain of scene understanding, where we consider depth estimation, scene categorization, event categorization, object detection, geometric labeling and saliency detection. Our method also improves performance in two robotic applications: an object-grasping robot and an object-finding robot.
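
The two-layer structure described in the abstract can be illustrated with a minimal sketch. It covers only the feed-forward part: each sub-task gets a first-layer model trained on the raw features, and a second-layer model trained on the raw features plus all first-layer outputs. The feedback training step is not specified in the abstract and is deliberately left out. The names `RidgeBlackBox`, `train_cascade` and `predict_cascade` are hypothetical, and a ridge regressor stands in for whatever black-box classifier a sub-task would actually use.

```python
import numpy as np


class RidgeBlackBox:
    """Hypothetical stand-in for one sub-task model.

    The cascade only ever calls fit() and predict(), which is the
    'black-box' interface assumed in this sketch.
    """

    def __init__(self, reg=1e-3):
        self.reg = reg
        self.w = None

    def _with_bias(self, X):
        # Append a constant column so the model can learn an offset.
        return np.hstack([X, np.ones((X.shape[0], 1))])

    def fit(self, X, y):
        Xb = self._with_bias(X)
        A = Xb.T @ Xb + self.reg * np.eye(Xb.shape[1])
        self.w = np.linalg.solve(A, Xb.T @ y)
        return self

    def predict(self, X):
        return self._with_bias(X) @ self.w


def train_cascade(X, targets, make_model=RidgeBlackBox):
    """Train a feed-forward two-layer cascade (no feedback step).

    X       : (n_samples, n_features) raw features shared by all sub-tasks
    targets : list of (n_samples,) arrays, one per sub-task
    """
    # Layer 1: one model per sub-task, trained on raw features only.
    layer1 = [make_model().fit(X, y) for y in targets]
    # Layer-1 outputs for every sub-task become extra input features.
    Z = np.column_stack([m.predict(X) for m in layer1])
    X2 = np.hstack([X, Z])
    # Layer 2: a fresh instance of the same model type per sub-task.
    layer2 = [make_model().fit(X2, y) for y in targets]
    return layer1, layer2


def predict_cascade(layer1, layer2, X):
    """Run both layers; returns an (n_samples, n_tasks) array."""
    Z = np.column_stack([m.predict(X) for m in layer1])
    X2 = np.hstack([X, Z])
    return np.column_stack([m.predict(X2) for m in layer2])
```

Because the second layer receives every first-layer output, each sub-task can in principle exploit correlations with the others without any change to the underlying model code, which is the property the abstract emphasizes.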

Comments:	14 pages, 11 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:1110.5102 [cs.CV]
	(or arXiv:1110.5102v1 [cs.CV] for this version)
 	https://doi.org/10.48550/arXiv.1110.5102

Submission history

From: Congcong Li
[v1] Mon, 24 Oct 2011 00:31:00 UTC (6,019 KB)
