Watch-n-Patch: Unsupervised Learning of Actions and Relations

Wu, Chenxia; Zhang, Jiemi; Sener, Ozan; Selman, Bart; Savarese, Silvio; Saxena, Ashutosh

arXiv:1603.03541 (cs)

[Submitted on 11 Mar 2016]

Abstract: There is a large variation in the activities that humans perform in their everyday lives. We consider modeling these composite human activities, which comprise multiple basic-level actions, in a completely unsupervised setting. Our model learns high-level co-occurrence and temporal relations between the actions. We treat a video as a sequence of short-term action clips, each containing human-words and object-words. An activity is described by a set of action-topics and object-topics, which indicate the actions that are present and the objects being interacted with. We then propose a new probabilistic model relating the words and the topics. It allows us to model the long-range action relations that commonly exist in composite activities, which has been challenging for previous work. We apply our model to unsupervised action segmentation and clustering, and to a novel application that detects forgotten actions, which we call action patching. For evaluation, we contribute a new, challenging RGB-D activity video dataset recorded with the new Kinect v2, which contains several human daily activities as compositions of multiple actions interacting with different objects. Moreover, we develop a robotic system that watches people and reminds them of forgotten actions by applying our action patching algorithm. Our robotic setup can be easily deployed on any assistive robot.
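The idea of action patching can be illustrated with a deliberately simplified sketch. It is not the probabilistic topic model proposed in the paper: it assumes each video has already been reduced to a list of action labels, and it replaces the learned action-topic relations with plain co-occurrence counts. The function names, the threshold, and the action labels are all hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(videos):
    """Count, over all videos, how often each action and each pair of actions occurs."""
    action_counts = Counter()
    pair_counts = Counter()
    for actions in videos:
        present = set(actions)  # ignore repeats within one video
        action_counts.update(present)
        pair_counts.update(frozenset(p) for p in combinations(sorted(present), 2))
    return action_counts, pair_counts


def suggest_forgotten(observed, action_counts, pair_counts, threshold=0.8):
    """Return unseen actions that accompanied an observed action in at least
    `threshold` of the training videos containing that observed action."""
    present = set(observed)
    suggestions = {}
    for seen in present:
        if action_counts[seen] == 0:
            continue  # action never seen in training, no evidence to use
        for candidate in action_counts:
            if candidate in present:
                continue
            joint = pair_counts[frozenset((seen, candidate))]
            score = joint / action_counts[seen]
            if score >= threshold:
                suggestions[candidate] = max(score, suggestions.get(candidate, 0.0))
    return suggestions


# Hypothetical toy data: each inner list is one video's sequence of action labels.
training_videos = [
    ["fetch-milk", "pour-milk", "put-back-milk"],
    ["fetch-milk", "pour-milk", "put-back-milk"],
    ["turn-on-monitor", "type", "turn-off-monitor"],
    ["fetch-milk", "pour-milk", "put-back-milk", "type"],
]
action_counts, pair_counts = cooccurrence_counts(training_videos)
forgotten = suggest_forgotten(["fetch-milk", "pour-milk"], action_counts, pair_counts)
```

In this toy setting, a video that shows fetching and pouring but not putting the milk back is flagged, because the missing action accompanied the observed ones in every training video. The sketch ignores temporal order and object information, both of which the abstract says the actual model takes into account.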

Comments:	arXiv admin note: text overlap with arXiv:1512.04208
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:1603.03541 [cs.CV]
	(or arXiv:1603.03541v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1603.03541

Submission history

From: Chenxia Wu
[v1] Fri, 11 Mar 2016 07:13:59 UTC (8,276 KB)
