Unbiased Scene Graph Generation in Videos

Nag, Sayak; Min, Kyle; Tripathi, Subarna; Chowdhury, Amit K. Roy

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.00733v1 (cs)

[Submitted on 3 Apr 2023 (this version), latest version 29 Jun 2023 (v3)]

Title:Unbiased Scene Graph Generation in Videos

Authors:Sayak Nag, Kyle Min, Subarna Tripathi, Amit K. Roy Chowdhury

View PDF

Abstract:The task of dynamic scene graph generation (SGG) from videos is complicated and challenging due to the inherent dynamics of a scene, temporal fluctuation of model predictions, and the long-tailed distribution of the visual relationships in addition to the already existing challenges in image-based SGG. Existing methods for dynamic SGG have primarily focused on capturing spatio-temporal context using complex architectures without addressing the challenges mentioned above, especially the long-tailed distribution of relationships. This often leads to the generation of biased scene graphs. To address these challenges, we introduce a new framework called TEMPURA: TEmporal consistency and Memory Prototype guided UnceRtainty Attenuation for unbiased dynamic SGG. TEMPURA employs object-level temporal consistencies via transformer-based sequence modeling, learns to synthesize unbiased relationship representations using memory-guided training, and attenuates the predictive uncertainty of visual relations using a Gaussian Mixture Model (GMM). Extensive experiments demonstrate that our method achieves significant (up to 10% in some cases) performance gain over existing methods highlighting its superiority in generating more unbiased scene graphs.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2304.00733 [cs.CV]
	(or arXiv:2304.00733v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.00733

Submission history

From: Sayak Nag [view email]
[v1] Mon, 3 Apr 2023 06:10:06 UTC (3,011 KB)
[v2] Thu, 6 Apr 2023 21:45:20 UTC (3,011 KB)
[v3] Thu, 29 Jun 2023 23:52:24 UTC (3,011 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Unbiased Scene Graph Generation in Videos

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Unbiased Scene Graph Generation in Videos

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators