Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Mogadala, Aditya; Kalimuthu, Marimuthu; Klakow, Dietrich

Computer Science > Computer Vision and Pattern Recognition

arXiv:1907.09358v1 (cs)

[Submitted on 22 Jul 2019 (this version), latest version 31 Dec 2021 (v3)]

Title:Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Authors:Aditya Mogadala, Marimuthu Kalimuthu, Dietrich Klakow

View PDF

Abstract:Integration of vision and language tasks has seen a significant growth in the recent times due to surge of interest from multi-disciplinary communities such as deep learning, computer vision, and natural language processing. In this survey, we focus on ten different vision and language integration tasks in terms of their problem formulation, methods, existing datasets, evaluation measures, and comparison of results achieved with the corresponding state-of-the-art methods. This goes beyond earlier surveys which are either task-specific or concentrate only on one type of visual content i.e., image or video. We then conclude the survey by discussing some possible future directions for integration of vision and language research.

Comments:	Submitted to Journal
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1907.09358 [cs.CV]
	(or arXiv:1907.09358v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1907.09358

Submission history

From: Aditya Mogadala [view email]
[v1] Mon, 22 Jul 2019 14:53:48 UTC (3,585 KB)
[v2] Sat, 12 Sep 2020 13:26:29 UTC (3,695 KB)
[v3] Fri, 31 Dec 2021 20:40:20 UTC (3,782 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-07

Change to browse by:

cs
cs.CL
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Aditya Mogadala
Marimuthu Kalimuthu
Dietrich Klakow

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators