Simple Image Description Generator via a Linear Phrase-Based Approach

Lebret, Remi; Pinheiro, Pedro O.; Collobert, Ronan

Computer Science > Computation and Language

arXiv:1412.8419 (cs)

[Submitted on 29 Dec 2014 (v1), last revised 11 Apr 2015 (this version, v3)]

Title:Simple Image Description Generator via a Linear Phrase-Based Approach

Authors:Remi Lebret, Pedro O. Pinheiro, Ronan Collobert

View PDF

Abstract:Generating a novel textual description of an image is an interesting problem that connects computer vision and natural language processing. In this paper, we present a simple model that is able to generate descriptive sentences given a sample image. This model has a strong focus on the syntax of the descriptions. We train a purely bilinear model that learns a metric between an image representation (generated from a previously trained Convolutional Neural Network) and phrases that are used to described them. The system is then able to infer phrases from a given image sample. Based on caption syntax statistics, we propose a simple language model that can produce relevant descriptions for a given test image using the phrases inferred. Our approach, which is considerably simpler than state-of-the-art models, achieves comparable results on the recently release Microsoft COCO dataset.

Comments:	Accepted as a workshop paper at ICLR 2015
Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1412.8419 [cs.CL]
	(or arXiv:1412.8419v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1412.8419

Submission history

From: Pedro O. Pinheiro [view email]
[v1] Mon, 29 Dec 2014 18:43:10 UTC (1,422 KB)
[v2] Wed, 18 Mar 2015 05:09:13 UTC (742 KB)
[v3] Sat, 11 Apr 2015 03:53:26 UTC (1,591 KB)

Computer Science > Computation and Language

Title:Simple Image Description Generator via a Linear Phrase-Based Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Simple Image Description Generator via a Linear Phrase-Based Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators