Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Shin, Andrew; Ushiku, Yoshitaka; Harada, Tatsuya

arXiv:1805.00460 (cs)

[Submitted on 27 Apr 2018]

Abstract: The image description task has invariably been examined in a static manner, with qualitative presumptions held to be universally applicable regardless of the scope or target of the description. In practice, however, different viewers may pay attention to different aspects of an image and yield different descriptions or interpretations under various contexts. Such diversity in perspectives is difficult to derive with conventional image description techniques. In this paper, we propose a customized image narrative generation task, in which users are interactively engaged in the generation process by providing answers to questions. We further attempt to learn the user's interest by repeating such interactive stages, and to automatically reflect that interest in descriptions for new images. Experimental results demonstrate that our model can generate a variety of descriptions from a single image that cover a wider range of topics than conventional models, while being customizable to the target user of the interaction.

Comments:	To Appear at CVPR 2018 as spotlight presentation
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:1805.00460 [cs.CL]
	(or arXiv:1805.00460v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1805.00460

Submission history

From: Andrew Shin
[v1] Fri, 27 Apr 2018 11:27:45 UTC (9,467 KB)
