Multimodal Attribute Extraction

Logan IV, Robert L.; Humeau, Samuel; Singh, Sameer

arXiv:1711.11118 (cs)

[Submitted on 29 Nov 2017]

Abstract: The broad goal of information extraction is to derive structured information from unstructured data. However, most existing methods focus solely on text, ignoring other types of unstructured data such as images, video, and audio, which comprise an increasing portion of the information on the web. To address this shortcoming, we propose the task of multimodal attribute extraction. Given a collection of unstructured and semi-structured contextual information about an entity (such as a textual description or visual depictions), the task is to extract the entity's underlying attributes. In this paper, we provide a dataset containing mixed-media data for over 2 million product items, along with 7 million attribute-value pairs describing the items, which can be used to train attribute extractors in a weakly supervised manner. We provide a variety of baselines that demonstrate the relative effectiveness of the individual modes of information towards solving the task, and we also study human performance.
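
The task setup in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the authors' code or data format: the names `ProductItem`, `Query`, and `to_training_examples`, the field names, and the toy item are all hypothetical. It shows one plausible way that existing attribute-value pairs could serve as weak labels, with each pair becoming one (context plus attribute, value) training example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class ProductItem:
    """Hypothetical record: mixed-media context plus known attribute-value pairs."""
    item_id: str
    description: str  # unstructured textual context
    image_paths: List[str] = field(default_factory=list)  # visual context
    attributes: Dict[str, str] = field(default_factory=dict)  # attribute -> value


@dataclass
class Query:
    """Hypothetical model input: an item's context plus the attribute being asked about."""
    item_id: str
    description: str
    image_paths: List[str]
    attribute: str


def to_training_examples(items: List[ProductItem]) -> List[Tuple[Query, str]]:
    """Turn every known attribute-value pair into one (query, value) example."""
    examples = []
    for item in items:
        # Sort attribute names so the example order is deterministic.
        for attribute, value in sorted(item.attributes.items()):
            query = Query(item.item_id, item.description, item.image_paths, attribute)
            examples.append((query, value))
    return examples


# Toy data, invented for illustration only.
items = [
    ProductItem(
        item_id="item-0",
        description="A red cotton t-shirt with short sleeves.",
        image_paths=["item-0.jpg"],
        attributes={"color": "red", "material": "cotton"},
    )
]

examples = to_training_examples(items)
for query, value in examples:
    print(query.attribute, "->", value)
```

In this sketch the labels come from attribute-value pairs that already accompany each item rather than from manual annotation, which is one reading of "weakly supervised" here; a model trained on such examples would consume the text, the images, or both.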

Comments:	AKBC 2017 Workshop Paper
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1711.11118 [cs.CL]
	(or arXiv:1711.11118v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1711.11118

Submission history

From: Robert Logan IV
[v1] Wed, 29 Nov 2017 21:40:59 UTC (109 KB)
