Multilingual Hierarchical Attention Networks for Document Classification

Pappas, Nikolaos; Popescu-Belis, Andrei

Computer Science > Computation and Language

arXiv:1707.00896 (cs)

[Submitted on 4 Jul 2017 (v1), last revised 15 Sep 2017 (this version, v4)]

Title:Multilingual Hierarchical Attention Networks for Document Classification

Authors:Nikolaos Pappas, Andrei Popescu-Belis

View PDF

Abstract:Hierarchical attention networks have recently achieved remarkable performance for document classification in a given language. However, when multilingual document collections are considered, training such models separately for each language entails linear parameter growth and lack of cross-language transfer. Learning a single multilingual model with fewer parameters is therefore a challenging but potentially beneficial objective. To this end, we propose multilingual hierarchical attention networks for learning document structures, with shared encoders and/or shared attention mechanisms across languages, using multi-task learning and an aligned semantic space as input. We evaluate the proposed models on multilingual document classification with disjoint label sets, on a large dataset which we provide, with 600k news documents in 8 languages, and 5k labels. The multilingual models outperform monolingual ones in low-resource as well as full-resource settings, and use fewer parameters, thus confirming their computational efficiency and the utility of cross-language transfer.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1707.00896 [cs.CL]
	(or arXiv:1707.00896v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1707.00896

Submission history

From: Nikolaos Pappas [view email]
[v1] Tue, 4 Jul 2017 10:28:04 UTC (2,326 KB)
[v2] Sun, 9 Jul 2017 10:37:52 UTC (2,334 KB)
[v3] Wed, 6 Sep 2017 15:06:16 UTC (2,337 KB)
[v4] Fri, 15 Sep 2017 10:47:26 UTC (2,337 KB)

Computer Science > Computation and Language

Title:Multilingual Hierarchical Attention Networks for Document Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multilingual Hierarchical Attention Networks for Document Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators