Quasi-Recurrent Neural Networks

Bradbury, James; Merity, Stephen; Xiong, Caiming; Socher, Richard

Computer Science > Neural and Evolutionary Computing

arXiv:1611.01576 (cs)

[Submitted on 5 Nov 2016 (v1), last revised 21 Nov 2016 (this version, v2)]

Title:Quasi-Recurrent Neural Networks

Authors:James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher

View PDF

Abstract:Recurrent neural networks are a powerful tool for modeling sequential data, but the dependence of each timestep's computation on the previous timestep's output limits parallelism and makes RNNs unwieldy for very long sequences. We introduce quasi-recurrent neural networks (QRNNs), an approach to neural sequence modeling that alternates convolutional layers, which apply in parallel across timesteps, and a minimalist recurrent pooling function that applies in parallel across channels. Despite lacking trainable recurrent layers, stacked QRNNs have better predictive accuracy than stacked LSTMs of the same hidden size. Due to their increased parallelism, they are up to 16 times faster at train and test time. Experiments on language modeling, sentiment classification, and character-level neural machine translation demonstrate these advantages and underline the viability of QRNNs as a basic building block for a variety of sequence tasks.

Comments:	Submitted to conference track at ICLR 2017
Subjects:	Neural and Evolutionary Computing (cs.NE); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1611.01576 [cs.NE]
	(or arXiv:1611.01576v2 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1611.01576

Submission history

From: James Bradbury [view email]
[v1] Sat, 5 Nov 2016 00:31:25 UTC (353 KB)
[v2] Mon, 21 Nov 2016 20:52:34 UTC (353 KB)

Computer Science > Neural and Evolutionary Computing

Title:Quasi-Recurrent Neural Networks

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Quasi-Recurrent Neural Networks

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators