SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup

Zhang, Rongzhi; Yu, Yue; Zhang, Chao

arXiv:2010.02322 (cs)

[Submitted on 5 Oct 2020]

Abstract: Active learning is an important technique for low-resource sequence labeling tasks. However, current active sequence labeling methods use the queried samples alone in each iteration, which is an inefficient way of leveraging human annotations. We propose a simple but effective data augmentation method to improve the label efficiency of active sequence labeling. Our method, SeqMix, simply augments the queried samples by generating extra labeled sequences in each iteration. The key difficulty is to generate plausible sequences along with token-level labels. In SeqMix, we address this challenge by performing mixup for both sequences and token-level labels of the queried samples. Furthermore, we design a discriminator during sequence mixup, which judges whether the generated sequences are plausible or not. Our experiments on Named Entity Recognition and Event Detection tasks show that SeqMix can improve the standard active sequence labeling method by $2.27\%$--$3.75\%$ in terms of $F_1$ scores. The code and data for SeqMix can be found at this https URL.
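The abstract does not spell out how the mixup is computed, so the following is only a minimal sketch of generic token-level mixup, not the authors' implementation. It assumes two equal-length sequences, token embeddings given as plain lists of floats, integer class labels, and a mixing coefficient drawn from a Beta(alpha, alpha) distribution as in standard mixup. The names `mixup_sequences` and `keep_if_plausible`, the default `alpha`, and the score thresholds are all hypothetical; `keep_if_plausible` merely stands in for the discriminator, whose actual scoring rule is not described here.

```python
import random


def mixup_sequences(emb_a, emb_b, labels_a, labels_b, num_classes,
                    alpha=8.0, rng=None):
    """Token-wise mixup of two equal-length sequences (illustrative only).

    emb_a, emb_b: lists of token embedding vectors (lists of floats).
    labels_a, labels_b: lists of integer class ids, one per token.
    Returns (mixed_embeddings, mixed_soft_labels, lam).
    """
    if not (len(emb_a) == len(emb_b) == len(labels_a) == len(labels_b)):
        raise ValueError("sequences must have the same length")
    rng = rng or random.Random()
    # Assumption: mixing coefficient sampled from Beta(alpha, alpha).
    lam = rng.betavariate(alpha, alpha)

    # Interpolate the embeddings position by position.
    mixed_emb = [
        [lam * x + (1.0 - lam) * y for x, y in zip(vec_a, vec_b)]
        for vec_a, vec_b in zip(emb_a, emb_b)
    ]

    # Interpolate the one-hot labels with the same coefficient,
    # giving one soft label distribution per token.
    mixed_labels = []
    for y_a, y_b in zip(labels_a, labels_b):
        soft = [0.0] * num_classes
        soft[y_a] += lam
        soft[y_b] += 1.0 - lam
        mixed_labels.append(soft)
    return mixed_emb, mixed_labels, lam


def keep_if_plausible(candidates, score_fn, low, high):
    """Hypothetical stand-in for the discriminator: keep candidates
    whose score (from a user-supplied function) lies in [low, high]."""
    return [c for c in candidates if low <= score_fn(c) <= high]
```

In an active learning loop, a routine of this kind would be applied to pairs of newly queried samples in each iteration, and only the generated sequences that pass the plausibility filter would be added to the labeled pool.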

Comments:	EMNLP 2020 Long Paper
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2010.02322 [cs.CL]
	(or arXiv:2010.02322v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2010.02322
Journal reference:	EMNLP 2020

Submission history

From: Yue Yu
[v1] Mon, 5 Oct 2020 20:27:14 UTC (1,999 KB)
