Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text

Srivastava, Vivek; Singh, Mayank

Computer Science > Computation and Language

arXiv:2106.10123 (cs)

[Submitted on 18 Jun 2021]

Title:Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text

Authors:Vivek Srivastava, Mayank Singh

View PDF

Abstract:Code-mixing is a frequent communication style among multilingual speakers where they mix words and phrases from two different languages in the same utterance of text or speech. Identifying and filtering code-mixed text is a challenging task due to its co-existence with monolingual and noisy text. Over the years, several code-mixing metrics have been extensively used to identify and validate code-mixed text quality. This paper demonstrates several inherent limitations of code-mixing metrics with examples from the already existing datasets that are popularly used across various experiments.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2106.10123 [cs.CL]
	(or arXiv:2106.10123v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2106.10123

Submission history

From: Vivek Srivastava [view email]
[v1] Fri, 18 Jun 2021 13:26:48 UTC (113 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Mayank Singh

export BibTeX citation

Computer Science > Computation and Language

Title:Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Challenges and Limitations with the Metrics Measuring the Complexity of Code-Mixed Text

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators