Comparison Between Traditional Machine Learning Models And Neural Network Models For Vietnamese Hate Speech Detection

Luu, Son T.; Nguyen, Hung P.; Van Nguyen, Kiet; Nguyen, Ngan Luu-Thuy

doi:10.1109/RIVF48685.2020.9140745

Computer Science > Computation and Language

arXiv:2002.00759 (cs)

[Submitted on 31 Jan 2020 (v1), last revised 28 Sep 2020 (this version, v2)]

Title:Comparison Between Traditional Machine Learning Models And Neural Network Models For Vietnamese Hate Speech Detection

Authors:Son T. Luu, Hung P. Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen

View PDF

Abstract:Hate-speech detection on social network language has become one of the main researching fields recently due to the spreading of social networks like Facebook and Twitter. In Vietnam, the threat of offensive and harassment cause bad impacts for online user. The VLSP - Shared task about Hate Speech Detection on social networks showed many proposed approaches for detecting whatever comment is clean or not. However, this problem still needs further researching. Consequently, we compare traditional machine learning and deep learning on a large dataset about the user's comments on social network in Vietnamese and find out what is the advantage and disadvantage of each model by comparing their accuracy on F1-score, then we pick two models in which has highest accuracy in traditional machine learning models and deep neural models respectively. Next, we compare these two models capable of predicting the right label by referencing their confusion matrices and considering the advantages and disadvantages of each model. Finally, from the comparison result, we propose our ensemble method that concentrates the abilities of traditional methods and deep learning methods.

Comments:	Published in The 2020 RIVF International Conference on Computing and Communication Technologies (RIVF)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2002.00759 [cs.CL]
	(or arXiv:2002.00759v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2002.00759
Related DOI:	https://doi.org/10.1109/RIVF48685.2020.9140745

Submission history

From: Son T. Luu [view email]
[v1] Fri, 31 Jan 2020 09:28:57 UTC (277 KB)
[v2] Mon, 28 Sep 2020 01:54:32 UTC (277 KB)

Computer Science > Computation and Language

Title:Comparison Between Traditional Machine Learning Models And Neural Network Models For Vietnamese Hate Speech Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Comparison Between Traditional Machine Learning Models And Neural Network Models For Vietnamese Hate Speech Detection

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators