Text Classification Reseach Paper
Text Classification Reseach Paper
References
1. Koloiev, A.S.: Rewrite as a new phenomenon in modern journalism. In: SPU Bulletin.
Philology, vol. 1, 221-226 (2012)
2. Bolshakov, I.A.: Two methods of synonymous paraphrasing in linguistic steganography.
In:Proceedings of the International ConferenceDialogue-2004,http://www.dialog-
21.ru/media/2496/bolshakov.pdf, last accessed 2019/02/10.
3. Petrasova, S., Khairova, N., Lewoniewski, W.: Building the semantic similarity model for
social network data streams. In:Data Stream Mining & Processing, Proceedings of the
2018 IEEE Second International Conference (DSMP), 21-24 (2018)
4. WordNet: https://wordnet.princeton.edu, last accessed 2019/02/10.
5. BBC, https://www.bbc.com/news,last accessed 2019/02/10.
70
Maria Razno[0000-0003-3356-5027]
National Technical University "Kharkiv Polytechnic Institute",
Pushkinska str., 79/2, Kharkiv, Ukraine
Abstract. This article describes the relevance of the word processing task that
is written in human language by the methods of Machine Learning and NLP
approach, that can be used on Python programming language. It also portrays
the concept of Machine Learning, its main varieties and the most popular
Pythonpackages and libraries for working with text data using Machine
Learning methods. The concept of NLP and the most popular python packages
are also presented in the article. The machine learning classification model
algorithm based on the text processing is introduced in the article. It shows how
to use classification machine learning and NLP methods in practice.
Over the last few years machine learning and artificial intelligence have become
very hot topics. Nowadays their methods and approaches are a part of a huge amount
of products, moreover it is a necessary thing in most applications and appliances. An
example of using ML (Machine Learning) can be the automatic determination of
important emails and quick responses in Gmail. Nowadays we can confidently say
that and artificial intelligence with machine learning can push a person out of many
technological processes.
Machine learning is the scientific study of algorithms and statistical methods that
computer systems use to effectively perform a specific task without using explicit
instructions, relying on patterns and inference instead. It is seen as a subset of
artificial intelligence. Machine learning algorithms build a mathematical model of
sample data, known as "training data", in order to make predictions or decisions
without being explicitly programmed to perform the task. There are five types of
machine learning algorithms: supervised, semi-supervised, active learning,
reinforcement and unsupervised learning [1].
Natural language processing is a subfield of computer science, information
engineering, and artificial intelligence concerned with the interactions between
computers and human (natural) languages, in particular, how to program computers in
order to process and analyze large amounts of natural language data. Tasks in natural
71
72
References
1. Langley, P.: Human and machine learning.Machine Learning,1, pp. 243–248 (1986)
2. Masch, C.: Text classification with Convolution Neural Net-works on Yelp, IMDB &
sentence polarity dataset, https://github.com/cmasch/cnn-text-classification,24/02/2019.
3. Moschitti, A., Basili, R.: Complex Linguistic Features for Text Classification: A
Comprehensive Study. In: Lecture Notes in Computer Science vol. 2997, pp. 181-196,
Springer Science + Business Media (2004)
73