NLP Assignment 2
NLP Assignment 2
Aim :- Using programming language Python and suitable libraries perform fundamental
Language processing for three different languages.
Theory:-
NLP stands for Natural Language Processing, which is a part of Computer Science, Human
language, and Artificial Intelligence.
It is the technology that is used by machines to understand, analyse, manipulate, and interpret
human's languages.
It helps developers to organize knowledge for performing tasks such as translation, automatic
summarization, Named Entity Recognition (NER), speech recognition, relationship extraction,
and topic segmentation.
2) Syntactic Analysis
3) Semantic Analysis
4) Discourse Integration
5) Pragmatic Analysis
NLP Libraries -
1) NLTK –
NLTK is a leading platform for building Python programs to work with human language
data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as
WordNet, along with a suite of text processing libraries for classification, tokenization,
stemming, etc. This library provides a practical introduction to programming for
language processing. NLTK has been called “a wonderful tool for teaching and working
in computational linguistics using Python,” and “an amazing library to play with natural
language.”
Features Of NLTK :-
2) iNLTK –
The iNLTK library is the Indian language equivalent of the popular NLTK Python package.
This library is built with the goal of providing features that an NLP application developer
will need.iNLTK provides most of the features that modern NLP tasks require, like
generating a vector embedding for input text, tokenization, sentence similarity etc. in a
very intuitive and easy API interface.
Features Of iNLTK -
Tokenization
Word Embeddings
Text Completion
Similarity of sentences
This feature of iNLTK is very useful for text data augmentation as we can just multiply the
sentences in our training data by populating it with sentences that have a similar meaning.
Code and output -
Conclusion –
In this assignment, we have done natural language processing on three different languages
using different python libraries.