0% found this document useful (0 votes)
60 views

NLP Assignment 2

The document discusses using Python and NLP libraries to perform natural language processing on three languages. It introduces NLP and describes common NLP tasks. Two popular Python NLP libraries are covered - NLTK, which is well known and full-featured, and iNLTK, which focuses on Indian languages. Key features of each library include tokenization, parsing, tagging, and word embeddings. The assignment aims to apply these libraries to perform NLP on three languages.

Uploaded by

Radhe Shyam
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
60 views

NLP Assignment 2

The document discusses using Python and NLP libraries to perform natural language processing on three languages. It introduces NLP and describes common NLP tasks. Two popular Python NLP libraries are covered - NLTK, which is well known and full-featured, and iNLTK, which focuses on Indian languages. Key features of each library include tokenization, parsing, tagging, and word embeddings. The assignment aims to apply these libraries to perform NLP on three languages.

Uploaded by

Radhe Shyam
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

Assignment No - 2

Natural Languages Processing

Aim :- Using programming language Python and suitable libraries perform fundamental
Language processing for three different languages.

Theory:-

NLP stands for Natural Language Processing, which is a part of Computer Science, Human
language, and Artificial Intelligence.

It is the technology that is used by machines to understand, analyse, manipulate, and interpret
human's languages.

It helps developers to organize knowledge for performing tasks such as translation, automatic
summarization, Named Entity Recognition (NER), speech recognition, relationship extraction,
and topic segmentation.

Steps Of NLP - 1) Lexical Analysis

2) Syntactic Analysis

3) Semantic Analysis

4) Discourse Integration

5) Pragmatic Analysis

NLP Libraries -

1) NLTK –

NLTK is a leading platform for building Python programs to work with human language
data. It provides easy-to-use interfaces to over 50 corpora and lexical resources such as
WordNet, along with a suite of text processing libraries for classification, tokenization,
stemming, etc. This library provides a practical introduction to programming for
language processing. NLTK has been called “a wonderful tool for teaching and working
in computational linguistics using Python,” and “an amazing library to play with natural
language.”

Features Of NLTK :-

Helps with text classification

Helps with tokenization

Helps with parsing

Helps with part-of-speech tagging

Helps with stemming

The most well-known and full NLP library.

Plenty of approaches to each NLP task.

It Supports a significant number of languages.

2) iNLTK –

The iNLTK library is the Indian language equivalent of the popular NLTK Python package.
This library is built with the goal of providing features that an NLP application developer
will need.iNLTK provides most of the features that modern NLP tasks require, like
generating a vector embedding for input text, tokenization, sentence similarity etc. in a
very intuitive and easy API interface.

Features Of iNLTK -

Tokenization

Word Embeddings

Text Completion

Similarity of sentences

This feature of iNLTK is very useful for text data augmentation as we can just multiply the
sentences in our training data by populating it with sentences that have a similar meaning.
Code and output -
Conclusion –

In this assignment, we have done natural language processing on three different languages
using different python libraries.

You might also like