0% found this document useful (0 votes)

219 views46 pages

Sentiment Analysis On Manipuri Language

1) The document discusses sentiment analysis on the Manipuri language using machine learning techniques like Naive Bayes classifier and deep learning approaches. 2) It outlines the objectives to classify Manipuri sentences as positive or negative sentiment and explores features like TF-IDF. 3) The proposed model involves data collection, preprocessing like transliteration, feature extraction using TF-IDF and sentiment analysis using Naive Bayes and deep learning methods.

Uploaded by

RAHUL KUMAR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

219 views46 pages

Sentiment Analysis On Manipuri Language

Uploaded by

RAHUL KUMAR

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 46

SENTIMENT ANALYSIS ON MANIPURI

LANGUAGE

Project Supervisor

Loitongbam Sanayai Meetei

Dr. Samir Kumar Borgohain
M Tech Scholar : CS-16-25-101 Assistant Professor
Department of CSE, NIT Silchar Department of CSE, NIT Silchar
CONTENT
 Introduction
 Literature Review

 Problem Statement

 Objective

 Proposed Model

 Future Work

 Reference
WHAT IS SENTIMENT ANALYSIS?
LITERATURE SURVEY
Serial Reference Findings Limitations
No.

1 [1] Kishorjith N. The data were processed for Mainly focus on the feature
et al.: Verb based Part of Speech (POS) selection and the sentiment
Manipuri tagging using Conditional decider was based on a simple
sentiment analysis Random Field (CRF). counting method. More methods
Polarity being notified for and algorithm can be
each of the verbs, the implemented and explored
highest number of polarity
being the sentiment decider.

2 [2] Hayeon Jang et

al. : Language- Using SVMLight classifier,
specific sentiment comparisons were done on Contrary to their expectations,
analysis in term frequency- inverse the simple classification method
morphologically document frequency (TF- gets higher results.
rich languages IDF) and all possible
combinations of chunking
and shifters.
LITERATURE SURVEY
Serial Reference Findings Limitations
No.
3 [3] P. Vateekul et al: Two deep learning techniques Since the feature extraction
A Study of for the sentiment uses bog of words, signature
Sentiment Analysis classification of Thai Twitter words may be less.
Using Deep data, i.e., Convolutional
Learning Neural Network and Long
Techniques on Thai Short Term Memory (LSTM).
Twitter Data Both techniques were found to
give significantly higher
accuracies than classical
techniques.
PROBLEM STATEMENT:

 Lack of data

 Lack of polarity tagged data

 Lack of part of speech (POS) tagger

 Most of the data were in Bengali script

OBJECTIVE

 Sentiment analysis on Manipuri Language, that is to classify the

sentence to Positive or Negative sentiment.

 Since no classification model have been applied on Manipur

language in the sentiment analysis [1].
We will exploring the Naïve Bayes classifier and deep learning
approach.
ABOUT MANIPURI LANGUAGE
 Manipuri language, a Tibeto-Burman language spoken
predominantly in Manipur, a northeastern state of India.
 Smaller speech communities exist in the Indian states of Assam,
Mizoram, and Tripura, as well as in Bangladesh and Myanmar
(Burma)
 Subject Object Verb (SOV) language
 E.g. Robert na lafoi chakhre ( its English transliteration is “Robert
banana ate” which in the English language would be “Robert ate banana” ,
which is a Subject Verb Object format)

 Agglutinative language

Language Present Present Past perfect

continuous
English go going went
Hindi jata ja raha gaya
Manipuri chatlage chatli chatlure
PROPOSED MODEL :

Data collection

Pre-processing

Feature Extraction

Sentiment Analysis
PROPOSED MODEL : Data from survey and
articles containing
Manipuri text from
Technology Development
for Indian Languages
Data collection (TDIL)

Pre-processing

Feature Extraction

Sentiment Analysis
PROPOSED MODEL :

Data collection • Transliteration from

Bengali script to English.
• Manual annotation of
polarity to each sentences for
supervised training and
Pre-processing ground truth reference

Feature Extraction

Sentiment Analysis
PROPOSED MODEL :

Data collection

Pre-processing
Term Frequency –
Inverse Document
Frequency (TF-IDF)

Feature Extraction

Sentiment Analysis
TF-IDF
Numerical statistic that is intended to reflect how important a
word is to a document in a collection or corpus.

TF (Term Frequency):
Raw count of a term in a document, i.e. the number of times that
term t occurs in document d.
tf(t,d) = ft,d

IDF (Inverse Document Frequency):

Calculated as:

Finally,
tf-idf = tf(t,d) . idf(t)
TF-IDF EXAMPLE
Dataset:

Doc 1 ei koiba chatpa pammi

Doc 2 esei taba nungai amadi esei tabana pothaba fangi
Doc 3 koiba chatpa matam mangni

TF calculation:
koiba chatpa pammi esei nungai pothaba fangi matam manngi

Doc1 1 1 1
Doc 2 2 1 1 1
Doc 3 1 1 1 1
TF-IDF EXAMPLE
IDF calculation:
koiba chatpa pammi esei nungai pothaba fangi matam manngi
Doc1 0.18 0.18 0.48
Doc 2 0.48 0.48 0.48 0.48
Doc 3 0.18 0.18 0.48 0.48

TF-IDF calculation:

koiba chatpa pammi esei nungai pothaba fangi matam manngi

Doc1 0.18 0.18 0.48
Doc 2 0.95 0.48 0.48 0.48
Doc 3 0.18 0.18 0.48 0.48
PROPOSED MODEL :
Data collection

Pre-processing

Feature Extraction
Naïve Bayes

Sentiment Analysis Machine

Learning Deep Learning
NAIVE BAYES

where,
P(Ck| A ) = probability that a training pattern with A attribute
belongs to class Ck ( Posterior probability )
P( A|Ck) = probability that a training pattern of class Ck to have
A attribute ( Conditional probability )
P(Ck) = probability of a training pattern that belongs to class
Ck ( Prior probability )
P( A ) = probability of a training pattern having attributes A
EXAMPLE
Type Doc Words Class

Training 1 ei koiba chatpa pammi pos

2 esei taba nungai amadi esei tabana pothaba fangi pos

3 koiba chatpa matam mangni neg

Testing 4 esei taba matam mangi

TF-IDF :
koiba chatpa pammi esei nungai pothaba fangi matam manngi

Doc1 0.18 0.18 0.48

Doc 2 0.95 0.48 0.48 0.48
Doc 3 0.18 0.18 0.48 0.48
EXAMPLE
Type Doc Words Class

Training 1 koiba chatpa pammi pos

P(pos) = 2/3 2 esei nungai esei pothaba fangi pos

P(neg) = 1/3 3 koiba chatpa matam mangni neg

Testing 4 esei matam mangi

Conditional probability:
P(esei| pos) = [(0.95*2) + 1] / (8 + 9) = 2.9/17 P(pos|d4)
P(matam| pos) = [0 + 1] / (8 + 9) = 1/17 = 2/3* 2.9/17 * (1/17)2
P(mangi| pos) = [0 + 1] / (8 + 9) = 1/17 = 0.000393

P(esei| neg) = [0 + 1] / (3 + 9) = 1/12 P(neg|d4)

P(matam| neg) = [0.48 + 1] / (3 + 9) = 1.48/12 = 1/3 * 1/12 * (1.48/12)2
P(mangi| neg) = [0.48 + 1] / (3 + 9) = 1.48/12 = 0.000422
DEEP LEARNING

Fig 1. Artificial Neural Network Fig 2. Deep neural network

PROGRESS SO FAR

 Data Collection : we have collected around 2000 sentences in Manipuri

language

 Implementation of Transliteration program in progress

 Manual annotation in progress

FUTURE WORK

 Collect more data

 Implementation of the model

REFERENCE

 Kishorjith N., Dilipkumar, K., Wangkheimayum, H., Shinghajith,

K., Sivaji B.: Verb based Manipuri sentiment analysis. IJNLC
3(3), 1307–2278, 2014

 Hayeon Jang and Hyopil Shin : Language-specific sentiment

analysis in morphologically rich languages. In Coling 2010:
Posters, pages 498–506, Beijing, China, August, 2010.

 P. Vateekul and T. Koomsubha : A Study of Sentiment Analysis

Using Deep Learning Techniques on Thai Twitter Data, 2016.
Thank you

Sentiment Analysis Using Naïve Bayes Classifier
No ratings yet
Sentiment Analysis Using Naïve Bayes Classifier
23 pages
Letu Da Notes-Compiled
No ratings yet
Letu Da Notes-Compiled
438 pages
Polarity Detection of Kannada Documents: Deepamala. N Dr. Ramakanth Kumar. P
100% (1)
Polarity Detection of Kannada Documents: Deepamala. N Dr. Ramakanth Kumar. P
4 pages
AIML IA3 Loki & SG
No ratings yet
AIML IA3 Loki & SG
31 pages
Megersa, Thesis Presentation
No ratings yet
Megersa, Thesis Presentation
40 pages
5905 1322
No ratings yet
5905 1322
41 pages
Lecture 3 Sentiment Analysis
No ratings yet
Lecture 3 Sentiment Analysis
41 pages
Santosh - Bharti - Conf - Dynamic SentiPhraseNet
No ratings yet
Santosh - Bharti - Conf - Dynamic SentiPhraseNet
11 pages
Sentiment Analysis: Using Naïve Bayes Classifier
No ratings yet
Sentiment Analysis: Using Naïve Bayes Classifier
18 pages
AJESVol 12no 2July-December2023pp 28-36
No ratings yet
AJESVol 12no 2July-December2023pp 28-36
10 pages
A Transfer Learning Framework For Sentiment Analysis in Indian Vernaculars
No ratings yet
A Transfer Learning Framework For Sentiment Analysis in Indian Vernaculars
9 pages
Final Conference Submission
No ratings yet
Final Conference Submission
9 pages
A Study of Feature Extraction Techniques For
No ratings yet
A Study of Feature Extraction Techniques For
12 pages
Classifier Series - Naive Bayes Sentiment Analysis
No ratings yet
Classifier Series - Naive Bayes Sentiment Analysis
10 pages
2023 Dravidianlangtech-1 30
No ratings yet
2023 Dravidianlangtech-1 30
6 pages
A Study of The Application of Weight Distributing Method Combining Sentiment Dictionary and TF-IDF For Text Sentiment Analysis
No ratings yet
A Study of The Application of Weight Distributing Method Combining Sentiment Dictionary and TF-IDF For Text Sentiment Analysis
10 pages
1 PB
No ratings yet
1 PB
5 pages
Chi-Square Feature Selection Effect On Naive Bayes Classifier Algorithm Performance For Sentiment Analysis Document
No ratings yet
Chi-Square Feature Selection Effect On Naive Bayes Classifier Algorithm Performance For Sentiment Analysis Document
8 pages
A Comparative Study On TF-IDF Feature Weighting Method and Its Analysis Using Unstructured Dataset
No ratings yet
A Comparative Study On TF-IDF Feature Weighting Method and Its Analysis Using Unstructured Dataset
10 pages
Researchpaper
No ratings yet
Researchpaper
9 pages
Domain: Natural Language Processing Title: Automatic Sentiment Detection in Naturalistic Audio
No ratings yet
Domain: Natural Language Processing Title: Automatic Sentiment Detection in Naturalistic Audio
18 pages
Lecture 5 - Language Representation Tf-Idf
No ratings yet
Lecture 5 - Language Representation Tf-Idf
51 pages
Sentiment Analysis: Srishti Chaubey
No ratings yet
Sentiment Analysis: Srishti Chaubey
40 pages
Multimedia Application L8
No ratings yet
Multimedia Application L8
68 pages
Naive Bayes
No ratings yet
Naive Bayes
56 pages
2024 Dravidianlangtech-1 21
No ratings yet
2024 Dravidianlangtech-1 21
5 pages
TF Idf
No ratings yet
TF Idf
27 pages
Resource Creation Towards Automated Sentiment Analysis in Telugu (A Low Resource Language) and Integrating Multiple Domain Sources To Enhance Sentiment Prediction
No ratings yet
Resource Creation Towards Automated Sentiment Analysis in Telugu (A Low Resource Language) and Integrating Multiple Domain Sources To Enhance Sentiment Prediction
8 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
64 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
4 pages
E-Commerce Data: Topic-10: Text Analytics - Sentiment Analysis & Opinion Mining
No ratings yet
E-Commerce Data: Topic-10: Text Analytics - Sentiment Analysis & Opinion Mining
17 pages
Manuscript Updated-1
No ratings yet
Manuscript Updated-1
10 pages
2023 Dravidianlangtech-1 24
No ratings yet
2023 Dravidianlangtech-1 24
4 pages
BAI601 Module 3 PDF
No ratings yet
BAI601 Module 3 PDF
19 pages
Lect 5
No ratings yet
Lect 5
40 pages
2024 Dravidianlangtech-1 43
No ratings yet
2024 Dravidianlangtech-1 43
5 pages
NLPPR7
No ratings yet
NLPPR7
6 pages
Learning Based Approach For Hindi Text S 77957aeb
No ratings yet
Learning Based Approach For Hindi Text S 77957aeb
8 pages
Survey of Entiment Classification Techniques Used For Ndian Regional Languages
No ratings yet
Survey of Entiment Classification Techniques Used For Ndian Regional Languages
14 pages
Bag of Words
No ratings yet
Bag of Words
19 pages
Robert Chan, Michael Wang, Multiclass Sentiment Analysis of Movie Reviews
No ratings yet
Robert Chan, Michael Wang, Multiclass Sentiment Analysis of Movie Reviews
5 pages
Sentimental Analysis Using NLP
No ratings yet
Sentimental Analysis Using NLP
5 pages
Exploiting Emojis in Sentiment Analysis A Survey
No ratings yet
Exploiting Emojis in Sentiment Analysis A Survey
14 pages
Vietnamese Sentiment Analysis Under Limited Training Data
No ratings yet
Vietnamese Sentiment Analysis Under Limited Training Data
14 pages
W04 3253 PDF
No ratings yet
W04 3253 PDF
7 pages
SSRN Id3349572
No ratings yet
SSRN Id3349572
4 pages
Sentiment Analysis Using Machine Learning Classifiers
No ratings yet
Sentiment Analysis Using Machine Learning Classifiers
41 pages
2012 Liviu P. Dinu, Iulia Iuga, 2012. The Naive Bayes Classifier in Opinion Mining - in Search of The Best Feature
No ratings yet
2012 Liviu P. Dinu, Iulia Iuga, 2012. The Naive Bayes Classifier in Opinion Mining - in Search of The Best Feature
12 pages
Picet Presentation
No ratings yet
Picet Presentation
12 pages
A Novel Machine Learning Approach For Sentiment Analysis Based On Adverb-Adjective-Noun-Verb (AANV) Combinations
No ratings yet
A Novel Machine Learning Approach For Sentiment Analysis Based On Adverb-Adjective-Noun-Verb (AANV) Combinations
5 pages
Natural Language Processing For Sentiment Analysis - Ankur Shukla
No ratings yet
Natural Language Processing For Sentiment Analysis - Ankur Shukla
27 pages
1 s2.0 S187705091630463X Main
No ratings yet
1 s2.0 S187705091630463X Main
6 pages
Cs221 Report
No ratings yet
Cs221 Report
16 pages
Assignment 4
No ratings yet
Assignment 4
5 pages
Sentiment Analysis in Python Using NLTK: December 2016
No ratings yet
Sentiment Analysis in Python Using NLTK: December 2016
3 pages
Feature Extraction Techniques in NLP
No ratings yet
Feature Extraction Techniques in NLP
10 pages
Introduction To Sentiment Analysis PDF
No ratings yet
Introduction To Sentiment Analysis PDF
32 pages
Power HP Ecu PDF
100% (3)
Power HP Ecu PDF
82 pages
Sentiment Analysis Using Feature Selection and Machine Learning Algorithms
No ratings yet
Sentiment Analysis Using Feature Selection and Machine Learning Algorithms
48 pages
Sentiment Analysis For Vietnamese: Binh Thanh Kieu Son Bao Pham
No ratings yet
Sentiment Analysis For Vietnamese: Binh Thanh Kieu Son Bao Pham
6 pages
Sentence Level Sentiment Analysis
No ratings yet
Sentence Level Sentiment Analysis
8 pages
Module5 Quiz
100% (1)
Module5 Quiz
34 pages
Islamic Names & Meanings in Urdu - Muslim Boys & Muslim Girls Names
48% (25)
Islamic Names & Meanings in Urdu - Muslim Boys & Muslim Girls Names
2 pages
NEET - Haloalkanes & Haloarenes - (Q+S)
No ratings yet
NEET - Haloalkanes & Haloarenes - (Q+S)
18 pages
1 F40, R-41, In-House IHTM-14 Test Report
No ratings yet
1 F40, R-41, In-House IHTM-14 Test Report
1 page
TA1 English - Mini Excavator
No ratings yet
TA1 English - Mini Excavator
15 pages
List of Facilitation Centre PDF
No ratings yet
List of Facilitation Centre PDF
8 pages
Electric Charges and Fields
No ratings yet
Electric Charges and Fields
58 pages
Mobile SDK Developer Guide
No ratings yet
Mobile SDK Developer Guide
387 pages
"Standing On The Shoulders of Giants": Dominican College of Tarlac
100% (1)
"Standing On The Shoulders of Giants": Dominican College of Tarlac
3 pages
Impact of Load Variation On Power System Stability and Performance of Power System Stabilizers: A Case Study of Peerdawd Gas Power Station, Iraq
No ratings yet
Impact of Load Variation On Power System Stability and Performance of Power System Stabilizers: A Case Study of Peerdawd Gas Power Station, Iraq
15 pages
Crypto8e Merged
100% (1)
Crypto8e Merged
492 pages
Opportunity at Risk
No ratings yet
Opportunity at Risk
88 pages
Lecture 21 Analysis of Rainfall Data
No ratings yet
Lecture 21 Analysis of Rainfall Data
10 pages
Research Reports
No ratings yet
Research Reports
11 pages
Bell's Palsy Treatment and Recovery: The Pharmaceutical Journal
No ratings yet
Bell's Palsy Treatment and Recovery: The Pharmaceutical Journal
5 pages
Official Resume
No ratings yet
Official Resume
1 page
Hazardous Substance Fact Sheet: Right To Know
No ratings yet
Hazardous Substance Fact Sheet: Right To Know
6 pages
Chapter 1 - Notes - Fixed Income Analysis
No ratings yet
Chapter 1 - Notes - Fixed Income Analysis
3 pages
PDF
No ratings yet
PDF
4 pages
Plan
No ratings yet
Plan
1 page
Buy Social Security Number SSN
No ratings yet
Buy Social Security Number SSN
8 pages
Types of Lighting
No ratings yet
Types of Lighting
7 pages
Pac 6500-Sira 16 Atex 2362-00
No ratings yet
Pac 6500-Sira 16 Atex 2362-00
3 pages
Payment Receipt
No ratings yet
Payment Receipt
1 page
Staff Slelection Commission Junior Engineers (Civil, Mechanical, Electrical, Quantity Surveying and Contract) Examination, 2015
No ratings yet
Staff Slelection Commission Junior Engineers (Civil, Mechanical, Electrical, Quantity Surveying and Contract) Examination, 2015
1 page
ECONOMY
No ratings yet
ECONOMY
18 pages
Appointment Reciept
No ratings yet
Appointment Reciept
3 pages
Rajant SpecSheet LX5 Squid Cable 110817
No ratings yet
Rajant SpecSheet LX5 Squid Cable 110817
2 pages
Question: EXAMPLE A Fermentation Medium Contains An Initial Spores Co
No ratings yet
Question: EXAMPLE A Fermentation Medium Contains An Initial Spores Co
2 pages
Layout 2 of 250-300TPH Crushing Plant-20240925
No ratings yet
Layout 2 of 250-300TPH Crushing Plant-20240925
1 page
Refind Conf
No ratings yet
Refind Conf
8 pages
Figlet
No ratings yet
Figlet
10 pages
1st Sem Result
No ratings yet
1st Sem Result
1 page
A Handbook of Computational Linguistics: Artificial Intelligence in Natural Language Processing
From Everand
A Handbook of Computational Linguistics: Artificial Intelligence in Natural Language Processing
Youddha Beer Singh
No ratings yet