Sentiment Analysis


In [17]: import pandas as pd
import re
import nltk   # needed for nltk.download() in the next cell
from nltk.corpus import stopwords
from nltk.tokenize import word_tokenize
from nltk.stem import WordNetLemmatizer

In [18]: nltk.download('punkt')
nltk.download('stopwords')
nltk.download('wordnet')

[nltk_data] Downloading package punkt to


[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package punkt is already up-to-date!
[nltk_data] Downloading package stopwords to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package stopwords is already up-to-date!
[nltk_data] Downloading package wordnet to
[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...
[nltk_data] Package wordnet is already up-to-date!

Out[18]: True

Method 1
In [19]: df = pd.read_csv('reviews.csv', usecols=['body'])
lemma = WordNetLemmatizer()
stop_words = stopwords.words('english')

In [20]: def text_prep(x):
             # lowercase, keep letters only, tokenize, drop stopwords, lemmatize
             corp = str(x).lower()
             corp = re.sub('[^a-zA-Z]+', ' ', corp).strip()
             tokens = word_tokenize(corp)
             words = [t for t in tokens if t not in stop_words]
             lemmatize = [lemma.lemmatize(w) for w in words]
             return lemmatize

In [22]: preprocess_tag = [text_prep(i) for i in df['body']]
df["preprocess_txt"] = preprocess_tag
df['total_len'] = df['preprocess_txt'].map(lambda x: len(x))
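
As a quick sanity check (a hypothetical example, not part of the original run), text_prep can be called on a single string; with the preprocessing above it should lowercase the text, strip punctuation, drop stopwords and lemmatize the remaining tokens:

In [ ]: # Illustrative only: apply text_prep to one made-up review sentence
sample = "The cameras on these phones are really amazing!"
print(text_prep(sample))
# Expected output, roughly: ['camera', 'phone', 'really', 'amazing']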

In [24]: # Load the negative and positive opinion word lists
file = open('negative-words.txt', 'r')
neg_words = file.read().split()
file = open('positive-words.txt', 'r')
pos_words = file.read().split()


In [27]: num_pos = df['preprocess_txt'].map(lambda x: len([i for i in x if i in pos_words]))
df['pos_count'] = num_pos
num_neg = df['preprocess_txt'].map(lambda x: len([i for i in x if i in neg_words]))
df['neg_count'] = num_neg
df['sentiment'] = round((df['pos_count'] - df['neg_count']) / df['total_len'], 2)
df.head()

Out[27]:
                                                body                                     preprocess_txt  total_len  pos_count  neg_count  sentiment
0  I had the Samsung A600 for awhile which is abs...  [samsung, awhile, absolute, doo, doo, read, re...        162         18         18       0.00
1  Due to a software issue between Nokia and Spri...  [due, software, issue, nokia, sprint, phone, t...         67          8          3       0.07
2  This is a great, reliable phone. I also purcha...  [great, reliable, phone, also, purchased, phon...         68         10          4       0.09
3  I love the phone and all, because I really did...  [love, phone, really, need, one, expect, price...         41          3          0       0.07
4  The phone has been great for every purpose it ...  [phone, great, every, purpose, offer, except, ...         56          5          3       0.04
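
In words, Method 1 scores each review as (positive hits - negative hits) / review length, so a review with equal positive and negative counts (row 0) lands at 0.00. A minimal sketch of the same idea on a toy token list, using made-up mini-lexicons rather than the opinion-word files loaded above:

In [ ]: # Illustrative only: tiny stand-in lexicons, not the opinion-word files
toy_pos = {'great', 'love', 'reliable'}
toy_neg = {'issue', 'broken'}
tokens = ['great', 'phone', 'issue', 'love', 'battery']
score = round((sum(t in toy_pos for t in tokens)
               - sum(t in toy_neg for t in tokens)) / len(tokens), 2)
print(score)   # (2 - 1) / 5 = 0.2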

Method 2
In [28]: df['sentiment'] = round(df['pos_count'] / (df['neg_count']+1), 2)
df.head()

Out[28]:
                                                body                                     preprocess_txt  total_len  pos_count  neg_count  sentiment
0  I had the Samsung A600 for awhile which is abs...  [samsung, awhile, absolute, doo, doo, read, re...        162         18         18       0.95
1  Due to a software issue between Nokia and Spri...  [due, software, issue, nokia, sprint, phone, t...         67          8          3       2.00
2  This is a great, reliable phone. I also purcha...  [great, reliable, phone, also, purchased, phon...         68         10          4       2.00
3  I love the phone and all, because I really did...  [love, phone, really, need, one, expect, price...         41          3          0       3.00
4  The phone has been great for every purpose it ...  [phone, great, every, purpose, offer, except, ...         56          5          3       1.25
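
Method 2 scores each review as the ratio pos_count / (neg_count + 1); the +1 in the denominator avoids division by zero when a review has no negative hits, as in row 3 where 3 / (0 + 1) = 3.00. A small arithmetic sketch of the same formula on the counts from rows 0 and 3:

In [ ]: # Same ratio as Method 2, shown on the counts from rows 0 and 3 above
pos_count, neg_count = 18, 18
print(round(pos_count / (neg_count + 1), 2))   # 0.95
pos_count, neg_count = 3, 0
print(round(pos_count / (neg_count + 1), 2))   # 3.0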

In [30]: nltk.download('vader_lexicon')

[nltk_data] Downloading package vader_lexicon to


[nltk_data] C:\Users\student\AppData\Roaming\nltk_data...

Out[30]: True


Method 3
In [35]: from nltk.sentiment.vader import SentimentIntensityAnalyzer

sent = SentimentIntensityAnalyzer()
df = pd.read_csv('reviews.csv', usecols=['body'])
df['body'].fillna('', inplace=True)
polarity = [round(sent.polarity_scores(str(i))['compound'], 2) for i in df['body']]
df['sentiment_score'] = polarity
print(df.head())

body sentiment_score
0 I had the Samsung A600 for awhile which is abs... 0.86
1 Due to a software issue between Nokia and Spri... 0.89
2 This is a great, reliable phone. I also purcha... 0.80
3 I love the phone and all, because I really did... 0.96
4 The phone has been great for every purpose it ... 0.77
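
VADER's compound score is a normalized value in [-1, 1] summarizing the whole text; polarity_scores also returns the individual neg, neu and pos proportions. A quick sketch on a single made-up sentence (illustrative, not from the notebook's data):

In [ ]: # Illustrative: full VADER score dict for one hypothetical sentence
example = "This phone is great, but the battery life is disappointing."
print(sent.polarity_scores(example))
# Returns a dict with 'neg', 'neu', 'pos' and 'compound' keys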

Extra
In [54]: # Create WordNetLemmatizer object
wnl = WordNetLemmatizer()

# single-word lemmatization examples
list1 = ['kites', 'babies', 'dogs', 'flying', 'smiling',
         'driving', 'tried', 'feet']
for words in list1:
    print(words + " ---> " + wnl.lemmatize(words))

print('better' + " ---> " + wnl.lemmatize('better', pos='a'))

kites ---> kite
babies ---> baby
dogs ---> dog
flying ---> flying
smiling ---> smiling
driving ---> driving
tried ---> tried
feet ---> foot
better ---> good
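
Note that lemmatize() treats its input as a noun by default, which is why the verb forms above (flying, smiling, driving, tried) come back unchanged. Passing a part-of-speech tag should give the verb lemmas instead (a small illustrative check, not from the original run):

In [ ]: # Illustrative: lemmatize the same verb forms with pos='v' (verb)
for w in ['flying', 'smiling', 'driving', 'tried']:
    print(w + " ---> " + wnl.lemmatize(w, pos='v'))
# Expected roughly: fly, smile, drive, try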

In [59]: sentence = 'I am good in cricket, but best in Football.'

# Tokenize the sentence
tokens = nltk.word_tokenize(sentence)

# Get English stopwords
english_stopwords = set(stopwords.words('english'))

# Filter out stopwords
filtered_tokens = [word for word in tokens if word.lower() not in english_stopwords]

print(filtered_tokens)

['good', 'cricket', ',', 'best', 'Football', '.']


In [60]: import nltk
from nltk.stem import PorterStemmer

# Sentence to stem
sentence = 'I am good in cricket, but best in Football.'

# Tokenize the sentence
tokens = nltk.word_tokenize(sentence)

# Initialize PorterStemmer
stemmer = PorterStemmer()

# Perform stemming on each token
stemmed_tokens = [stemmer.stem(word) for word in tokens]
print(stemmed_tokens)

['I', 'am', 'good', 'in', 'cricket', ',', 'but', 'best', 'in', 'footbal',
'.']
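
Porter stemming simply chops suffixes by rule (note 'Football' -> 'footbal' above), so its output is not always a dictionary word, whereas the WordNet lemmatizer maps each word to a valid lemma. A quick illustrative comparison (assumed outputs, not from the original run):

In [ ]: # Illustrative: stemming vs lemmatization on the same words
for w in ['babies', 'flying']:
    print(w, '-> stem:', stemmer.stem(w), '| lemma:', wnl.lemmatize(w))
# Expected roughly: babies -> babi / baby, flying -> fli / flying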


