0% found this document useful (0 votes)

20 views5 pages

SMA4

The document outlines an experiment on Exploratory Data Analysis (EDA) and visualization of social media data, specifically using Twitter data. It discusses the use of the TextBlob library for sentiment analysis and the creation of Word Clouds for visual representation of data. The conclusion emphasizes the importance of EDA in cleaning and making data meaningful.

Uploaded by

mgade3012

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views5 pages

SMA4

Uploaded by

mgade3012

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

DEPARTMENT OF ARTIFICIAL INTELLIGENCE

& DATA SCIENCE

Subject: Social Media Analytics Course Code: CSDOL8023

Semester: VIII Course: AI & DS
Laboratory No: 302 Name of Subject Teacher: Prof. Gitanjali Korgaonkar
Name of Student: Meghana Gade Roll Id: VU2S2223002

Experiment no: 4
Aim: Exploratory Data Analysis and visualization of Social Media Data for business.

Theory:

Exploratory data analysis (EDA) is used by data scientists to analyze and investigate data sets and summarize their main
characteristics, often employing data visualization methods. It helps determine how best to manipulate data sources to get
the answers you need, making it easier for data scientists to discover patterns, spot anomalies, test a hypothesis, or check
assumptions.

TextBlob is a Python (2 and 3) library for processing textual data. It provides a simple API for diving into common
natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis,
classification, translation, and more.

Word Clouds came out to be a game-changer visualization technique for understanding and determining patterns and
evolving trends. Whether to discover the political agendas of aspiring election candidates of a country or to analyze the
customer reviews on the recently launched product, one can get a visual representation by plotting the Word Cloud.

1. Twitter Data Cleaning, Preprocessing and Exploratory Data Analysis:

df1=df1.drop_duplicates("renderedContent") df1.shape
df1.head df1.date.value_counts()
#Heat Map for missing Values plt.figure(figsize=(17, 5))
sns.heatmap(df1.isnull(), cbar=True, yticklabels=False) plt.xlabel("Column_Name", size=14, weight="bold")
plt.title("Places of missing values in column",size=17) plt.show()

import plotly.graph_objects as go
Top_Location_Of_tweet= df1['place'].value_counts().head (10) print(Top_Location_Of_tweet)

def get_subjectivity(text):
return TextBlob(text).sentiment.subjectivity def get_polarity(text):
return TextBlob(text).sentiment.polarity

df1['subjectivity']=df1[ 'renderedContent'].apply(get_subjectivity)
df1[ 'polarity' ]=df1[ 'renderedContent'].apply(get_polarity) df1.head()

2. Sentiment Analysis

df1['textblob_score'] =df1[ 'renderedContent'].apply(lambda x: TextBlob(x).sentiment.polarity) neutral_threshold=0.05

df1['textblob_sentiment']=df1[ 'textblob_score'].apply(lambda c:'positive' if c >= neutral_threshold
else ('Negative' if c <= -(neutral_threshold) else 'Neutral' ) )

textblob_df = df1[['renderedContent','textblob_sentiment','likeCount']] textblob_df

textblob_df["textblob_sentiment"].value_counts() textblob_df["textblob_sentiment"].value_counts().plot.barh(title,
Analysis',color='orange' , width=.4, figsize=(12,8),stacked = True) = 'Sentiment

df_positive=textblob_df[textblob_df['textblob_sentiment']=='positive' ]
df_very_positive=df_positive[df_positive['likeCount']>0] df_very_positive.head()
df_negative=textblob_df[textblob_df['textblob_sentiment']=='Negative' ] df_negative
df_neutral=textblob_df[textblob_df['textblob_sentiment']=='Neutral' ] df_neutral

3. Visualization by creating a Word Cloud

from wordcloud import WordCloud, STOPWORDS from PIL import Image

#Creating the text variable
positive_tw="".join(t for t in df_very_positive.renderedContent)
#Creating word_cloud with text as argument in .generate() rtpthod
word_cloud1=WordCloud (collocations=False, background_color='white'). generate (positive_tw)
# Display the generated Word Cloud plt.imshow(word_cloud1,interpolation='bilinear) plt.axis('off')
plt.show()

#Creating the text variable

negative_tw="".join (t for t in df_negative.rendered Content) #Creating word_cloud with text as argument in.generate()
rtpthod
word_cloud2=WordCloud(collocations=False,background_color='white').generate(negative_tw) # Display the generated
Word Cloud
plt.imshow(word_cloud2,interpolation='bilinear') plt.axis('off')
plt.show()

#Creating the text variable

neutral_tw="".join(tfor tindf_neutral.renderedContent)
#Creating word_cloud with text as argument in. generate() rtpthod
word_cloud2=WordCloud(collocations=False,background_color='white').generate(neutral_tw) # Display the generated Word Cloud
plt.imshow(word_cloud2,interpolation='bilinear') plt.axis('off')
plt.show()
Conclusion:

Exploratory data analysis helps to get data which is cleaned and meaningful. Twitter data is explored using textblob
package and visualization by using wordcloud package of python.

R1 R2 R3
DOP DOS Conduction File Record Viva Voice Total Signature
5 Marks 5 Marks 5 Marks 15 Marks
R1 R2 R3
DOP DOS Conduction File Record Viva Voice Total Signature
5 Marks 5 Marks 5 Marks 15 Marks

Book - Handbook of Collaborative Learning (2013)
100% (1)
Book - Handbook of Collaborative Learning (2013)
498 pages
Libro de Inglés
80% (5)
Libro de Inglés
108 pages
Semantic Roles PDF
100% (2)
Semantic Roles PDF
20 pages
Twitter Sentiment Analysis (NLP) : This Photo CC By-Nc
100% (1)
Twitter Sentiment Analysis (NLP) : This Photo CC By-Nc
18 pages
10) Big Data 4 Social Media Analytics
No ratings yet
10) Big Data 4 Social Media Analytics
65 pages
1python Full-1 Project
No ratings yet
1python Full-1 Project
21 pages
Final Twitter - Sentiment - Analysis - Report
100% (1)
Final Twitter - Sentiment - Analysis - Report
14 pages
Regression Modeling 1698066428
No ratings yet
Regression Modeling 1698066428
23 pages
Twitter Sentiment Analysis Dss
No ratings yet
Twitter Sentiment Analysis Dss
14 pages
By Olivia Wilson
No ratings yet
By Olivia Wilson
11 pages
Analyzing Social Media Data in Python Chapter2
No ratings yet
Analyzing Social Media Data in Python Chapter2
30 pages
All Exp Lab
No ratings yet
All Exp Lab
15 pages
Python Portfolio Project For Data Analyst
No ratings yet
Python Portfolio Project For Data Analyst
13 pages
Social Media Sentimental Analysis 1
No ratings yet
Social Media Sentimental Analysis 1
30 pages
Part C - Assignment No. 2 Mini-Project On Twitter
No ratings yet
Part C - Assignment No. 2 Mini-Project On Twitter
7 pages
Notes
No ratings yet
Notes
8 pages
Sarthak Synopsis
No ratings yet
Sarthak Synopsis
7 pages
Sentiment Analysis On User-Generated Tweets
No ratings yet
Sentiment Analysis On User-Generated Tweets
15 pages
Tiktok Project: Exploratory Data Analysis: Background On The Tiktok Scenario
No ratings yet
Tiktok Project: Exploratory Data Analysis: Background On The Tiktok Scenario
22 pages
EXP5
No ratings yet
EXP5
15 pages
IR Case Study Final Presentation
No ratings yet
IR Case Study Final Presentation
12 pages
Business Data Management DISC 325
No ratings yet
Business Data Management DISC 325
21 pages
Wheeler's Cyclical Model
No ratings yet
Wheeler's Cyclical Model
10 pages
SMA Exp 2
No ratings yet
SMA Exp 2
4 pages
Sma QB Solution Tt2
No ratings yet
Sma QB Solution Tt2
40 pages
FML Project Report
No ratings yet
FML Project Report
18 pages
Chandru Lab 3
No ratings yet
Chandru Lab 3
7 pages
SMA Expt 4
No ratings yet
SMA Expt 4
13 pages
DSDM Unit4
No ratings yet
DSDM Unit4
31 pages
Part C Assignment No 2 Mini Project On Twitter 1
No ratings yet
Part C Assignment No 2 Mini Project On Twitter 1
9 pages
Template For The First Slide of PPT Presentation1
No ratings yet
Template For The First Slide of PPT Presentation1
18 pages
(With Notes) Presupposition and Entailment
100% (1)
(With Notes) Presupposition and Entailment
32 pages
Social Media
No ratings yet
Social Media
7 pages
Thesis - Aru Omarali
No ratings yet
Thesis - Aru Omarali
34 pages
Sma Exp 03 Code Print
No ratings yet
Sma Exp 03 Code Print
5 pages
Sma Exp 4
No ratings yet
Sma Exp 4
5 pages
Business Analytics CA3
No ratings yet
Business Analytics CA3
11 pages
Sma Exp 09 Code Print
No ratings yet
Sma Exp 09 Code Print
5 pages
Lab No 6 - Twitter - Neuro
No ratings yet
Lab No 6 - Twitter - Neuro
2 pages
Sma Exp 04 Code Print
No ratings yet
Sma Exp 04 Code Print
5 pages
DS - Lab Report.
No ratings yet
DS - Lab Report.
25 pages
Raj DV Exp5
No ratings yet
Raj DV Exp5
6 pages
PSOSM Lectures
No ratings yet
PSOSM Lectures
37 pages
Raj DV Exp4
No ratings yet
Raj DV Exp4
4 pages
Twitter Sentiment Analysis Research Paper
No ratings yet
Twitter Sentiment Analysis Research Paper
5 pages
Parent Interview of Social Functioning
100% (1)
Parent Interview of Social Functioning
2 pages
Twitter Sentiment Analysis For Product Review
No ratings yet
Twitter Sentiment Analysis For Product Review
19 pages
Wrangle Report
No ratings yet
Wrangle Report
4 pages
Vaibhav DSBDA Project
No ratings yet
Vaibhav DSBDA Project
16 pages
Michael Brownstein-The Implicit Mind - Cognitive Architecture, The Self, and Ethics-Oxford University Press, USA (2018)
100% (2)
Michael Brownstein-The Implicit Mind - Cognitive Architecture, The Self, and Ethics-Oxford University Press, USA (2018)
273 pages
Wrangle Report
No ratings yet
Wrangle Report
3 pages
Mini Project BDA
No ratings yet
Mini Project BDA
9 pages
Part C - Assignment No. 2 Mini-Project On Twitter
No ratings yet
Part C - Assignment No. 2 Mini-Project On Twitter
7 pages
Social Media Se
No ratings yet
Social Media Se
3 pages
Sma 3
No ratings yet
Sma 3
3 pages
Twitter Sentiment Analysis Using Python
No ratings yet
Twitter Sentiment Analysis Using Python
21 pages
Group11 Report
No ratings yet
Group11 Report
18 pages
Tejeshwarm - 38158253 - 286987005 - 22it118 Assignment 1 Psom-2-7
No ratings yet
Tejeshwarm - 38158253 - 286987005 - 22it118 Assignment 1 Psom-2-7
6 pages
App (Linkedin) Reviews Sentiment Analysis Using Python
No ratings yet
App (Linkedin) Reviews Sentiment Analysis Using Python
1 page
Gödel's Incompleteness Theorems
No ratings yet
Gödel's Incompleteness Theorems
6 pages
Sma Exp9
No ratings yet
Sma Exp9
4 pages
Project Demonstration FDS
No ratings yet
Project Demonstration FDS
4 pages
Twitter Sentiment Analysis
No ratings yet
Twitter Sentiment Analysis
4 pages
Maneesha Nidigonda Verzeo Major Project
No ratings yet
Maneesha Nidigonda Verzeo Major Project
11 pages
Sentiment Analysis For Twitter Comments Project Exp
No ratings yet
Sentiment Analysis For Twitter Comments Project Exp
5 pages
Expert System
100% (1)
Expert System
54 pages
Primary Mental Abilities 1
100% (1)
Primary Mental Abilities 1
10 pages
The Epistemology of Cognitive Literary Studies: Hart, F. Elizabeth (Faith Elizabeth), 1959
No ratings yet
The Epistemology of Cognitive Literary Studies: Hart, F. Elizabeth (Faith Elizabeth), 1959
22 pages
Hospital Business Plan
No ratings yet
Hospital Business Plan
41 pages
Theories Assignment 1
No ratings yet
Theories Assignment 1
7 pages
Parental Involvement and Their Impact On Reading English of Students Among The Rural School in Malaysia
No ratings yet
Parental Involvement and Their Impact On Reading English of Students Among The Rural School in Malaysia
8 pages
An Essay On The History of Civil Society (1767) - Adam Ferguson Facs
100% (1)
An Essay On The History of Civil Society (1767) - Adam Ferguson Facs
476 pages
Weebly Syllabus April
No ratings yet
Weebly Syllabus April
4 pages
2017 Jess Final Report 1
No ratings yet
2017 Jess Final Report 1
6 pages
Chapter 1 Business Communication: Tip: Success
No ratings yet
Chapter 1 Business Communication: Tip: Success
3 pages
Artificial Intelligence Interview Questions: Click Here
No ratings yet
Artificial Intelligence Interview Questions: Click Here
44 pages
Mensuration 02 - Class Notes
No ratings yet
Mensuration 02 - Class Notes
25 pages
Annurev Food 060721 023619
No ratings yet
Annurev Food 060721 023619
25 pages
Transactional Theory Powerpoint
100% (1)
Transactional Theory Powerpoint
7 pages
Assessment of Tertiary Education Readiness
No ratings yet
Assessment of Tertiary Education Readiness
10 pages
IBM (CSRBOX) Sales Insights
No ratings yet
IBM (CSRBOX) Sales Insights
12 pages
Aifba P6-M
No ratings yet
Aifba P6-M
7 pages
Concord in English Grammar
No ratings yet
Concord in English Grammar
15 pages
DLL Arts Q3 W5
No ratings yet
DLL Arts Q3 W5
7 pages
Dear Me Lesson Plan
No ratings yet
Dear Me Lesson Plan
2 pages
Modern Law Review - March 1986 - Susskind - EXPERT SYSTEMS IN LAW A JURISPRUDENTIAL APPROACH TO ARTIFICIAL INTELLIGENCE
No ratings yet
Modern Law Review - March 1986 - Susskind - EXPERT SYSTEMS IN LAW A JURISPRUDENTIAL APPROACH TO ARTIFICIAL INTELLIGENCE
27 pages
SSC CGL 12 Week Study Plan
No ratings yet
SSC CGL 12 Week Study Plan
3 pages
SMA Exp 2
No ratings yet
SMA Exp 2
3 pages
CS607 MidTerm MCQs With Reference Solved by Arslan 1
No ratings yet
CS607 MidTerm MCQs With Reference Solved by Arslan 1
6 pages
SSC CGL Study Plan
No ratings yet
SSC CGL Study Plan
1 page
Simple Past Tense Recount TEXT Explain and Example
No ratings yet
Simple Past Tense Recount TEXT Explain and Example
5 pages
RPT Bahasa Inggeris Tahun 5
No ratings yet
RPT Bahasa Inggeris Tahun 5
11 pages
Subject: Artificial Intelligence Term: Mid Question No. 1 (10 0.5)
No ratings yet
Subject: Artificial Intelligence Term: Mid Question No. 1 (10 0.5)
3 pages
The Perception Process:: I. How Our Perceptions Affect Our Communication With Others. We Are
No ratings yet
The Perception Process:: I. How Our Perceptions Affect Our Communication With Others. We Are
4 pages
Mastering Microsoft Dynamics 365 Business Central: The complete guide for designing and integrating advanced Business Central solutions
From Everand
Mastering Microsoft Dynamics 365 Business Central: The complete guide for designing and integrating advanced Business Central solutions
Stefano Demiliani
No ratings yet
Getting Started with Greenplum for Big Data Analytics
From Everand
Getting Started with Greenplum for Big Data Analytics
Sunila Gollapudi
No ratings yet
Learn C# Programming by Creating Games with Unity
From Everand
Learn C# Programming by Creating Games with Unity
Patrick Felicia
No ratings yet

SMA4

Uploaded by

SMA4

Uploaded by

DEPARTMENT OF ARTIFICIAL INTELLIGENCE

& DATA SCIENCE

Subject: Social Media Analytics Course Code: CSDOL8023

1. Twitter Data Cleaning, Preprocessing and Exploratory Data Analysis:

df1['textblob_score'] =df1[ 'renderedContent'].apply(lambda x: TextBlob(x).sentiment.polarity) neutral_threshold=0.05

textblob_df = df1[['renderedContent','textblob_sentiment','likeCount']] textblob_df

3. Visualization by creating a Word Cloud

from wordcloud import WordCloud, STOPWORDS from PIL import Image

#Creating the text variable

#Creating the text variable

You might also like