Twitter Python Assignment
Twitter represents a fundamentally new instrument for making social measurements. Millions of people
voluntarily express opinions on virtually any topic imaginable --- this data source is incredibly valuable for
both research and business.
For example, researchers have shown that the "mood" of communication on Twitter reflects
biological rhythms and can even be used to predict the stock market. A student here at UW used
geocoded tweets to plot a map of locations where "thunder" was mentioned in the context of
a storm system in Summer 2012.
Researchers from Northeastern University and Harvard University studying the characteristics and
dynamics of Twitter have an excellent resource for learning more about how Twitter can be used to
analyze moods at national scale.
In this assignment, you will
access the Twitter Application Programming Interface (API) using Python
estimate the public's perception (the sentiment) of a particular term or phrase
analyze the relationship between location and mood based on a sample of twitter data
Some points to keep in mind:
This assignment is open-ended in several ways. You'll need to make some decisions about how
best to solve the problem and implement them carefully.
It is perfectly acceptable to discuss your solution on the forum, but don't share code.
Each student must submit their own solution to the problem.
You will have an unlimited number of tries for each submission.
Your code will be run in a protected environment, so you should only use the Python standard
libraries unless you are specifically instructed otherwise. Your code should also not rely on any
external libraries or web services.
7/1/2014
You will need to establish a Python programming environment to complete this assignment. You can
install Python yourself by downloading it from the Python website, or you can use the class virtual machine.
Unicode strings
Strings in the Twitter data prefixed with the letter "u" are Unicode strings. For example:
u"This is a string"
Unicode is a standard for representing a much larger variety of characters beyond the Roman alphabet
(Greek, Russian, mathematical symbols, logograms from non-phonetic writing systems such as kanji,
etc.).
In most circumstances, you will be able to use a unicode object just like a string.
If you encounter an error involving printing Unicode, you can use the encode method to properly print
the international characters, like this:
unicode_string = u"caf\u00e9"
encoded_string = unicode_string.encode('utf-8')
print(encoded_string)
Getting Started
Once again: If you are new to Python, many students have recommended Google's Python class.
5. On the next page, click the "API Keys" tab along the top, then scroll all the way down until you see
the section "Your Access Token"
https://class.coursera.org/datasci-002/assignment/view?assignment_id=3
6. Click the button "Create My Access Token". You can read more about OAuth authorization on Twitter's developer site.
7. You will now copy four values into twitterstream.py. These values are your "API Key", your "API
secret", your "Access token" and your "Access token secret". All four should now be visible on the
API Keys page. (You may see "API Key" referred to as "Consumer key" in some places in the code
or on the web; they are synonyms.) Open twitterstream.py and set the variables corresponding to
the api key, api secret, access token, and access secret. You will see code like the following:
api_key = "<Enter api key>"
api_secret = "<Enter api secret>"
access_token_key = "<Enter your access token key here>"
access_token_secret = "<Enter your access token secret here>"
8. Run the following and make sure you see data flowing and that no errors occur.
$ python twitterstream.py > output.txt
This command redirects the output to a file. Let the program run for at least 3 minutes so data can
accumulate, then stop it with Ctrl-C. Keep the file output.txt for the duration of the assignment; we will be
reusing it in later problems. Don't use someone else's file; we will check for uniqueness in other
parts of the assignment.
9. If you wish, modify the file to use the twitter search API to search for specific terms. For example, to
search for the term "microsoft", you can pass the following url to the twitterreq function:
https://api.twitter.com/1.1/search/tweets.json?q=microsoft
What to turn in: The first 20 lines of the twitter data you downloaded from the web. You
should save the first 20 lines to a file problem_1_submission.txt by using the following
command:
$ head -n 20 output.txt > problem_1_submission.txt
The file AFINN-111.txt contains a list of pre-computed sentiment scores. Each line in the file contains a
word or phrase followed by a sentiment score. Each word or phrase that is found in a tweet but not
found in AFINN-111.txt should be given a sentiment score of 0. See the file AFINN-README.txt for
more information.
To use the data in the AFINN-111.txt file, you may find it useful to build a dictionary. Note that the
AFINN-111.txt file format is tab-delimited, meaning that the term and the score are separated by a tab
character. A tab character is written "\t". The following snippet may be useful:
afinnfile = open("AFINN-111.txt")
scores = {}  # initialize an empty dictionary
for line in afinnfile:
    term, score = line.split("\t")  # The file is tab-delimited; "\t" means "tab character"
    scores[term] = int(score)  # Convert the score to an integer.
print(scores.items())  # Print every (term, score) pair in the dictionary
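Once the scores dictionary is built, a tweet's sentiment can be computed by summing the scores of its terms. The sketch below is one possible approach, not a required one: splitting on whitespace is a deliberate simplification, and the tweet_sentiment helper name is ours, not part of the assignment skeleton.

```python
def tweet_sentiment(text, scores):
    """Sum the sentiment scores of every term in the tweet.

    Terms not found in the scores dictionary contribute 0, per the
    assignment. Whitespace splitting is a simplification; you may
    want to lowercase, strip punctuation, or handle phrases.
    """
    return sum(scores.get(term, 0) for term in text.split())

# A tiny hand-made scores dictionary for illustration:
scores = {"good": 3, "bad": -3}
print(tweet_sentiment("a good day", scores))     # 3
print(tweet_sentiment("good good bad", scores))  # 3
```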
The data in the tweet file you generated in Problem 1 is represented as JSON, which stands for
JavaScript Object Notation. It is a simple format for representing nested structures of data --- lists of
lists of dictionaries of lists of .... you get the idea.
Each line of output.txt represents a streaming message. Most, but not all, will be tweets. (The
skeleton program will tell you how many lines are in the file.)
It is straightforward to convert a JSON string into a Python data structure; there is a library to do so
called json.
To use this library, add the following to the top of tweet_sentiment.py
import json
Then, to parse the data in output.txt, you want to apply the function json.loads to every line in
the file.
This function will parse the JSON data and return a Python data structure; in this case, it returns a
dictionary. If needed, take a moment to read the documentation for Python dictionaries.
You can read the Twitter documentation to understand what information each tweet contains and how
to access it, but it's not too difficult to deduce the structure by direct inspection.
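A minimal sketch of that parsing loop, assuming (by direct inspection of the stream) that messages lacking a "text" field are not tweets and can be skipped:

```python
import json

def tweet_texts(lines):
    """Yield the text of each tweet, skipping non-tweet messages."""
    for line in lines:
        data = json.loads(line)
        if "text" in data:  # messages without "text" (e.g. deletions) are not tweets
            yield data["text"]

sample = [
    '{"text": "hello world", "lang": "en"}',
    '{"delete": {"status": {"id": 1}}}',  # a non-tweet streaming message
]
print(list(tweet_texts(sample)))  # ['hello world']
```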
Your script should print to stdout the sentiment of each tweet in the file, one numeric sentiment score
per line. The first score should correspond to the first tweet, the second score to the second tweet, and
so on. If you sort the scores, they won't match up. If you sort the tweets, they won't match up. If you put
the tweets into a dictionary, the order will not be preserved. Once again: the nth line of the file you
submit should contain only a single number that represents the score of the nth tweet.
Hint: This is real-world data, and it can be messy! Refer to the twitter documentation to understand
more about the data structure you are working with. Don't get discouraged, and ask for help on the
forums if you get stuck!
tweet_sentiment.py
Your script should print output to stdout. Each line of output should contain a term, followed by a
space, followed by the sentiment. That is, each line should be in the format <term:string>
<sentiment:float>
For example, if you have the pair ("foo", 103.256) in Python, it should appear in the output as:
foo 103.256
term_sentiment.py
How we will grade Part 3: We will run your script on a file that contains strongly positive and strongly
negative tweets and verify that the non-sentiment-carrying terms in the strongly positive tweets are
assigned a higher score than the non-sentiment-carrying terms in negative tweets. Your scores need
not (and likely will not) exactly match any specific solution.
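One scheme consistent with that grading description (a sketch only; the assignment leaves the exact method up to you, and derive_term_scores is our own name) is to assign each non-AFINN term the average sentiment of the tweets in which it appears:

```python
from collections import defaultdict

def derive_term_scores(tweets, afinn):
    """Give each term not in the AFINN dictionary the average
    sentiment of the tweets that contain it.  `tweets` is a list
    of token lists; `afinn` maps known terms to integer scores."""
    totals = defaultdict(float)  # sum of tweet sentiments per term
    counts = defaultdict(int)    # number of tweets containing the term
    for tokens in tweets:
        sentiment = sum(afinn.get(t, 0) for t in tokens)
        for term in set(tokens):
            if term not in afinn:
                totals[term] += sentiment
                counts[term] += 1
    return {term: totals[term] / counts[term] for term in totals}

afinn = {"great": 3, "awful": -3}
tweets = [["great", "coffee"], ["awful", "coffee"], ["great", "tea"]]
print(derive_term_scores(tweets, afinn))  # {'coffee': 0.0, 'tea': 3.0}
```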
If the grader returns "Formatting error:", make note of the line of text returned in the message.
This line corresponds to a line of your output. The grader will generate this error if line.split()
does not return exactly two items. One common source of this error is forgetting to remove the two calls
to the "lines" function in the solution template; this function prints the number of lines in each file. Make
sure to check the first two lines of your output!
Your script will be run from the command line like this:
$ python frequency.py <tweet_file>
You should assume the tweet file contains data formatted the same way as the livestream data.
Your script should print output to stdout. Each line of output should contain a term, followed by a
space, followed by the frequency of that term in the entire file. There should be one line per unique
term in the entire file. Even if 25 tweets contain the word lol, the term lol should appear only once
in your output (and its frequency will be at least 25!). Each line should be in the format <term:string>
<frequency:float>
For example, if you have the pair (bar, 0.1245) in Python it should appear in the output as:
bar 0.1245
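A sketch of one way to compute these frequencies; as before, whitespace tokenization and the helper name term_frequencies are our own simplifying choices:

```python
from collections import Counter

def term_frequencies(tweet_texts):
    """Map each term to (occurrences of the term) / (total terms)."""
    counts = Counter()
    for text in tweet_texts:
        counts.update(text.split())
    total = sum(counts.values())
    return {term: count / float(total) for term, count in counts.items()}

freqs = term_frequencies(["lol that was fun", "lol indeed"])
for term, freq in sorted(freqs.items()):
    print("%s %f" % (term, freq))
# fun 0.166667
# indeed 0.166667
# lol 0.333333
# that 0.166667
# was 0.166667
```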
If you wish, you may consider a term to be a multi-word phrase, but this is not required.
frequency.py
Your script should print the happiest state, as determined by the average sentiment of its tweets, to stdout.
Note that you may need a lot of tweets in order to get enough tweets with location data. Let the live
stream run for a while if you wish.
Your script will not have access to the Internet, so you cannot rely on third party services to
resolve geocoded locations!
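One workable offline approach (a sketch; the place field is from the Twitter documentation, but which fields to trust, and filling out the full fifty-state table, are decisions left to you) is to match the tweet's place.full_name against a hardcoded table of state names and abbreviations:

```python
# A hardcoded lookup table avoids any network geocoding service.
# Only three states are shown; a real solution needs all fifty.
STATES = {"California": "CA", "New York": "NY", "Washington": "WA"}

def tweet_state(tweet):
    """Return a two-letter state code for the tweet, or None.

    Relies on the streaming 'place' field; real data is messy, so
    you may also want to inspect user.location or coordinates.
    """
    place = tweet.get("place") or {}
    full_name = place.get("full_name", "")
    for name, abbrev in STATES.items():
        if name in full_name or full_name.endswith(", " + abbrev):
            return abbrev
    return None

print(tweet_state({"place": {"full_name": "Seattle, WA"}}))  # WA
print(tweet_state({"place": None}))                          # None
```

From there, accumulating each state's tweet sentiments and averaging them yields the happiest state.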
happiest_state.py
You should assume the tweet file contains data formatted the same way as the livestream data.
In the tweet file, each line is a Tweet object, as described in the twitter documentation. To find the
hashtags, you should not parse the text field; the hashtags have already been extracted by twitter.
Your script should print to stdout each hashtag-count pair, one per line. Each line of output should
contain a hashtag, followed by a space, followed by the frequency of that hashtag in the entire file.
There should be one line per unique hashtag in the entire file. Each line should be in the format
<hashtag:string> <frequency:float>
For example, if you have the pair (bar, 30) in Python it should appear in the output as:
bar 30
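A sketch of the hashtag count using the pre-extracted entities.hashtags field; the helper name and the use of collections.Counter are our own choices:

```python
import json
from collections import Counter

def top_hashtags(lines, n=10):
    """Count hashtags via entities.hashtags and return the n most
    frequent as (hashtag, count) pairs."""
    counts = Counter()
    for line in lines:
        tweet = json.loads(line)
        if "entities" in tweet:  # non-tweet messages carry no entities
            for tag in tweet["entities"].get("hashtags", []):
                counts[tag["text"]] += 1
    return counts.most_common(n)

sample = [
    '{"entities": {"hashtags": [{"text": "python"}, {"text": "data"}]}}',
    '{"entities": {"hashtags": [{"text": "python"}]}}',
    '{"delete": {"status": {"id": 1}}}',
]
for tag, count in top_hashtags(sample, 2):
    print("%s %d" % (tag, count))  # python 2, then data 1
```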
top_ten.py