Data Analytics Project

This document contains code to extract data from the Facebook and Twitter APIs and to build predictive models for diabetes classification. It includes code to:

1. Extract comments from a public Facebook page post using the Graph API.
2. Extract the most recent tweets matching a keyword search using the Twitter API.
3. Build logistic regression, SVM, random forest, and decision tree models on a diabetes dataset to classify patients and compare their performance.


Exercise 1: Extract Facebook data available on any public page, such as Amazon.

Code:
import requests
import json

access_token = 'EAADraRiwnasBALJgEL4vbyvv2DTJvAYjBlLfk1iO0xgL56Vf70mE1MYlvdv2A5RupQZBOctpcE8Qdu1COESmobBxTwC6DFTOrbaXCRWcBzsZB6wlZBuzFSx5AgvXZAfLnp9etZBBTHwCL9U5klw4Q9sBFpmfVAEiJCZBFMD2CXCXyS5sPepoEqCDfY32DeUUoZD'

post_id = "1973749942700563"

URL = 'https://graph.facebook.com/v3.2/' + post_id + '/comments'

PARAMS = {'access_token': access_token}

# sending GET request and saving the response as a response object
r = requests.get(url=URL, params=PARAMS)

# extracting data in JSON format
data = r.json()

# print id, creation time, and message for each comment on the post
for comment in data['data']:
    print "----------------------------------------------------------\n\n"
    print "id:", comment['id'], "\n", "created_time:", comment['created_time'], "\n", "message:", comment['message']
    print "----------------------------------------------------------\n\n"
Output:
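
The request above returns only the first page of comments. As a rough extension of the exercise (not part of the original code), the sketch below follows the paging.next link that the Graph API includes in each response to walk through the remaining pages; it assumes the same access_token and post_id as above and that the token is still valid for reading the post.

# Hedged sketch: paginate through all comments by following 'paging.next'.
# Assumes access_token and post_id are defined exactly as in the code above.
import requests

url = 'https://graph.facebook.com/v3.2/' + post_id + '/comments'
params = {'access_token': access_token}

all_comments = []
while url:
    resp = requests.get(url=url, params=params).json()
    all_comments.extend(resp.get('data', []))
    # 'paging.next' already embeds the token and cursor, so drop the params
    url = resp.get('paging', {}).get('next')
    params = {}

print "total comments fetched:", len(all_comments)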
Exercise 2: Extract the 1000 latest tweets from Twitter using any keyword.

Code:
from twitter import Twitter, OAuth, TwitterStream
import json

ACCESS_TOKEN = '1064460892694241281-yHNHebYDMQgaoEjLD8BrcyVpDzIeGf'
ACCESS_SECRET = 'DQFUjh3TklipgH9dN6cGIlCW6KPXok2Q3oiN6HNJARxRM'
CONSUMER_KEY = '4mGaUsqkD2EyHkagZpHKOpBXF'
CONSUMER_SECRET = '5xI9anpq1O0F5CT5Gj5TC9tzz3s4pDfjxmtLIse88clK9E4REy'

oauth = OAuth(ACCESS_TOKEN, ACCESS_SECRET, CONSUMER_KEY, CONSUMER_SECRET)
twitter = Twitter(auth=oauth)
#print twitter.GetFriends()

# search the most recent English tweets matching the keyword
twt = twitter.search.tweets(q='machine learning', result_type='recent',
                            lang='en', count=5)

i = 0
for tweet in twt['statuses']:
    print "Tweet_count: ", i
    print "id:", tweet['id'], "\n", "text:", tweet['text'], "\n\n"
    i = i + 1
Output:
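
The exercise asks for 1000 tweets, but the call above requests only 5 per search. As a rough sketch (not part of the original code), the snippet below pages backwards through results with the max_id cursor until up to 1000 tweets are collected; it assumes the same `twitter` client as above and that the v1.1 search/tweets endpoint (at most 100 results per request) is still accessible with these credentials.

# Hedged sketch: collect up to 1000 recent tweets by paging with max_id.
# Assumes the `twitter` client from the code above is already authenticated.
collected = []
max_id = None

while len(collected) < 1000:
    kwargs = {'q': 'machine learning', 'result_type': 'recent',
              'lang': 'en', 'count': 100}
    if max_id is not None:
        kwargs['max_id'] = max_id
    batch = twitter.search.tweets(**kwargs)['statuses']
    if not batch:
        break  # no more tweets available for this query
    collected.extend(batch)
    # next page: everything strictly older than the oldest tweet seen so far
    max_id = min(t['id'] for t in batch) - 1

print "collected tweets:", len(collected)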
Exercise 3: Design a predictive model for diabetes on the given dataset of 535 patients using the following machine learning techniques:
1. Logistic Regression
2. SVM
3. Random Forest
4. Decision Tree

Code:
from sklearn import svm
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

import numpy

# load the dataset: first 9 columns are features, last column is the class label
A = numpy.loadtxt(open("data.csv", "rb"), delimiter=",", skiprows=1)

X_features = A[:, :9]
y_targets = A[:, 9:]

X_train, X_test, y_train, y_test = train_test_split(X_features, y_targets,
                                                    test_size=0.4, random_state=0)

print "Support Vector Machine:"
svm_model = svm.SVC(kernel='linear', C=1).fit(X_train, y_train.ravel())
print "Score: ", svm_model.score(X_test, y_test.ravel())

print "-----------------------------------------------------------------------"
print "Decision Tree:"
max_score = ()
max_val = 0
# try odd max_depth values from 1 to 99 and keep the best test score
for i in range(1, 100, 2):
    dtree_model = DecisionTreeClassifier(max_depth=i).fit(X_train, y_train.ravel())
    curr_score = dtree_model.score(X_test, y_test.ravel())
    if max_val < curr_score:
        max_val = curr_score
        max_score = (i, curr_score)
print "max_score: ", max_score[1], "max_depth: ", max_score[0]

print "-----------------------------------------------------------------------"
print "Random Forest:"
rf_model = RandomForestClassifier(n_estimators=100, n_jobs=-1).fit(X_train, y_train.ravel())
rf_score = rf_model.score(X_test, y_test.ravel())
print "Score: ", rf_score

print "-----------------------------------------------------------------------"
print "Logistic Regression:"
lr_model = LogisticRegression(penalty='l1', C=50).fit(X_train, y_train.ravel())
print "Score: ", lr_model.score(X_test, y_test.ravel())


Output:
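
A single 60/40 train/test split can make the comparison between the four models noisy. As a rough extension of the exercise (not part of the original code), the sketch below scores the same four classifiers with 5-fold cross-validation on the full dataset; it assumes X_features and y_targets are loaded from data.csv exactly as above, and the decision tree's max_depth=5 is only an illustrative choice rather than the depth found by the search loop.

# Hedged sketch: compare the four classifiers with 5-fold cross-validation
# instead of a single train/test split. Assumes X_features and y_targets are
# loaded from data.csv as in the code above.
from sklearn import svm
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

models = [
    # solver='liblinear' is stated explicitly so the l1 penalty is supported
    ("Logistic Regression", LogisticRegression(penalty='l1', C=50, solver='liblinear')),
    ("SVM (linear)", svm.SVC(kernel='linear', C=1)),
    ("Random Forest", RandomForestClassifier(n_estimators=100, n_jobs=-1)),
    # max_depth=5 is an illustrative value, not the tuned depth from above
    ("Decision Tree", DecisionTreeClassifier(max_depth=5)),
]

for name, model in models:
    scores = cross_val_score(model, X_features, y_targets.ravel(), cv=5)
    print name, "mean accuracy:", scores.mean(), "+/-", scores.std()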
