DSDM Unit4

Uploaded by

manekandan8214

2 Marks

Q) What are the steps to process Twitter data?

The following steps are used to gather and prepare data from Twitter feeds:

● Getting the data
● Data pull
● Data cleaning

Q) What are the steps to get Twitter API keys?

To access the Twitter API, you first need a Twitter account and a set of
credentials (consumer key, consumer secret, access token, and access secret)
obtained on the Twitter developer platform. Follow these steps:
○ Create a Twitter user account.
○ Log in with your Twitter user account at
○ Click Create New App.
○ Fill out the form, agree to the terms, and click on Create your Twitter
application.
○ Go to the next page, click on the Keys and Access Tokens tab, and copy
your API key and API secret. Scroll down and click on Create my access
token, and copy your Access token, and Access token secret.

Q) What is data extraction?

Data extraction is the retrieval of tweets through the Twitter API once all the
required authorization keys are in place. The Twitter API offers several ways to
extract data, but we focus on two main methods:
● Keyword search query, to obtain recent and historical tweets
● Streaming facility, to obtain tweets as they are posted

Q) What is rate limiting/paging?

After successfully connecting to the API, we have to prepare our scripts for data
retrieval. Because the API limits data access (rate limits), it is necessary to build an
efficient workflow. Twitter allows you to get up to 100 tweets per call. If we want to
retrieve more, we need to remember the IDs of the tweets already downloaded so that
the same tweets are not extracted again in later calls. This procedure is commonly
called paging.
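
The paging idea above can be sketched as follows. This is a minimal illustration, not the Twitter client itself: `fetch_page`, `fake_fetch_page`, `FAKE_TWEETS`, and the `max_id` argument are hypothetical names chosen for this example, and the "API" is an in-memory list so the loop can be run without credentials. The only detail taken from the text is the cap of 100 tweets per call.

```python
from typing import Callable, Dict, List, Optional

PAGE_SIZE = 100  # per-call cap mentioned in the text


def fetch_all(fetch_page: Callable[[Optional[int], int], List[Dict]],
              limit: int) -> List[Dict]:
    """Collect up to `limit` tweets, newest first, without duplicates."""
    collected: List[Dict] = []
    max_id: Optional[int] = None  # None means "start from the newest tweet"
    while len(collected) < limit:
        count = min(PAGE_SIZE, limit - len(collected))
        page = fetch_page(max_id, count)
        if not page:
            break  # nothing older is available
        collected.extend(page)
        # Remember the oldest ID seen; the next call asks only for older tweets.
        max_id = min(tweet["id"] for tweet in page) - 1
    return collected


# Hypothetical stand-in for the real API: 250 tweets with IDs 250 down to 1.
FAKE_TWEETS = [{"id": i, "text": f"tweet {i}"} for i in range(250, 0, -1)]


def fake_fetch_page(max_id: Optional[int], count: int) -> List[Dict]:
    """Return at most `count` tweets whose ID is <= max_id, newest first."""
    eligible = [t for t in FAKE_TWEETS if max_id is None or t["id"] <= max_id]
    return eligible[:count]


tweets = fetch_all(fake_fetch_page, limit=230)
print(len(tweets), tweets[0]["id"], tweets[-1]["id"])
```

Here three calls are made (100, 100, and 30 tweets), and tracking the oldest ID guarantees that no tweet is downloaded twice. A real script would also need to pause when the rate limit is reached.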

Q) What are Streaming API streams?

Another method of obtaining information from Twitter is the Streaming API. It gives
access to Twitter's global stream of data. There are several basic streaming
endpoints, each customized to certain use cases. Based on the Twitter documentation:
● Public streams: Streams of the public data flowing through Twitter. They are
suitable for following specific users or topics, and for data mining.
● User streams: Single-user streams, containing roughly all of the data
corresponding to a single user's view of Twitter.

Q) Define sentiment analysis

Sentiment analysis involves classifying comments or opinions in text into categories
such as "positive" or "negative", often with an implicit category of "neutral". A classic
sentiment application is tracking what people think about different topics.
In data science and machine learning, sentiment analysis is also called "opinion
mining"; in marketing terminology it is known as "voice of the customer".
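
To make the three categories concrete, here is a toy word-counting sketch. It is only an illustration under simple assumptions: the `POSITIVE` and `NEGATIVE` word lists and the `classify` function are invented for this example and are far smaller and cruder than any real sentiment lexicon or model.

```python
import re

# Tiny hand-made word lists (assumptions for illustration only).
POSITIVE = {"good", "great", "love", "excellent", "happy"}
NEGATIVE = {"bad", "terrible", "hate", "awful", "sad"}


def classify(text: str) -> str:
    """Label text as positive, negative, or neutral by counting known words."""
    words = re.findall(r"[a-z']+", text.lower())
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"  # no known words, or positives and negatives cancel out


for tweet in ["I love this phone, the camera is great",
              "Terrible battery, I hate it",
              "The phone arrived on Monday"]:
    print(classify(tweet), "|", tweet)
```

The "neutral" label appears here exactly as described above: it is the implicit category left over when neither polarity dominates.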

Q) Define VADER

VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon- and rule-based
model for text sentiment analysis that is sensitive to both the polarity
(positive/negative) and the intensity (strength) of emotion. It is available in the NLTK
package and can be applied directly to unlabeled text data.

Q) What does the preparation of a custom classifier require?

The preparation of a custom classifier requires two data sets:
● Training data set: The data on which the classifier algorithm learns the model
parameters.
● Test data set: The data used to determine the accuracy of the algorithm.
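
A minimal sketch of producing the two data sets from one labeled collection is shown below. The function name `train_test_split`, the 80/20 ratio, the fixed seed, and the sample tweets are all assumptions made for this illustration; the notes do not prescribe a particular split.

```python
import random


def train_test_split(samples, test_fraction=0.2, seed=42):
    """Shuffle the labeled samples and split them into (train, test) lists."""
    shuffled = list(samples)               # copy, so the input is left untouched
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split repeatable
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


# Hypothetical labeled tweets: (text, label) pairs.
labeled = [(f"tweet {i}", "positive" if i % 2 else "negative") for i in range(10)]

train, test = train_test_split(labeled)
print(len(train), "training samples,", len(test), "test samples")
```

Shuffling before splitting matters because collected tweets are often ordered by time or topic, and the test set should not contain any sample the classifier saw during training.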

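Q) What is a confusion matrix?

A confusion matrix is a technique for summarizing the performance of a classification
algorithm. It shows what the classification model is getting right and what types of
errors it is making. For a two-class problem, the predictions are usually laid out in
the following matrix:

                     Predicted positive     Predicted negative
Actual positive      True positive (TP)     False negative (FN)
Actual negative      False positive (FP)    True negative (TN)

Q) Define Precision and Recall

Precision is the fraction of items predicted as positive that are actually positive:
TP / (TP + FP). Recall is the fraction of actually positive items that the model
predicted as positive: TP / (TP + FN).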
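
As a worked illustration, the sketch below counts the four confusion-matrix cells for a two-class problem and derives precision and recall from them using the standard definitions, precision = TP / (TP + FP) and recall = TP / (TP + FN). The function names and the label lists are made up for this example.

```python
def confusion_counts(actual, predicted, positive="positive"):
    """Return (TP, FP, FN, TN) for a two-class problem."""
    tp = fp = fn = tn = 0
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            tp += 1  # predicted positive, really positive
        elif p == positive:
            fp += 1  # predicted positive, really negative
        elif a == positive:
            fn += 1  # predicted negative, really positive
        else:
            tn += 1  # predicted negative, really negative
    return tp, fp, fn, tn


def precision_recall(tp, fp, fn):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Made-up labels for eight tweets.
actual = ["positive", "positive", "positive", "negative",
          "negative", "negative", "positive", "negative"]
predicted = ["positive", "negative", "positive", "positive",
             "negative", "negative", "positive", "negative"]

tp, fp, fn, tn = confusion_counts(actual, predicted)
precision, recall = precision_recall(tp, fp, fn)
print(f"TP={tp} FP={fp} FN={fn} TN={tn}")
print(f"precision={precision:.2f} recall={recall:.2f}")
```

With these labels there are 3 true positives, 1 false positive, 1 false negative, and 3 true negatives, so precision and recall are both 0.75.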

Q) Define K-fold cross-validation

The input data is split into K parts. In each round, one part is reserved for testing
and the other K-1 parts are used for training. This process is repeated K times, so
that every part serves as the test set once, and the evaluation metrics are averaged.
This helps in determining how well a model would generalize to new datasets.
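
The splitting and averaging can be sketched as below. This is a minimal illustration in plain Python: `k_fold_indices`, `cross_validate`, and the stand-in `evaluate` callback are hypothetical names for this example, and no real classifier is trained.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for each of the K rounds."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size


def cross_validate(n_samples, k, evaluate):
    """Average the metric returned by evaluate(train_idx, test_idx) over K rounds."""
    scores = [evaluate(train, test) for train, test in k_fold_indices(n_samples, k)]
    return sum(scores) / len(scores)


folds = list(k_fold_indices(10, 5))
for train_idx, test_idx in folds:
    print("test:", test_idx, "train:", train_idx)

# Stand-in metric: the share of the data used for training in each round.
average = cross_validate(10, 5, lambda train, test: len(train) / 10)
print("average metric:", average)
```

In practice the data is usually shuffled first, and `evaluate` would train the classifier on the training indices and return its accuracy on the test indices.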

Q) Define NER

Named-entity recognition (NER) is a subtask of information extraction that seeks to
locate and classify named entities mentioned in unstructured text into predefined
categories such as person names, organizations, locations, medical codes, time
expressions, quantities, monetary values, percentages, etc.
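
The "locate and classify" idea can be illustrated with a toy rule-based sketch that covers only three of the categories above (percentages, monetary values, and clock times). The patterns, the labels, and the `find_entities` function are assumptions made for this example; recognizing person names, organizations, or locations generally requires a trained model rather than regular expressions.

```python
import re

# Hand-written patterns for three easy categories (illustrative only).
PATTERNS = [
    ("PERCENT", re.compile(r"\b\d+(?:\.\d+)?%")),
    ("MONEY", re.compile(r"[$€£]\d+(?:,\d{3})*(?:\.\d+)?")),
    ("TIME", re.compile(r"\b\d{1,2}:\d{2}\b")),
]


def find_entities(text):
    """Return (entity_text, label) pairs in the order they appear in the text."""
    found = []
    for label, pattern in PATTERNS:
        for match in pattern.finditer(text):
            found.append((match.start(), match.group(), label))
    return [(entity, label) for _, entity, label in sorted(found)]


sentence = "Shares rose 4.5% to $1,250.75 at 09:30 today."
for entity, label in find_entities(sentence):
    print(label, "->", entity)
```

Each match is both located (its position in the text) and classified (its label), which are the two halves of the NER task.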

5/10 Marks

Q) Explain the REST API search endpoint
Q) Explain rate limits and paging
Q) Explain the Streaming API
Q) Explain data pull and data extraction with an example
Q) Explain sentiment analysis
Q) Explain customized sentiment analysis
Q) Explain Named Entity Recognition
Q) Explain the process of combining NER and sentiment analysis
