DSDM Unit4
DSDM Unit4
The following are the steps to gather the data from the Twitter feeds:
Q)Define VADER
VADER ( Valence Aware Dictionary for Sentiment Reasoning) is a model used for text
sentiment analysis that is sensitive to both polarity (positive/negative) and intensity
(strength) of emotion. It is available in the NLTK package and can be applied directly to
unlabeled text data.
Q) What does the preparation of a custom classifier require ?
The preparation of a custom classifier requires two data sets:
Training data set: The data on which the classifier algorithm learns the model
parameters
Test data set: This is used to determine the accuracy of the algorithm
Q) What is a confusion matrix?
A confusion matrix is a technique for summarizing the performance of a classification
algorithm. It provides information of what the classification model is getting right and
what types of errors it is making. Predictions of the results on a classification problem
are usually visualized by the following matrix :
Model performance evaluation measures how well a trained model performs on unseen
data using metrics like accuracy, precision, recall, and F1-score. Cross-validation, on
the other hand, is a technique for assessing the generalizability of a model by
partitioning data into subsets, training on some while validating on others iteratively.
Q)What is the purpose of labelling data in sentimental analysis.(JAN 2024)(2
MARKS)
Labeling data in sentiment analysis serves to assign sentiment categories (e.g., positive, negative,
neutral) to text samples. This labeled data is essential for training machine learning models to
recognize and predict sentiments in new, unseen text. Accurate labeling provides a foundation for
supervised learning, enabling models to learn patterns and improve prediction accuracy
5/10 Marks