0% found this document useful (0 votes)
40 views2 pages

Sentiment Analysis of IMDB Movie Reviews

Sentiment analysis uses natural language processing to interpret emotions in subjective text like reviews, surveys, and social media. This document discusses using sentiment analysis and a naive bayes classification model to predict if 50,000 IMDB movie reviews are positive or negative based on the text. It outlines the project setup which includes text normalization, removing noise and special characters, stemming, removing stop words, applying a bag of words model, and labeling the text for modeling.

Uploaded by

Mansoor Qaisrani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
40 views2 pages

Sentiment Analysis of IMDB Movie Reviews

Sentiment analysis uses natural language processing to interpret emotions in subjective text like reviews, surveys, and social media. This document discusses using sentiment analysis and a naive bayes classification model to predict if 50,000 IMDB movie reviews are positive or negative based on the text. It outlines the project setup which includes text normalization, removing noise and special characters, stemming, removing stop words, applying a bag of words model, and labeling the text for modeling.

Uploaded by

Mansoor Qaisrani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Sentiment Analysis of IMDB Movie Reviews

Sentiment analysis (or opinion mining) is a natural language


processing technique used to interpret and classify emotions in
subjective data. Sentiment analysis is often performed on textual data to
detect sentiment in emails, survey responses, social media data, and
beyond.
The most popular application areas of sentiment analysis are the
following:
 Social media monitoring.
 Customer support.
 Customer feedback.
 Brand monitoring and reputation management.
 Voice of customer (VoC)
 Voice of employee.
 Product analysis.
 Market research and competitive research.

Problem Statement:

In this, we have to predict the number of positive and negative reviews


based on sentiments by using naïve bayes classification model.

Dataset:

50k IMDB Reviews.

Attributes are Review(text) and Sentiment(Positive or Negative).

Following are the results with Logistic Regressions


Project Setup

Following steps are required for Project Setup

 Text Normalization
 Remove HTML and NOISE Text
 Removing special characters
 Text Stemming
 Removing Stop words
 Applying Bag of Words Model
 Labeling and Modeling the text

You might also like