Experiment 7 ML

Uploaded by

Rishubh Gandhi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views3 pages

Experiment 7 ML

Uploaded by

Rishubh Gandhi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Experiment 7

Code:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
data = pd.read_csv('ratings_sample.csv')
data.head()
data['movie_id'] = data['movie_id'].str.replace('+', ' ')
data.describe()
data.info()
data.isnull().sum()
data = data.dropna()
data.isnull().sum()
data.info()
# Assign unique integer IDs to each distinct movie
data['movie_id'] = pd.factorize(data['movie_id'])[0]
data['production_companies'] = pd.factorize(data['production_companies'])[0]
data['production_countries'] = pd.factorize(data['production_countries'])[0]
import nltk
nltk.download('stopwords')
nltk.download('wordnet')
nltk.download('omw-1.4')
import re
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
# Initialize WordNet lemmatizer and stopwords
lemmatizer = WordNetLemmatizer()
stop_words = set(stopwords.words('english'))
# Function to preprocess text
def preprocess_text(text):
text = re.sub(r'<[^>]+>', '', text)
text = re.sub(r'[^a-zA-Z]', ' ', text)
text = text.lower()
words = word_tokenize(text)
words = [lemmatizer.lemmatize(word) for word in words if word not in stop_words]
processed_text = ' '.join(words)
return processed_text
# Apply preprocessing to the 'overview' column
data['overview'] = data['overview'].apply(preprocess_text)
from sklearn.feature_extraction.text import TfidfVectorizer
tfidf_vectorizer = TfidfVectorizer(max_features=1000)
overview_features = tfidf_vectorizer.fit_transform(data['overview'])
overview_features_array = overview_features.toarray()
# Split the genres into individual genres
genres_list = data['genres'].str.split(' ')
# Get unique genres
unique_genres = set(genre for sublist in genres_list for genre in sublist)
for genre in unique_genres:
data[genre] = data['genres'].str.contains(genre).astype(int)
data.drop('genres', axis=1, inplace=True)
data.info()

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
tfidf_vectorizer = TfidfVectorizer(max_features=1000)
overview_features = tfidf_vectorizer.fit_transform(data['overview'])
combined_features = overview_features
X_train, X_test, y_train, y_test = train_test_split(combined_features, data['rating'], test_size=0.2,
random_state=42)
lr_model = LinearRegression()
lr_model.fit(X_train, y_train)
y_pred = lr_model.predict(X_test)
mse = mean_squared_error(y_test, y_pred)
print("Mean Squared Error:", mse)
Output:

Working With Time - Lab Solutions Guide: Index Type Sourcetype Interesting Fields
No ratings yet
Working With Time - Lab Solutions Guide: Index Type Sourcetype Interesting Fields
10 pages
Cyberbullying Code
No ratings yet
Cyberbullying Code
6 pages
Natural Language Processing
No ratings yet
Natural Language Processing
22 pages
Shreya Srivastava-27
No ratings yet
Shreya Srivastava-27
3 pages
NLP Lab
No ratings yet
NLP Lab
18 pages
Natural Language Processing
No ratings yet
Natural Language Processing
5 pages
Sma Exp 10 Code Print
No ratings yet
Sma Exp 10 Code Print
7 pages
Code
No ratings yet
Code
13 pages
Importing The Libraries
No ratings yet
Importing The Libraries
3 pages
NLP Tushar
No ratings yet
NLP Tushar
21 pages
ASTW RA03 PracticalManual
No ratings yet
ASTW RA03 PracticalManual
18 pages
Q 3
No ratings yet
Q 3
2 pages
Ment Analysis Text Classification
No ratings yet
Ment Analysis Text Classification
9 pages
Report On - Social Media Research Topic Modeling
No ratings yet
Report On - Social Media Research Topic Modeling
26 pages
Text, Pos, Wor2vec
No ratings yet
Text, Pos, Wor2vec
3 pages
NLP Manual
No ratings yet
NLP Manual
21 pages
9 Feature Engineering Text Data
No ratings yet
9 Feature Engineering Text Data
7 pages
Aped For Fake News
No ratings yet
Aped For Fake News
6 pages
Python Project
No ratings yet
Python Project
2 pages
FALLSEM2024-25 BCSE332P LO VL2024250102168 2024-10-07 Reference-Material-I
No ratings yet
FALLSEM2024-25 BCSE332P LO VL2024250102168 2024-10-07 Reference-Material-I
18 pages
17 - Source Code - nlp-2-5
No ratings yet
17 - Source Code - nlp-2-5
4 pages
Sma 8
No ratings yet
Sma 8
7 pages
NLP Transformer-Based Models Used For Sentiment Analysis: 1. BERT
No ratings yet
NLP Transformer-Based Models Used For Sentiment Analysis: 1. BERT
98 pages
T SNE Visualization of Amazon Reviews With Polarity Based Color Coding+
No ratings yet
T SNE Visualization of Amazon Reviews With Polarity Based Color Coding+
29 pages
Python CA 4
No ratings yet
Python CA 4
9 pages
Experiment 3 Word2Vec Custom Vectors Generation and Performing Classification
No ratings yet
Experiment 3 Word2Vec Custom Vectors Generation and Performing Classification
4 pages
Sentiment Analysis
No ratings yet
Sentiment Analysis
4 pages
AIML IA3 Loki & SG
No ratings yet
AIML IA3 Loki & SG
31 pages
Code Text
No ratings yet
Code Text
4 pages
Sample
No ratings yet
Sample
6 pages
Trends Merged
No ratings yet
Trends Merged
10 pages
Amazon Food Review Notes
No ratings yet
Amazon Food Review Notes
37 pages
ML Program Output
No ratings yet
ML Program Output
22 pages
NLP Transformer-Based Models Used For Sentiment Analysis
No ratings yet
NLP Transformer-Based Models Used For Sentiment Analysis
45 pages
Extra Feature NLP
No ratings yet
Extra Feature NLP
5 pages
Assignment
No ratings yet
Assignment
6 pages
Amazon Product Review - Ipynb - Colaboratory
No ratings yet
Amazon Product Review - Ipynb - Colaboratory
7 pages
NLP Assignment2
No ratings yet
NLP Assignment2
7 pages
British Airways Forage Report
No ratings yet
British Airways Forage Report
12 pages
Final Presentation
No ratings yet
Final Presentation
18 pages
Assign 3
No ratings yet
Assign 3
1 page
Feature Extraction Techniques in NLP
No ratings yet
Feature Extraction Techniques in NLP
10 pages
AI Lab Report BIM
No ratings yet
AI Lab Report BIM
34 pages
Ir Lab 2 Ir Learning Outcomes: Pyterrier
No ratings yet
Ir Lab 2 Ir Learning Outcomes: Pyterrier
7 pages
Ds File
No ratings yet
Ds File
58 pages
Experiment 1
No ratings yet
Experiment 1
19 pages
Module 2 Feature Engineering and Text Representation
No ratings yet
Module 2 Feature Engineering and Text Representation
19 pages
Movie Recommend
No ratings yet
Movie Recommend
2 pages
Topic Classifierby David Caleb
No ratings yet
Topic Classifierby David Caleb
7 pages
Self Evaluation Exercises
No ratings yet
Self Evaluation Exercises
12 pages
Naive Bayes
No ratings yet
Naive Bayes
1 page
ML Week10.1
No ratings yet
ML Week10.1
5 pages
Code
No ratings yet
Code
18 pages
Yelp Explorers Report
No ratings yet
Yelp Explorers Report
10 pages
Basenlp
No ratings yet
Basenlp
5 pages
Mids Practical 3
No ratings yet
Mids Practical 3
2 pages
NLP Assignment 4 (22bce9560)
No ratings yet
NLP Assignment 4 (22bce9560)
12 pages
Intel Ai Project
No ratings yet
Intel Ai Project
7 pages
Print: Program 7
No ratings yet
Print: Program 7
3 pages
Group 4 MovieReview
No ratings yet
Group 4 MovieReview
10 pages
TensorFlow深度学习项目实战: Chinese Edition
From Everand
TensorFlow深度学习项目实战: Chinese Edition
Posts & Telecom Press
No ratings yet
VISI Machining 5axis
No ratings yet
VISI Machining 5axis
2 pages
2019 Aug TechTIPS-JUSTIFIED
No ratings yet
2019 Aug TechTIPS-JUSTIFIED
9 pages
Lab 8 (Flip Flops)
No ratings yet
Lab 8 (Flip Flops)
5 pages
Utilization of The AISC Steel Sculpture For An Introductory Construction Plan Reading Course
No ratings yet
Utilization of The AISC Steel Sculpture For An Introductory Construction Plan Reading Course
7 pages
WEG CFW500 Programming Manual 10006739425 Enaa
No ratings yet
WEG CFW500 Programming Manual 10006739425 Enaa
268 pages
BHEL Unit Implements ERP Package
No ratings yet
BHEL Unit Implements ERP Package
9 pages
Les Anecdotes de Florence PDF
100% (1)
Les Anecdotes de Florence PDF
378 pages
Kombidämpfer Genius Joker: Service Manual: Software/Troubleshooting
No ratings yet
Kombidämpfer Genius Joker: Service Manual: Software/Troubleshooting
39 pages
Challenging Revaluation REsult
No ratings yet
Challenging Revaluation REsult
4 pages
Management of Information Systems Assignment 1
No ratings yet
Management of Information Systems Assignment 1
6 pages
AccurioPress C2070 C2070P C2060 Catalog en PDF
No ratings yet
AccurioPress C2070 C2070P C2060 Catalog en PDF
16 pages
SSD 9971
No ratings yet
SSD 9971
4 pages
FM-2 Indexing Module Reference Manual
No ratings yet
FM-2 Indexing Module Reference Manual
290 pages
Ccs347 GD Unit1 QB
No ratings yet
Ccs347 GD Unit1 QB
1 page
Safety Function Module R911336576 - 03
No ratings yet
Safety Function Module R911336576 - 03
40 pages
Application Note: Revision 01
No ratings yet
Application Note: Revision 01
34 pages
PHP - Form Introduction: Dynamic Websites
No ratings yet
PHP - Form Introduction: Dynamic Websites
3 pages
C Bus 5750WPL GY Document
No ratings yet
C Bus 5750WPL GY Document
1 page
Draft National Telecom Policy 2011
No ratings yet
Draft National Telecom Policy 2011
27 pages
Mod 1 Lesson 1 Ict and Its Current State
No ratings yet
Mod 1 Lesson 1 Ict and Its Current State
71 pages
Business Communication Skills UNIT 1
No ratings yet
Business Communication Skills UNIT 1
23 pages
Here Is The Sample of Typing
No ratings yet
Here Is The Sample of Typing
9 pages
Answer Key
No ratings yet
Answer Key
2 pages
Wheatstone Bridge's Sensitivity, Resistors' Values Effect PDF
No ratings yet
Wheatstone Bridge's Sensitivity, Resistors' Values Effect PDF
6 pages
Project
No ratings yet
Project
2 pages
Spectralayers Pro 6: Version History
No ratings yet
Spectralayers Pro 6: Version History
4 pages
PDP Erik Conrath
No ratings yet
PDP Erik Conrath
8 pages
Collaborative Optimization of Dynamic Pricing and Seat Allocation For High-Speed Railways An Empirical Study From China
No ratings yet
Collaborative Optimization of Dynamic Pricing and Seat Allocation For High-Speed Railways An Empirical Study From China
11 pages
Visual Media Portfolio: Breanne Huber
No ratings yet
Visual Media Portfolio: Breanne Huber
18 pages

Experiment 7 ML

Uploaded by

Experiment 7 ML

Uploaded by

Experiment 7

from sklearn.model_selection import train_test_split

You might also like