0% found this document useful (0 votes)

50 views

Assignment 3 AI

The document describes how to build a movie recommendation system using artificial intelligence techniques. It involves 5 steps: 1) creating a data file of user ratings, 2) computing the Euclidean distance score between users, 3) computing the Pearson correlation score, 4) finding similar users based on their Pearson scores, and 5) generating movie recommendations for a given user based on the ratings of similar users. Code examples are provided for each step to demonstrate how to calculate the scores and recommendations programmatically.

Uploaded by

Imraan Imraan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

50 views

Assignment 3 AI

Uploaded by

Imraan Imraan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

ARTIFICIAL INTELLIGENCE

Assignment # 3

Submitted by:

M Waqas Arif

70067876

Section T

DECEMBER 15, 2020

Artificial Intelligence

Generating movie recommendations

Let's see how to build it.

1- Creating data file for users by the name of movie_ratings.json:

{
"John Carson":
{
"Inception": 2.5,
"Pulp Fiction": 3.5,
"Anger Management": 3.0,
"Fracture": 3.5,
"Serendipity": 2.5,
"Jerry Maguire": 3.0
},
"Michelle Peterson":
{
"Inception": 3.0,
"Pulp Fiction": 3.5,
"Anger Management": 1.5,
"Fracture": 5.0,
"Jerry Maguire": 3.0,
"Serendipity": 3.5
},
"William Reynolds":
{
"Inception": 2.5,
"Pulp Fiction": 3.0,
"Fracture": 3.5,
"Jerry Maguire": 4.0
},
"Jillian Hobart":
{
"Pulp Fiction": 3.5,
"Anger Management": 3.0,
"Jerry Maguire": 4.5,
"Fracture": 4.0,
"Serendipity": 2.5
},
"Melissa Jones":
{
"Inception": 3.0,
"Pulp Fiction": 4.0,
"Anger Management": 2.0,
"Fracture": 3.0,
"Jerry Maguire": 3.0,
"Serendipity": 2.0
},
"Alex Roberts":
{
"Inception": 3.0,
"Pulp Fiction": 4.0,
"Jerry Maguire": 3.0,
"Fracture": 5.0,
"Serendipity": 3.5
},
"Michael Henry":
{
"Pulp Fiction": 4.5,
"Serendipity": 1.0,
"Fracture": 4.0
}
}

2- Computing the Euclidean distance score

Code :
import json
import numpy as np
# Returns the Euclidean distance score between user1 and user2
def euclidean_score(dataset, user1, user2):
if user1 not in dataset:
raise TypeError('User ' + user1 + ' not present in the dataset')

if user2 not in dataset:

raise TypeError('User ' + user2 + ' not present in the dataset')

# Movies rated by both user1 and user2

rated_by_both = {}

for item in dataset[user1]:

if item in dataset[user2]:
rated_by_both[item] = 1
# If there are no common movies, the score is 0
if len(rated_by_both) == 0:
return 0
squared_differences = []

for item in dataset[user1]:

if item in dataset[user2]:
squared_differences.append(np.square(dataset[user1][item] - dataset[user2][item]))

return 1 / (1 + np.sqrt(np.sum(squared_differences)))

if __name__=='__main__':
data_file = 'movie_ratings.json'
with open(data_file, 'r') as f:
data = json.loads(f.read())
user1 = 'John Carson'
user2 = 'Michelle Peterson'

print ("\nEuclidean score:")

print (euclidean_score(data, user1, user2) )

Output:

3- Computing the Pearson correlation score

Code :
import json
import numpy as np
# Returns the Pearson correlation score between user1 and user2
def pearson_score(dataset, user1, user2):
if user1 not in dataset:
raise TypeError('User ' + user1 + ' not present in the dataset')

if user2 not in dataset:

raise TypeError('User ' + user2 + ' not present in the dataset')
# Movies rated by both user1 and user2
rated_by_both = {}

for item in dataset[user1]:

if item in dataset[user2]:
rated_by_both[item] = 1

num_ratings = len(rated_by_both)
# If there are no common movies, the score is 0
if num_ratings == 0:
return 0
# Compute the sum of ratings of all the common preferences
user1_sum = np.sum([dataset[user1][item] for item in rated_by_both])
user2_sum = np.sum([dataset[user2][item] for item in rated_by_both])
# Compute the sum of squared ratings of all the common preferences
user1_squared_sum = np.sum([np.square(dataset[user1][item]) for item in rated_by_both])
user2_squared_sum = np.sum([np.square(dataset[user2][item]) for item in rated_by_both])
# Compute the sum of products of the common ratings
product_sum = np.sum([dataset[user1][item] * dataset[user2][item] for item in
rated_by_both])
# Compute the Pearson correlation
Sxy = product_sum - (user1_sum * user2_sum / num_ratings)
Sxx = user1_squared_sum - np.square(user1_sum) / num_ratings
Syy = user2_squared_sum - np.square(user2_sum) / num_ratings
if Sxx * Syy == 0:
return 0
return Sxy / np.sqrt(Sxx * Syy)
if __name__=='__main__':
data_file = 'movie_ratings.json'

with open(data_file, 'r') as f:

data = json.loads(f.read())

user1 = 'John Carson'

user2 = 'Michelle Peterson'

print ("\nPearson score:")

print (pearson_score(data, user1, user2))

Output:

4- Finding similar users in the dataset

Code:
import json
import numpy as np

from pearson_score import pearson_score

# Finds a specified number of users who are similar to the input user
def find_similar_users(dataset, user, num_users):
if user not in dataset:
raise TypeError('User ' + user + ' not present in the dataset')

# Compute Pearson scores for all the users

scores = np.array([[x, pearson_score(dataset, user, x)] for x in dataset if user != x])
# Sort the scores based on second column
scores_sorted = np.argsort(scores[:, 1])

# Sort the scores in decreasing order (highest score first)

scored_sorted_dec = scores_sorted[::-1]
# Extract top 'k' indices
top_k = scored_sorted_dec[0:num_users]

return scores[top_k]
if __name__=='__main__':
data_file = 'movie_ratings.json'

with open(data_file, 'r') as f:

data = json.loads(f.read())
user = 'John Carson'
print ("\nUsers similar to " + user + ":\n")
similar_users = find_similar_users(data, user, 3)
print ("User\t\t\tSimilarity score\n")
for item in similar_users:
print (item[0], '\t\t', round(float(item[1]), 2))
Output:

5. Generating movie recommendations

Code:
import json
import numpy as np
from pearson_score import pearson_score
# Generate recommendations for a given user
def generate_recommendations(dataset, user):
if user not in dataset:
raise TypeError('User ' + user + ' not present in the dataset')
total_scores = {}
similarity_sums = {}
similarity_score=''
u=''
for u in [x for x in dataset if x != user]:
similarity_score = pearson_score(dataset, user, u)
if similarity_score <= 0:
continue
for item in [x for x in dataset[u] if x not in
dataset[user] or dataset[user][x] == 0]:
total_scores.update({item: dataset[u][item] * similarity_score})
similarity_sums.update({item: similarity_score})
if len(total_scores) == 0:
return ['No recommendations possible']
# Create the normalized list
movie_ranks = np.array([[total/similarity_sums[item], item]
for item, total in total_scores.items()])
# Sort in decreasing order based on the first column
movie_ranks = movie_ranks[np.argsort(movie_ranks[:,0])[::-1]]
# Extract the recommended movies
recommendations = [movie for _, movie in movie_ranks]
return recommendations

data=''
if __name__=='__main__':
data_file = 'movie_ratings.json'
with open(data_file, 'r') as f:
data=json.loads(f.read())

user = 'Michael Henry'

print("\nRecommendations for " + user + ":")
movies=generate_recommendations(data, user)
for i, movie in enumerate(movies):
print(str(i+1) + '. ' + movie)
user = 'John Carson'
print("\nRecommendations for " + user + ":")
movies = generate_recommendations(data, user)
for i, movie in enumerate(movies):
print(str(i+1) + '. ' + movie)
Output:

Overall output:

EST (FLX) License Error Codes PDF
0% (3)
EST (FLX) License Error Codes PDF
8 pages
Assignment 3 RecSys Solution
No ratings yet
Assignment 3 RecSys Solution
2 pages
Codes U1 Text
0% (1)
Codes U1 Text
5 pages
Assignment 03:: Association Rule Mining
No ratings yet
Assignment 03:: Association Rule Mining
3 pages
CANoe Manual en
No ratings yet
CANoe Manual en
209 pages
CODE_RECOMMENDER SYSTEM
No ratings yet
CODE_RECOMMENDER SYSTEM
8 pages
Exp 2_3a10397ea76773097770b923fd29524b
No ratings yet
Exp 2_3a10397ea76773097770b923fd29524b
14 pages
Dl Project
No ratings yet
Dl Project
9 pages
Movie Recommendation System
No ratings yet
Movie Recommendation System
28 pages
Recommendation System in Python
No ratings yet
Recommendation System in Python
6 pages
Movie Recommendation System
No ratings yet
Movie Recommendation System
22 pages
INN AAT REPORT
No ratings yet
INN AAT REPORT
10 pages
AIML Mod4 Loki
No ratings yet
AIML Mod4 Loki
11 pages
from surprise import SVD
No ratings yet
from surprise import SVD
2 pages
Movie Recommendation System KNN (ML-Usecase)
No ratings yet
Movie Recommendation System KNN (ML-Usecase)
7 pages
F24_Proj4
No ratings yet
F24_Proj4
6 pages
PRJ Movie Recommendation Data Science..
No ratings yet
PRJ Movie Recommendation Data Science..
7 pages
Assignment 5zeerak
No ratings yet
Assignment 5zeerak
6 pages
Chapter 9 - Recommendation Systems
No ratings yet
Chapter 9 - Recommendation Systems
12 pages
Movie Rec
No ratings yet
Movie Rec
13 pages
Neel
No ratings yet
Neel
12 pages
NEEL (1)_edited
No ratings yet
NEEL (1)_edited
12 pages
Niranjan, Karthik, Rahul - Rajkumar R - Scope: Movie Recommendation System Based On Script Analysis, Cosine Similarity
No ratings yet
Niranjan, Karthik, Rahul - Rajkumar R - Scope: Movie Recommendation System Based On Script Analysis, Cosine Similarity
1 page
NEEL (1) Edited Edited
No ratings yet
NEEL (1) Edited Edited
12 pages
Recommendation System
No ratings yet
Recommendation System
11 pages
KNN Reccomendation
No ratings yet
KNN Reccomendation
7 pages
Ex5.docx
No ratings yet
Ex5.docx
4 pages
NEEL (1)
No ratings yet
NEEL (1)
12 pages
Recommender System Unit Ii
No ratings yet
Recommender System Unit Ii
14 pages
16 Recommender Systems PDF
No ratings yet
16 Recommender Systems PDF
6 pages
L6 Recommendation
No ratings yet
L6 Recommendation
56 pages
Title: Movie Recommendation System Documentation: 1. Demographic Filtering
No ratings yet
Title: Movie Recommendation System Documentation: 1. Demographic Filtering
4 pages
Movie Embeddings: I, J 1 2 I N N Ij I J Ij 2
No ratings yet
Movie Embeddings: I, J 1 2 I N N Ij I J Ij 2
3 pages
Ex3
No ratings yet
Ex3
2 pages
Assignment 5
No ratings yet
Assignment 5
6 pages
ML Project Movie Recommendation System
No ratings yet
ML Project Movie Recommendation System
2 pages
CCS360 Lab Record
No ratings yet
CCS360 Lab Record
28 pages
math551lab9
No ratings yet
math551lab9
5 pages
Recommender System
No ratings yet
Recommender System
45 pages
RS2
No ratings yet
RS2
16 pages
Data Mining Portfolio
No ratings yet
Data Mining Portfolio
19 pages
System Design
No ratings yet
System Design
25 pages
ML
No ratings yet
ML
8 pages
Social Suggest Team Report
No ratings yet
Social Suggest Team Report
52 pages
Lecture7.2 After Large
No ratings yet
Lecture7.2 After Large
19 pages
Music Reccomendation System
No ratings yet
Music Reccomendation System
32 pages
Movie_Recommendation_System_project[1]
No ratings yet
Movie_Recommendation_System_project[1]
9 pages
To Students Data Mining Part-2 Sept 13_240913_160930
No ratings yet
To Students Data Mining Part-2 Sept 13_240913_160930
5 pages
Lecture9 Recommender Systems V0
No ratings yet
Lecture9 Recommender Systems V0
52 pages
Report Final-MovieLens
No ratings yet
Report Final-MovieLens
47 pages
hw1 Instructions Light Mode
No ratings yet
hw1 Instructions Light Mode
4 pages
Movie Recommendation System in R Jupyter Notebook
No ratings yet
Movie Recommendation System in R Jupyter Notebook
18 pages
Vertopal.com IMDb+Movie+Assignment Stub
No ratings yet
Vertopal.com IMDb+Movie+Assignment Stub
9 pages
Practical File: Deep Learning
No ratings yet
Practical File: Deep Learning
33 pages
1st Harvard Project
No ratings yet
1st Harvard Project
17 pages
CMSC422 Project Presentation
No ratings yet
CMSC422 Project Presentation
17 pages
9,12,19,68 - ML Assignment-2
No ratings yet
9,12,19,68 - ML Assignment-2
5 pages
Survey On Cinematics Recommendation System
No ratings yet
Survey On Cinematics Recommendation System
10 pages
Assignment 2 Oops
No ratings yet
Assignment 2 Oops
10 pages
Project Movielense Solution
No ratings yet
Project Movielense Solution
4 pages
Machine Learning LAB MANUAL
No ratings yet
Machine Learning LAB MANUAL
23 pages
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Nhận biết 2
No ratings yet
Nhận biết 2
4 pages
Cisco Unified Wireless Network Base Security Feaures
No ratings yet
Cisco Unified Wireless Network Base Security Feaures
70 pages
Google Story
No ratings yet
Google Story
31 pages
Convex Optimization - Introduction (S.l. Dr. Ing. Carmen Voicu)
No ratings yet
Convex Optimization - Introduction (S.l. Dr. Ing. Carmen Voicu)
32 pages
Dicas de Packet Trcer
No ratings yet
Dicas de Packet Trcer
5 pages
Simulating With SIMPLIS - Rev 0.21
No ratings yet
Simulating With SIMPLIS - Rev 0.21
93 pages
Stress Engineer Sample
No ratings yet
Stress Engineer Sample
11 pages
The Following Are The Steps On How To Edit Wiki Content in A Safe Way
100% (1)
The Following Are The Steps On How To Edit Wiki Content in A Safe Way
4 pages
Strip Line
No ratings yet
Strip Line
12 pages
Visual Studio Keyboard Shortcuts by Microsoft Learn
No ratings yet
Visual Studio Keyboard Shortcuts by Microsoft Learn
62 pages
Strategic CFO 360_ How New Technologies Are Innovating Finance
No ratings yet
Strategic CFO 360_ How New Technologies Are Innovating Finance
14 pages
TC 20KL03
No ratings yet
TC 20KL03
26 pages
Ssss
No ratings yet
Ssss
2 pages
CSC CC 3rd Ed Revised - Final
No ratings yet
CSC CC 3rd Ed Revised - Final
247 pages
Synology NAS Systems - Optimized Business Solutions
100% (1)
Synology NAS Systems - Optimized Business Solutions
22 pages
Samsung sf6800 Service Manual
No ratings yet
Samsung sf6800 Service Manual
118 pages
Basic Simulation Lab Manual
No ratings yet
Basic Simulation Lab Manual
90 pages
SAP Scheduling
No ratings yet
SAP Scheduling
6 pages
Overleaf Keyboard Shortcuts: Updated March 12, 2020
No ratings yet
Overleaf Keyboard Shortcuts: Updated March 12, 2020
2 pages
840Dsl_TCU30_3_equip_man_0323_en-US
No ratings yet
840Dsl_TCU30_3_equip_man_0323_en-US
92 pages
V-2 PBA SOLUTION Computer Science HSSC-II
No ratings yet
V-2 PBA SOLUTION Computer Science HSSC-II
4 pages
Input - Output Inc - 12-Volt - Batteries.and - Charge
No ratings yet
Input - Output Inc - 12-Volt - Batteries.and - Charge
2 pages
116 Dumitrache Petru
No ratings yet
116 Dumitrache Petru
4 pages
Ai Lab Final
No ratings yet
Ai Lab Final
52 pages
Salesforce Marketing Cloud Interview Questions Answer 1729537252
No ratings yet
Salesforce Marketing Cloud Interview Questions Answer 1729537252
7 pages
NCO, NSO, IMO & IEO 2015 - 2016 Class 3 First Level Sample Papers
50% (4)
NCO, NSO, IMO & IEO 2015 - 2016 Class 3 First Level Sample Papers
15 pages
Kelman Transport X : GE Grid Solutions
No ratings yet
Kelman Transport X : GE Grid Solutions
2 pages

Assignment 3 AI

Uploaded by

Assignment 3 AI

Uploaded by

ARTIFICIAL INTELLIGENCE

DECEMBER 15, 2020

Generating movie recommendations

1- Creating data file for users by the name of movie_ratings.json:

2- Computing the Euclidean distance score

if user2 not in dataset:

# Movies rated by both user1 and user2

for item in dataset[user1]:

for item in dataset[user1]:

print ("\nEuclidean score:")

3- Computing the Pearson correlation score

if user2 not in dataset:

for item in dataset[user1]:

with open(data_file, 'r') as f:

user1 = 'John Carson'

print ("\nPearson score:")

4- Finding similar users in the dataset

from pearson_score import pearson_score

# Compute Pearson scores for all the users

# Sort the scores in decreasing order (highest score first)

with open(data_file, 'r') as f:

5. Generating movie recommendations

user = 'Michael Henry'

You might also like