
So let's take a look at how we can

develop a recommender system if we had features of each item, or


features of each movie. So here's the same data set that we
had previously with the four users having rated some but
not all of the five movies. What if we additionally have
features of the movies? So here I've added two features X1 and
X2, that tell us how much each of these is a romance movie, and
how much each of these is an action movie. So for example Love at Last
is a very romantic movie, so this feature takes on 0.9, but
it's not at all an action movie. So this feature takes on 0. But it turns out
Nonstop Car Chases has
just a little bit of romance in it. So it's 0.1, but it has a ton of action. So
that feature takes on the value of 1.0. So you recall that I had used the notation
nu to denote the number of users, which is 4 and m to denote
the number of movies which is 5. I'm going to also introduce n to denote
the number of features we have here. And so n=2, because we have two
features X1 and X2 for each movie. With these features we have for
example that the features for movie one, that is the movie Love at Last,
would be 0.9 and 0. And the features for the third movie Cute Puppies of Love would be
0.99 and 0. And let's start by taking a look at
how we might make predictions for Alice's movie ratings. So for user one, that is
Alice, let's say we predict the rating for movie i as w.X(i)+b. So this is just a
lot
like linear regression. For example if we end up choosing
the parameter w(1)=[5,0] and say b(1)=0, then the prediction for movie three where
the features are 0.99 and 0, which is just copied from here,
first feature 0.99, second feature 0. Our prediction would be w.X(3)+b=0.99 times 5
plus 0 times 0, which turns out to be equal to 4.95. And this rating seems
pretty plausible. It looks like Alice has given high ratings
to Love at Last and Romance Forever, to two highly romantic movies, but
given low ratings to the action movies, Nonstop Car Chases and Swords vs Karate. So
if we look at Cute Puppies of Love, well predicting that she might rate
that 4.95 seems quite plausible. And so these parameters w and b for Alice seem
like a reasonable model for predicting her movie ratings.
Let me just adjust the notation a little,
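To make that concrete, here is a minimal sketch of this prediction step in Python with NumPy, using the example's features for Cute Puppies of Love and the hypothetical parameters w(1)=[5,0] and b(1)=0 from above (the variable names are my own):

```python
import numpy as np

# Features [romance, action] for movie 3, Cute Puppies of Love
x3 = np.array([0.99, 0.0])

# Hypothetical parameters for user 1 (Alice), as in the example
w1 = np.array([5.0, 0.0])
b1 = 0.0

# Predicted rating: w . x + b, just like linear regression
rating = np.dot(w1, x3) + b1
print(rating)  # approximately 4.95
```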
because we have not just one user but multiple users, or
really nu equals 4 users. I'm going to add a superscript 1 here to
denote that this is the parameter w(1) for user 1 and
add a superscript 1 there as well. And similarly here and here as well,
so that we would actually have different parameters for
each of the 4 users in the dataset. And more generally in this
model we can for user j, not just user 1 now,
we can predict user j's rating for movie i as w(j).X(i)+b(j). So here the
parameters w(j) and b(j) are the parameters used
to predict user j's rating for movie i which is a function of X(i),
which is the features of movie i. And this is a lot like linear regression,
except that we're fitting a different linear regression model for
each of the 4 users in the dataset. So let's take a look at how we can
formulate the cost function for this algorithm. As a reminder, our notation
is that r(i,j)=1 if user j has rated movie i or
0 otherwise. And y(i,j)=rating given
by user j on movie i. And on the previous slide we defined w(j),
b(j) as the parameters for user j. And X(i) as the feature vector for
movie i. So the model we have is for user j and movie i predict the rating
to be w(j).X(i)+b(j). I'm going to introduce just
one new piece of notation, which is I'm going to use m(j) to denote
the number of movies rated by user j. So if the user has rated 4 movies,
then m(j) would be equal to 4. And if the user has rated 3 movies
then m(j) would be equal to 3. So what we'd like to do is to
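As an illustrative sketch of this notation in Python with NumPy (the ratings below are hypothetical stand-ins for the slide's table, not the exact values; np.nan marks movies a user hasn't rated):

```python
import numpy as np

# Hypothetical ratings matrix Y, movies x users; np.nan marks "not rated"
Y = np.array([
    [5.0,    5.0,    0.0,    0.0],     # Love at Last
    [5.0,    np.nan, np.nan, 0.0],     # Romance Forever
    [np.nan, 4.0,    0.0,    np.nan],  # Cute Puppies of Love
    [0.0,    0.0,    5.0,    4.0],     # Nonstop Car Chases
    [0.0,    0.0,    5.0,    np.nan],  # Swords vs Karate
])

# r(i,j) = 1 if user j has rated movie i, 0 otherwise
R = (~np.isnan(Y)).astype(int)

# m(j) = number of movies rated by user j: sum down each column of R
m = R.sum(axis=0)
print(m)  # one count per user
```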
learn the parameters w(j) and b(j), given the data that we have. That is given the
ratings a user
has given of a set of movies. So the algorithm we're going to use is
very similar to linear regression. So let's write out the cost function for
learning the parameters w(j) and b(j) for a given user j. And let's just focus on
one user, user j, for now. I'm going to use the mean
squared error criterion. So the cost will be the prediction,
which is w(j).X(i)+b(j) minus the actual rating
that the user had given. So minus y(i,j) squared. And we're trying to
choose parameters w and b to minimize the squared error
between the predicted rating and the actual rating that was observed. But the user
hasn't rated all the movies,
so if we're going to sum over this,
we're going to sum only over the values
of i where r(i,j)=1. So we're going to sum only over the movies
i that user j has actually rated. So that's what this denotes,
sum of all values of i where r(i,j)=1. Meaning that user j has
rated that movie i. And then finally we can take
the usual normalization 1 over 2m(j). And this is very much like
the cost function we have for linear regression with m or
really m(j) training examples. Where you're summing over the m(j) movies
for which you have a rating taking a squared error and
then normalizing by this 1 over 2m(j). And this is going to be a cost function J of
w(j), b(j). And if we minimize this as
a function of w(j) and b(j), then you should come up with a pretty
good choice of parameters w(j) and b(j) for making predictions for
user j's ratings. Let me add just one more
term to this cost function, which is the regularization
term to prevent overfitting. And so
here's our usual regularization parameter, lambda divided by 2m(j), and then times
the sum of the squared
values of the parameters w. And so
n is the number of numbers in X(i) and that's the same as the number
of numbers in w(j). If you were to minimize this cost
function J as a function of w and b, you should get a pretty
good set of parameters for predicting user j's ratings for
other movies. Now, before moving on, it turns out
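Here is one way this regularized per-user cost might be sketched in Python with NumPy; the function name and argument layout are my own, and lam stands in for the regularization parameter lambda:

```python
import numpy as np

def cost_for_user(w_j, b_j, X, y_j, r_j, lam):
    """Regularized cost J(w(j), b(j)) for a single user j.

    X   : (num_movies, n) matrix of movie features x(i)
    y_j : ratings user j gave (entries where r_j == 0 are ignored)
    r_j : indicator vector, r_j[i] = 1 if user j rated movie i
    lam : regularization parameter lambda
    """
    m_j = r_j.sum()                          # number of movies user j rated
    pred = X @ w_j + b_j                     # w(j) . x(i) + b(j) for every movie
    err = r_j * (pred - np.nan_to_num(y_j))  # zero out unrated movies
    squared_error = (err ** 2).sum() / (2 * m_j)
    reg = (lam / (2 * m_j)) * (w_j ** 2).sum()
    return squared_error + reg
```

Minimizing this over w_j and b_j gives the parameters for that one user; note that rescaling the cost by a constant such as 1 over 2m(j) does not change which w_j and b_j minimize it.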
that for recommender systems it would be convenient to actually eliminate
this division by m(j) term, since m(j) is just a constant
in this expression. And so, even if you take it out, you should end up with
the same value of w and b. Now let me take this cost function
down here to the bottom and copy it to the next slide. So we have that to learn
the parameters w(j), b(j) for user j. We would minimize this cost function
as a function of w(j) and b(j). But instead of focusing on a single user, let's
look at how we learn
the parameters for all of the users. To learn the parameters w(1), b(1), w(2),
b(2),...,w(nu), b(nu), we would take this cost function on
top and sum it over all the nu users. So we would have sum from
j=1 to nu of the same cost function that we
had written up above. And this becomes the cost for learning all the parameters for
all of the users. And if we use gradient descent or any other optimization
algorithm to
minimize this as a function of w(1), b(1) all the way through w(nu),
b(nu), then you have a pretty good set of parameters for predicting
movie ratings for all the users. And you may notice that this algorithm
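That training step might be sketched in Python with NumPy as follows; this is an illustrative implementation, not the course's code, with the 1 over 2m(j) normalization dropped as described above, and the function name, learning rate alpha, and iteration count are my own choices:

```python
import numpy as np

def fit_all_users(X, Y, R, lam=0.0, alpha=0.01, iters=5000):
    """Fit w(j), b(j) for every user by gradient descent on the
    cost summed over all nu users.

    X : (num_movies, n) movie features
    Y : (num_movies, num_users) ratings; unrated entries may be np.nan
    R : (num_movies, num_users) indicator, R[i, j] = 1 if rated
    """
    num_users = Y.shape[1]
    n = X.shape[1]
    W = np.zeros((num_users, n))   # row j holds w(j)
    b = np.zeros(num_users)        # entry j holds b(j)
    for _ in range(iters):
        pred = X @ W.T + b                   # (num_movies, num_users)
        err = R * (pred - np.nan_to_num(Y))  # only rated entries count
        grad_W = err.T @ X + lam * W         # d(cost)/d w(j), one row per user
        grad_b = err.sum(axis=0)             # d(cost)/d b(j), one entry per user
        W -= alpha * grad_W
        b -= alpha * grad_b
    return W, b
```

The learned parameters then predict user j's rating of movie i as W[j] @ X[i] + b[j], just as before.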
is a lot like linear regression, where w(j).X(i)+b(j) plays a role similar to
the output f(x) of linear regression. Only now we're training a different linear
regression model for each of the nu users. So that's how you can learn parameters
and
predict movie ratings, if you had access to these features X1 and
X2 that tell you how much each of the movies is a romance movie, and
how much each of the movies is an action movie.
But where do these features come from? And what if you
don't have access to such
features that give you enough detail about the movies with which
to make these predictions? In the next video, we'll look at
a modification of this algorithm that'll let you make predictions,
or really, make recommendations, even if you don't have, in advance,
features that describe the items or the movies in sufficient detail to
run the algorithm that we just saw. Let's go on and
take a look at that in the next video.
