
Recommender Systems
Lecture 2: Neighborhood-Based Collaborative Filtering
Imranuddin
Part 1
Neighborhood-based Collaborative
Filtering algorithms

• Also referred to as memory-based algorithms
• These were among the earliest algorithms developed for
Collaborative Filtering
• They are based on the observation that similar users display similar
patterns of rating behavior and similar items receive similar
ratings.
• They are of two types:
• User-Based Collaborative Filtering
• Item-Based Collaborative Filtering
Types of Collaborative Filtering

• 1. User-based collaborative filtering: In this case, the ratings provided by similar
users to a target user A are used to make recommendations for A. The predicted
ratings of A are computed as the weighted average values of these “peer group”
ratings for each item.
• 2. Item-based collaborative filtering: In order to make recommendations for target
item B, the first step is to determine a set S of items, which are most similar to
item B. Then, in order to predict the rating of any particular user A for item B, the
ratings in set S, which are specified by A, are determined. The weighted average
of these ratings is used to compute the predicted rating of user A for item B.

• Note: An important distinction between user-based collaborative filtering and item-based collaborative filtering algorithms is that
the ratings in the former case are predicted using the ratings of neighboring users, whereas the ratings in the latter case are
predicted using the user’s own ratings on neighboring (i.e., closely related) items. In the former case, neighborhoods are defined by
similarities among users (rows of ratings matrix), whereas in the latter case, neighborhoods are defined by similarities among items
(columns of ratings matrix).
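
As a rough sketch of this row/column distinction (not from the lecture), the snippet below stores a hypothetical ratings matrix R with NaN for missing entries and compares two users by their rows and two items by their columns; the placeholder similarity is just a dot product over mutually observed entries, standing in for the Pearson measure defined later.

```python
import math

NAN = float("nan")
# Hypothetical 3x3 ratings matrix R: rows are users, columns are items.
R = [[5.0, 3.0, NAN],
     [4.0, NAN, 2.0],
     [NAN, 3.0, 1.0]]

def observed_dot(x, y):
    # Placeholder similarity: dot product over entries observed in both vectors.
    return sum(a * b for a, b in zip(x, y)
               if not math.isnan(a) and not math.isnan(b))

rows = R                          # user-based CF compares rows of R
cols = list(map(list, zip(*R)))   # item-based CF compares columns of R
print("sim(user 0, user 1):", observed_dot(rows[0], rows[1]))
print("sim(item 0, item 1):", observed_dot(cols[0], cols[1]))
```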
Collaborative Filtering Problem
Formulation
• We assume that the user-item ratings matrix is an incomplete m × n matrix R = [ruj ]
containing m users and n items. It is assumed that only a small subset of the ratings matrix
is specified or observed. Neighborhood-based collaborative filtering algorithms can be
formulated in one of two ways:
• 1. Predicting the rating value of a user-item combination: This is the simplest and most
primitive formulation of a recommender system. In this case, the missing rating ruj of the
user u for item j is predicted.
• 2. Determining the top-k items or top-k users: In most practical settings, the merchant is not
necessarily looking for specific ratings values of user-item combinations. Rather, it is more
interesting to learn the top-k most relevant items for a particular user, or the top-k most
relevant users for a particular item. The problem of determining the top-k items is more
common than that of finding the top-k users. This is because the former formulation is used
to present lists of recommended items to users. In traditional recommender algorithms, the
“top-k problem” almost always refers to the process of finding the top-k items, rather than
the top-k users. However, the latter formulation is also useful to the merchant because it can
be used to determine the best users to target with marketing efforts.
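
A minimal sketch of the top-k items formulation, assuming a predict(user, item) function (such as the neighborhood-based predictor developed later) and a hypothetical set of unrated items:

```python
import heapq

def top_k_items(user, unrated_items, predict, k=5):
    # Rank the user's unrated items by predicted rating and keep the best k.
    return heapq.nlargest(k, unrated_items, key=lambda item: predict(user, item))

# Usage with a toy stand-in for the real prediction function:
fake_scores = {("u3", 1): 6.5, ("u3", 6): 4.0, ("u3", 9): 5.2}
predict = lambda u, j: fake_scores.get((u, j), 0.0)
print(top_k_items("u3", [1, 6, 9], predict, k=2))   # -> [1, 9]
```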
Key Properties of Ratings Matrices

• We assume that the ratings matrix is denoted by R, and it is an m × n
matrix containing m users and n items. Therefore, the rating of user u for
item j is denoted by ruj. Only a small subset of the entries in the ratings
matrix are typically specified.
• The specified entries of the matrix are referred to as the training data,
whereas the unspecified entries of the matrix are referred to as the test
data.
• This definition has a direct analog in classification, regression, and
semi-supervised learning algorithms.
• In those problems, all the unspecified entries belong to a special column, which is
known as the class variable or dependent variable. Therefore, the
recommendation problem can be viewed as a generalization of the problem
of classification and regression.
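
A small sketch of this train/test view of the matrix, assuming a hypothetical partially observed R stored with NaN for unspecified entries: the observed cells play the role of training data, and the missing cells are the positions the recommender must predict.

```python
import math

NAN = float("nan")
R = [[7.0, 6.0, NAN],    # hypothetical, partially observed ratings matrix
     [NAN, 3.0, 3.0]]

# Observed entries = training data; missing positions = what we must predict.
train = {(u, j): r for u, row in enumerate(R)
         for j, r in enumerate(row) if not math.isnan(r)}
test_positions = [(u, j) for u, row in enumerate(R)
                  for j, r in enumerate(row) if math.isnan(r)]
print("training entries:", train)
print("positions to predict:", test_positions)
```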
Ratings Types

1. Continuous ratings
2. Interval-based ratings
3. Ordinal ratings
4. Binary ratings
5. Unary ratings

• Note: Indirect derivation of unary ratings from customer actions is also
referred to as implicit feedback, because the customer does not explicitly
provide feedback.
• Such types of “ratings” are often easier to obtain because users are far more
likely to interact with items on an online site than to explicitly rate them.
Long-Tail Property

• The distribution of ratings among
items often satisfies a property in
real-world settings, which is
referred to as the long-tail
property. According to this
property, only a small fraction of
the items are rated frequently.
Such items are referred to as
popular items. The vast majority of
items are rated rarely. This results
in a highly skewed distribution of
the underlying ratings.
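
A quick illustrative sketch (hypothetical data, not from the lecture) of how the long-tail property shows up when counting ratings per item:

```python
from collections import Counter

# Hypothetical rating log: each entry names the item that received a rating.
rating_log = ["i1", "i1", "i1", "i1", "i1", "i2", "i2", "i3", "i4"]
for item, n in Counter(rating_log).most_common():
    print(f"{item}: {'#' * n}")   # crude bar chart: few popular items, long tail
```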
Predicting Ratings with Neighborhood-Based
Methods

• There are two basic principles used in neighborhood-based
models:
• 1. User-based models: Similar users have similar ratings on the
same item. Therefore, if Alice and Bob have rated movies in a
similar way in the past, then one can use Alice’s observed
ratings on the movie Terminator to predict Bob’s unobserved
ratings on this movie.
• 2. Item-based models: Similar items are rated in a similar way
by the same user. Therefore, Bob’s ratings on similar science
fiction movies like Alien and Predator can be used to predict his
rating on Terminator.
Example

For the m× n ratings matrix R = [ruj ] with m users and n items, let Iu denote the set of item indices for
which ratings have been specified by user (row) u.

For example, if the ratings of the first, third, and fifth items (columns) of user (row) u are specified
(observed) and the remaining are missing, then we have Iu = {1, 3, 5}. Therefore, the set of items
rated by both users u and v is given by Iu ∩ Iv. For example, if user v has rated the first four items,
then Iv = {1, 2, 3, 4}, and Iu ∩ Iv = {1, 3, 5} ∩ {1, 2, 3, 4} = {1, 3}. It is possible (and quite common)
for Iu ∩ Iv to be an empty set because ratings matrices are generally sparse. The set Iu ∩ Iv defines the
mutually observed ratings, which are used to compute the similarity between the uth and vth users for
neighborhood computation. The similarity is typically measured with the Pearson correlation
coefficient, computed over the mutually observed items:

Pearson(u, v) = Σk∈Iu∩Iv (ruk − μu)·(rvk − μv) / [ √(Σk∈Iu∩Iv (ruk − μu)²) · √(Σk∈Iu∩Iv (rvk − μv)²) ]

Strictly speaking, the traditional definition of Pearson(u, v) mandates that the values of μu and μv
should be computed only over the items that are rated both by users u and v.
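
The computation above can be sketched in Python as follows, with each user's ratings stored as an {item: rating} dict. Note that, as is common in practice, the means μu and μv below are taken over each user's own rated items rather than strictly over Iu ∩ Iv, and the rating values in the usage example are hypothetical.

```python
import math

def pearson(ru, rv):
    # Pearson(u, v) over the mutually observed items Iu ∩ Iv.
    common = set(ru) & set(rv)            # Iu ∩ Iv
    if not common:
        return 0.0                        # empty overlap: treat as zero similarity
    mu_u = sum(ru.values()) / len(ru)     # μu over all of u's rated items
    mu_v = sum(rv.values()) / len(rv)
    num = sum((ru[k] - mu_u) * (rv[k] - mu_v) for k in common)
    den = (math.sqrt(sum((ru[k] - mu_u) ** 2 for k in common))
           * math.sqrt(sum((rv[k] - mu_v) ** 2 for k in common)))
    return num / den if den else 0.0

# Usage with the index sets from the example: Iu = {1, 3, 5}, Iv = {1, 2, 3, 4}.
ru = {1: 7.0, 3: 7.0, 5: 5.0}             # hypothetical rating values
rv = {1: 6.0, 2: 7.0, 3: 4.0, 4: 4.0}
print(pearson(ru, rv))                    # computed over Iu ∩ Iv = {1, 3}
```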
Example

The mean-centered rating suj of a user u for item j is
defined by subtracting her mean rating from the raw
rating ruj:

suj = ruj − μu,  where μu = ( Σj∈Iu ruj ) / |Iu|

The overall neighborhood-based prediction function is:

r̂uj = μu + [ Σv∈Pu(j) Sim(u, v) · svj ] / [ Σv∈Pu(j) |Sim(u, v)| ]

where Pu(j) denotes the set of the k most similar users to u who have rated item j.
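
A sketch of this prediction function under the same {item: rating} representation as the previous snippet; the peer group Pu(j) is approximated by the k most similar users to u who have rated item j, and the small ratings dictionary at the bottom is hypothetical.

```python
import math

def pearson(ru, rv):
    # Same Pearson similarity as in the previous sketch.
    common = set(ru) & set(rv)
    if not common:
        return 0.0
    mu_u, mu_v = sum(ru.values()) / len(ru), sum(rv.values()) / len(rv)
    num = sum((ru[k] - mu_u) * (rv[k] - mu_v) for k in common)
    den = (math.sqrt(sum((ru[k] - mu_u) ** 2 for k in common))
           * math.sqrt(sum((rv[k] - mu_v) ** 2 for k in common)))
    return num / den if den else 0.0

def predict(R, u, j, k=2):
    # r̂uj = μu + Σ Sim(u,v)·svj / Σ |Sim(u,v)|, summed over the peer group Pu(j).
    mu_u = sum(R[u].values()) / len(R[u])
    peers = sorted((v for v in R if v != u and j in R[v]),     # users who rated j
                   key=lambda v: pearson(R[u], R[v]), reverse=True)[:k]
    num = den = 0.0
    for v in peers:
        s = pearson(R[u], R[v])
        mu_v = sum(R[v].values()) / len(R[v])
        num += s * (R[v][j] - mu_v)       # Sim(u,v) times mean-centered svj
        den += abs(s)
    return mu_u + num / den if den else mu_u

# Hypothetical ratings; user "u3" has not rated item 1.
R = {"u1": {1: 7.0, 2: 6.0, 6: 5.0},
     "u2": {1: 6.0, 2: 7.0, 6: 4.0},
     "u3": {2: 3.0, 6: 1.0}}
print(predict(R, "u3", 1))
```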
Example

The mean-centered ratings (rating − mean):

For item 1:
User 1: 7 − 5.5 = 1.5
User 2: 6 − 4.8 = 1.2
For item 6:
User 1: 5 − 5.5 = −1.5
User 2: 4 − 4.8 = −0.8

By using the Pearson-weighted average of the raw
ratings of users 1 and 2, the following predictions
are obtained for user 3 with respect to her unrated
items 1 and 6:
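
The slide's mean-centering arithmetic can be checked with a few lines. Only the entries quoted on the slide are used here, since the full ratings matrix appeared as a figure in the original deck, and the means 5.5 and 4.8 are taken from the slide rather than recomputed:

```python
# Entries and means quoted on the slide (the full matrix was a figure).
ratings = {"user1": {1: 7, 6: 5}, "user2": {1: 6, 6: 4}}
means = {"user1": 5.5, "user2": 4.8}

for u in ratings:
    for item, r in sorted(ratings[u].items()):
        print(f"{u}, item {item}: {r} - {means[u]} = {r - means[u]:+.1f}")
# user1, item 1: 7 - 5.5 = +1.5      user1, item 6: 5 - 5.5 = -1.5
# user2, item 1: 6 - 4.8 = +1.2      user2, item 6: 4 - 4.8 = -0.8
```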
Similarity Function Variants
• Several other variants of the similarity function are
used in practice. One variant is to use the cosine
function on the raw ratings rather than the mean-
centered ratings:

RawCosine(u, v) = Σk∈Iu∩Iv ruk · rvk / [ √(Σk∈Iu∩Iv ruk²) · √(Σk∈Iu∩Iv rvk²) ]

• In some implementations of the raw cosine, the
normalization factors in the denominator are based on
all the specified items and not the mutually rated items.
• The reliability of the similarity function Sim(u, v) is
often affected by the number of common ratings
|Iu ∩ Iv| between users u and v.
• When the two users have only a small number of
ratings in common, the similarity function should be
reduced with a discount factor to de-emphasize the
importance of that user pair. This method is referred to
as significance weighting.
• The discount factor kicks in when the number of
common ratings between the two users is less than a
particular threshold β:

DiscountedSim(u, v) = [ min(|Iu ∩ Iv|, β) / β ] · Sim(u, v)
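
A sketch of significance weighting under the threshold formulation above; the base similarity function and the threshold β are parameters, and the discount min(|Iu ∩ Iv|, β)/β leaves the similarity untouched once the two users share at least β ratings:

```python
def discounted_sim(ru, rv, sim_fn, beta=5):
    # DiscountedSim(u, v) = min(|Iu ∩ Iv|, beta) / beta * Sim(u, v)
    n_common = len(set(ru) & set(rv))               # |Iu ∩ Iv|
    return (min(n_common, beta) / beta) * sim_fn(ru, rv)

# Usage with a stand-in similarity of 1.0: two common items, beta = 5.
ru, rv = {1: 7, 3: 7, 5: 5}, {1: 6, 2: 7, 3: 4, 4: 4}
print(discounted_sim(ru, rv, lambda a, b: 1.0))     # -> 0.4
```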
Variants of the Prediction Function
