0% found this document useful (0 votes)

19 views46 pages

DM Lect 6 - Recommender Systems

The document discusses recommender systems, which provide personalized recommendations to users based on their preferences and behaviors. It covers various types of recommender systems, including collaborative filtering, content-based, and hybrid systems, along with their inputs, outputs, and challenges such as cold start and sparsity. Additionally, it highlights evaluation metrics and advanced techniques like context-aware and trust-based systems.

Uploaded by

mohamed2004mowaffak

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views46 pages

DM Lect 6 - Recommender Systems

Uploaded by

mohamed2004mowaffak

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 46

Data Mining

Recommender Systems

Dr. Wedad Hussein

[email protected]

Dr. Mahmoud Mounir

[email protected]
Web Personalization

• Definition: the process of customizing

a Web site to the needs of specific
users.
• The success of Web personalization
depends on its ability to anticipate the
users’ needs or next moves and
recommend the suitable objects.
How many times have you seen this
statement?
“People who downloaded / bought /
liked this item also downloaded / bought
/ liked items X and Y”

Association Rules
Recommender Systems!
Recommender Systems
• Definition: systems that produce
individualized recommendations as output.
• They can guide the user in a personalized
way to interesting or useful objects in a
large space of possible options.
• Recommender Systems vs. Information
Retrieval Systems??
• "individuality" and "personalized
recommendations"
Taxonomy

Recommender
Systems

Collaborative
Content-Based Hybrid
Filtering
Inputs
1. Ratings: The opinions of users in the
items available in the system.
2. Demographic Data: Data about the
user like age, gender and education.
3. Content Data: Textual data related to
the contents of the items to be
recommended.
Collaborative Filtering
Systems
Collaborative Filtering

• Identify like-minded users.

• Search for the “Neighborhood” of the user; that
is the group of users exhibiting similar behavior
to the current user.
• Builds a user-item matrix containing the ratings
of users to all items whenever available.
• e-Commerce systems like e-Bay and Amazon
are using collaborative filtering to present their
users with a recommended list of products.
Outputs

• Collaborative filtering systems could be

used for one of two purposes:
• Prediction: Generates a value indicating
the expected rating of an item by the
current user.
• Recommendation: Produces a list of N
items that the user is expected to like (Top
N recommendations).
User-item matrix

Item 1 Item 2 … … Item m

User 1 R11 R12 … … R1m
User 2 R21 R22 … … R2m
… … … … … …
… … … … … …
User n Rn1 Rn2 … … Rnm
Representation
Current User Users
1 1st item rate
0 Dislike
?
0
1 Like
1

Items
? 1
Unknown
0
1
1
0
1
1
1
1
0 14th item rate
Neighborhood Formation

• Similarity between users in the user-item

matrix should be calculated.
• Users similar to the active user will form a
proximity-based neighborhood with him.
• Implemented in two steps:
1. Similarity between all users is calculated.
2. Similarities of users are processed to find
neighbors.
Similarity Measures
• Cosine Similarity
Similarity Measures
• Pearson Correlation
Predicting Ratings

• Select from the set of nearest neighbors

the users that rated the target item.
• The predicted rating is given by:

• Where n is the neighborhood size and

simuj is the similarity between current user
u and user j.
Top-N Recommendation

• Perform a frequency count of the items

that each neighbor user has purchased or
rated.
• Exclude items already rated by the active
user.
• Sort the remaining items according to their
frequency counts.
• Return the N most frequent items, as the
recommendation for active user.
Item-based CF

• The item-based approach works by

comparing items based on their pattern
of ratings across users. The similarity of
items i and j is computed as follows:
sim(i, j ) =
 uU
(ru ,i − ru )(ru , j − ru )

uU u,i u
( r − r ) 2
uU u, j u
( r − r ) 2
Item-based Recommendation
• After computing the similarity between items we
select a set of k most similar items to the target
item and generate a predicted value of user u’s
rating

p (u, i ) =
 r
jJ u , j
 sim(i, j )
 jJ
sim(i, j )
where J is the set of k similar items

• Advantages?
Evaluation

• Mean Absolute Error (MAE):

• Root Mean Square Error (RMSE):

• Where arij is the actual rating provided by user i for

item j, rij is the predicted rating and ni is the
number of items already rated by the user.
Evaluation

• Coverage: the percentage of items for which

the system can provide recommendations.

• Where m is the number of users, npi is the

number of items for which the system was able
to provide recommendations and ni is the
number of items already rated by the user.
Challenges with
Collaborative Filtering
Systems
Cold Start Problem
• Cold start refers newly added items or users.
• An item cannot be recommended until it has been
rated by a number of users.
• For a user, a system can’t find his set of nearest
neighbors unless he has rated a number of items.

• Solution?
• Integrating other sources of information.
• Collecting preferences / profiles over multiple sites
Sparsity of User-item Matrix
• A user typically only rates a very small
portion of the items.
• We need to find commonly rated items to
locate neighbors.
• Given the sparsity of the matrix the decisions
are usually based on very few items making
the predictions inaccurate and unreliable.
• Solution?
• Default Voting
• Clustering
• Dimensionality Reduction
Other Challenges
• Scalability: The calculations grow with the
number of users and items.
• Solution?
• Clustering

• Popularity Bias (The Gray Sheep Problem): The

system is not capable of offering accurate
recommendations for users with unique tastes
Content-Based Systems
Content-Based Recommendation

• In content-based recommendations the

system tries to recommend items that
matches the User Profile.
• The Profile is based on items user has
liked in the past or explicit interests that
he defines.
• A content-based recommender system
matches the profile of the item to the user
profile to decide on its relevancy to the
user.
Content-Based Systems

• They usually use the vector-space

model to represent items.
• Advantage: They can overcome the
cold start problem (new items)
• Disadvantage: Over specialization.
Approaches

A. Case-Based Reasoning:
• calculate similarity based on the attributes of the
item.
• Recommends items which are most similar to the
items the user has liked before.
• Still suffer from a new user problem.
B. Attribute-Based Techniques:
• Include information about the user in the
recommendation process.
• overcome the new user problem.
• Disadvantage: do not adapt to new ratings added
since user information is static.
Hybrid Systems
Hybrid Systems

• Hybrid schemes attempt to combine user

ratings and content information to yield
better recommendations.
• Methods:
• Generate recommendations from both
techniques separately and later combining the
recommendation lists.
• Incorporate content information into the data
collected by collaborative filtering systems
1. Feature Augmentation

• Based on content features additional

ratings are created.
• E.g. User X likes Items 1 and 3:
• Item7 is similar to 1 and 3 by a degree of
0.75
• Thus User X likes Item7 by 0.75
• User-item matrix becomes less sparse.
2. Weighted Hybrid

Score
Candidate Recommender 1
Weighted
Combination
Recommender 2
Score

Combined Score
Weighted Hybrid Example
n

rec weighted
(u, i ) =   k  reck (u, i )
k =1

Recommender 1 Recommender 2
Item1 0.5 1 Item1 0.8 2
Item2 0 Item2 0.9 1
Item3 0.3 2 Item3 0.4 3
Item4 0.1 3 Item4 0
Item5 0 Item5 0

Recommender weighted(0.5:0.5)
Item1 0.65 1
How are
Item2 0.45 2
weights
Item3 0.35 3
assigned?
Item4 0.05 4
Item5 0.00
Assigning Weights

• The weights are assigned in one of two

ways:
• Training Phase: During this phase training
data are fed to the two recommender systems
and weights are assigned according to the
accuracy of the predictions.
• Adjustable Weights: Start with equal
weights and these weights are adjusted
periodically to reflect the accuracy of
prediction.
3. Switching Hybrid

User Profile Recommender 1

?
Selection
Criteria

Recommender 2

Selected
Score
Recommender
4. Cascade Hybrid

Cascade Hybrid

Primary
Candidate Recommender

Score

Secondary Score
Recommender Combined Score
Method
• Each recommender system filters the list of
items produced by the previous one.
• Subsequent recommender may not introduce
additional items
• For all k > 1

reck (u , i ) : reck −1 (u , i )  0
reck (u , i ) = 
 0 : otherwise
Cascade Hybrid Example
Recommender 1 Recommender 2
Item1 0.5 1 Item1 0.8 2
Item2 0 Item2 0.9 1
Item3 0.3 2 Item3 0.4 3
Item4 0.1 3 Item4 0
Item5 0 Item5 0

Removing no-go items Ordering and refinement

Recommender 3
Item1 0.80 1
Item2 0.00
Item3 0.40 2
Item4 0.00
Item5 0.00
Other Techniques
1. Context-Aware Recommender Systems

• Recommend a vacation
• Winter vs. summer
• Recommend a purchase
• Gift vs. for yourself
• Recommend a movie
• With friends vs. with family
What Other Techniques Ignore

• What is the user doing when asking for

a recommendation?
• Where (and when) the user is located?
• What does the user really want (e.g.,
improve his knowledge or really buy a
product)?
• Is the user alone or with other fellows?
Challenges

• Obtain sufficient and reliable data

describing the user context.
• Selecting the right information.
• Computational model: how to extend
Collaborative Filtering to include
contextual dimensions?
2. Social (Trust based) Systems

• Intuition – Users tend to receive advice

from people they trust, i.e., from their
friends.
• Trusted friends can be defined explicitly
by the users or inferred from social
networks they are registered to.
Trust- based Collaborative
Filtering
Active users’ trusted
friends

Active user

3
?

Rating
prediction
Recommended Readings

• Vozalis, E., Margaritis, K., Analysis of

Recommender Systems’ Algorithms, In The Sixth
Hellenic European Conference on Computer
Mathematics and its Applications (HERCMA 2003),
Athens, Greece, 2003, pp. 1-14.
• Recommender Systems Handbook, Ricci, F.;
Rokach, L.; Shapira, B.; Kantor, P.B. (Eds.)2011.
• Burke, R.D., Hybrid Recommender Systems:
Survey and Experiments, User Modeling and User-
Adapted Interaction 12(4), 2002, pp. 331-370.
Thank You

Recommendation System Final
No ratings yet
Recommendation System Final
16 pages
ISO IEC 33001 Complete Self-Assessment Guide
From Everand
ISO IEC 33001 Complete Self-Assessment Guide
Gerardus Blokdyk
No ratings yet
Value Proposition: HP Indigo 7500 Digital Press Presentation
No ratings yet
Value Proposition: HP Indigo 7500 Digital Press Presentation
57 pages
Recommender Systems
No ratings yet
Recommender Systems
12 pages
Book Recommendation System Project
No ratings yet
Book Recommendation System Project
14 pages
Unit 1 Recommender Systems
No ratings yet
Unit 1 Recommender Systems
33 pages
TECHNICAL+NOTE Recommender+Systems+v.27
No ratings yet
TECHNICAL+NOTE Recommender+Systems+v.27
16 pages
Recommendation System
No ratings yet
Recommendation System
21 pages
Building Accurate and Practical Recomender System Usnig ML Classifier and CBF by Asma
No ratings yet
Building Accurate and Practical Recomender System Usnig ML Classifier and CBF by Asma
19 pages
Unit-1 - Introduction
No ratings yet
Unit-1 - Introduction
46 pages
Recommender Systems
No ratings yet
Recommender Systems
23 pages
Recommender System - New
No ratings yet
Recommender System - New
49 pages
Slides Lecture 2 RecSys
No ratings yet
Slides Lecture 2 RecSys
86 pages
Module 5
No ratings yet
Module 5
8 pages
An Introduction To Recommender Systems
No ratings yet
An Introduction To Recommender Systems
6 pages
CAIM: Cerca I Anàlisi D'informació Massiva: FIB, Grau en Enginyeria Informàtica
No ratings yet
CAIM: Cerca I Anàlisi D'informació Massiva: FIB, Grau en Enginyeria Informàtica
36 pages
Module5 Recommender Systems PartA
No ratings yet
Module5 Recommender Systems PartA
54 pages
DM Lec 6
No ratings yet
DM Lec 6
4 pages
RS Part 1
No ratings yet
RS Part 1
40 pages
Recommended System
No ratings yet
Recommended System
33 pages
Module 5
No ratings yet
Module 5
50 pages
Unit 3
No ratings yet
Unit 3
21 pages
Module4 RecommenderSystem
No ratings yet
Module4 RecommenderSystem
11 pages
Recommender System
No ratings yet
Recommender System
26 pages
Recommendation System
No ratings yet
Recommendation System
17 pages
LITERATURE SURVEY ON RECOMMENDATION ENGINEaper
No ratings yet
LITERATURE SURVEY ON RECOMMENDATION ENGINEaper
9 pages
Recommendation Systems
No ratings yet
Recommendation Systems
12 pages
Session 1 2
No ratings yet
Session 1 2
92 pages
Recommendation System-WPS Office
No ratings yet
Recommendation System-WPS Office
18 pages
Recommender Systems Notes
No ratings yet
Recommender Systems Notes
16 pages
Aai - Unit 3
No ratings yet
Aai - Unit 3
25 pages
Recommendation Systems: Department of Computer Science Engineering University School of Information and Technology
No ratings yet
Recommendation Systems: Department of Computer Science Engineering University School of Information and Technology
6 pages
RecommenderSystems Shortened
No ratings yet
RecommenderSystems Shortened
95 pages
10 Recommender Systems
No ratings yet
10 Recommender Systems
35 pages
RecSys Updated
No ratings yet
RecSys Updated
37 pages
Review of Clustering-Based Recommender Systems
No ratings yet
Review of Clustering-Based Recommender Systems
22 pages
Lec15-S Sarkar
No ratings yet
Lec15-S Sarkar
12 pages
On Rec Sys
No ratings yet
On Rec Sys
145 pages
IDEA - Collaborative Filtering Techniques in Recommendation Systems
No ratings yet
IDEA - Collaborative Filtering Techniques in Recommendation Systems
11 pages
Notes On Recommender Systems
No ratings yet
Notes On Recommender Systems
72 pages
Filtering and Recommender Systems: Content-Based and Collaborative
No ratings yet
Filtering and Recommender Systems: Content-Based and Collaborative
30 pages
MS - BDA Lec - Recommendation Systems I
No ratings yet
MS - BDA Lec - Recommendation Systems I
31 pages
Recommender - Introduction
No ratings yet
Recommender - Introduction
25 pages
Recommender Systems Asanov
No ratings yet
Recommender Systems Asanov
7 pages
Unit III Collaborative Filtering Final
No ratings yet
Unit III Collaborative Filtering Final
65 pages
Music Recommendation
100% (1)
Music Recommendation
113 pages
Survey Paper On Recommendation Engine
No ratings yet
Survey Paper On Recommendation Engine
9 pages
Movie Recommendations
No ratings yet
Movie Recommendations
12 pages
Unit V Chapter II
No ratings yet
Unit V Chapter II
22 pages
Other Techiniques
No ratings yet
Other Techiniques
63 pages
Unit Iii-Collaborative Filtering
No ratings yet
Unit Iii-Collaborative Filtering
34 pages
Unit III - 3.1 - Recommender Systems at CSJMU - 6 Slides Handouts
No ratings yet
Unit III - 3.1 - Recommender Systems at CSJMU - 6 Slides Handouts
3 pages
Lect 13 DM
No ratings yet
Lect 13 DM
20 pages
Recommendation System
No ratings yet
Recommendation System
32 pages
Recommendation System
No ratings yet
Recommendation System
19 pages
2023 Scopus Kids Hobby Prediction
No ratings yet
2023 Scopus Kids Hobby Prediction
6 pages
Unit Iii Collaborative Filtering
No ratings yet
Unit Iii Collaborative Filtering
51 pages
Recommendation Systems: A Review
No ratings yet
Recommendation Systems: A Review
6 pages
Book Recommendation System
No ratings yet
Book Recommendation System
8 pages
Sharma 2021
No ratings yet
Sharma 2021
16 pages
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
From Everand
Geometric Feature Learning: Unlocking Visual Insights through Geometric Feature Learning
Fouad Sabry
No ratings yet
Statistical Inference INF312 - Is - Lecture 03 - Part 2
No ratings yet
Statistical Inference INF312 - Is - Lecture 03 - Part 2
2 pages
Statistical Inference INF312 - Is - Lecture 03 - Part 3
No ratings yet
Statistical Inference INF312 - Is - Lecture 03 - Part 3
18 pages
Networks Lecture 5
No ratings yet
Networks Lecture 5
29 pages
Lecture 1 - Introduction To Data Security
No ratings yet
Lecture 1 - Introduction To Data Security
46 pages
5-Data Analytics in A Business Operations and BI Marketing Models
No ratings yet
5-Data Analytics in A Business Operations and BI Marketing Models
29 pages
DM Lect 9 - Classification - Decision Trees
No ratings yet
DM Lect 9 - Classification - Decision Trees
39 pages
Lecture 5 Modes of Operation
No ratings yet
Lecture 5 Modes of Operation
30 pages
Lec5-Regular Simplex Method and Dual Simplex Method
No ratings yet
Lec5-Regular Simplex Method and Dual Simplex Method
48 pages
3-Data Fundamentals For BI - Part2
No ratings yet
3-Data Fundamentals For BI - Part2
44 pages
Networks Lecture 1
No ratings yet
Networks Lecture 1
28 pages
1-Introduction To Business Intelligence in A Business Environment
No ratings yet
1-Introduction To Business Intelligence in A Business Environment
40 pages
Networks Lecture 2
No ratings yet
Networks Lecture 2
21 pages
Spider V 20 MkII Manual - English
No ratings yet
Spider V 20 MkII Manual - English
7 pages
C# Array PDF
No ratings yet
C# Array PDF
13 pages
A Comparative Study Between Android Ios
No ratings yet
A Comparative Study Between Android Ios
7 pages
Schneider Electric Altivar Machine ATV320 DTM Library V1.7.7 ReleaseNotes
No ratings yet
Schneider Electric Altivar Machine ATV320 DTM Library V1.7.7 ReleaseNotes
8 pages
5G in Military Usage
No ratings yet
5G in Military Usage
1 page
Advanced Java Programming Chapter 5 - Network Programming
No ratings yet
Advanced Java Programming Chapter 5 - Network Programming
39 pages
Detecting Suspicious File Migration or Replication in The Cloud
No ratings yet
Detecting Suspicious File Migration or Replication in The Cloud
14 pages
Assignment - Telecommunication Principles
No ratings yet
Assignment - Telecommunication Principles
3 pages
VRF PRO V6x
No ratings yet
VRF PRO V6x
65 pages
CSC 101 - Ams 103 (Introduction To Computer Science)
No ratings yet
CSC 101 - Ams 103 (Introduction To Computer Science)
9 pages
Interactive Map
No ratings yet
Interactive Map
2 pages
Intrusion Detection With Suricata
No ratings yet
Intrusion Detection With Suricata
32 pages
Answer
No ratings yet
Answer
9 pages
Csi ZG520 Ec-3r First Sem 2023-2024
No ratings yet
Csi ZG520 Ec-3r First Sem 2023-2024
4 pages
4.production System Modeling
No ratings yet
4.production System Modeling
56 pages
Sentry Hps HT 10-80kva
No ratings yet
Sentry Hps HT 10-80kva
14 pages
Dumpsys ANR WindowManager
No ratings yet
Dumpsys ANR WindowManager
1,707 pages
Digital Signal Processing: M.Sivakumar
100% (1)
Digital Signal Processing: M.Sivakumar
44 pages
NPTEL Online Course Details For ECE
No ratings yet
NPTEL Online Course Details For ECE
4 pages
Brkucc 2801
No ratings yet
Brkucc 2801
231 pages
Quick Setup Guide: MFC-L2717DW / MFC-L2710DW / MFC-L2690DWXL / MFC-L2690DW / DCP-L2550DW / HL-L2390DW
No ratings yet
Quick Setup Guide: MFC-L2717DW / MFC-L2710DW / MFC-L2690DWXL / MFC-L2690DW / DCP-L2550DW / HL-L2390DW
2 pages
Aditya's Resume
No ratings yet
Aditya's Resume
1 page
Unit 3 PHP
No ratings yet
Unit 3 PHP
18 pages
Project Diary
No ratings yet
Project Diary
20 pages
Question Paper Part-2 Virtual ITT Batch - 010 (Rewari Branch of NIRC of ICAI) Project Work Based Questions 275 Marks
No ratings yet
Question Paper Part-2 Virtual ITT Batch - 010 (Rewari Branch of NIRC of ICAI) Project Work Based Questions 275 Marks
4 pages
Nmap
No ratings yet
Nmap
2 pages
Linear Programming - 17 March 23
No ratings yet
Linear Programming - 17 March 23
8 pages
Data Flow Diagrams
No ratings yet
Data Flow Diagrams
26 pages
Natural General Intelligence How Understanding The Brain Can Help Us Build Ai 1nbsped 0192843885 9780192843883 Compress
No ratings yet
Natural General Intelligence How Understanding The Brain Can Help Us Build Ai 1nbsped 0192843885 9780192843883 Compress
341 pages

DM Lect 6 - Recommender Systems

Uploaded by

DM Lect 6 - Recommender Systems

Uploaded by

Data Mining

Dr. Wedad Hussein

Dr. Mahmoud Mounir

• Definition: the process of customizing

• Identify like-minded users.

• Collaborative filtering systems could be

Item 1 Item 2 … … Item m

• Similarity between users in the user-item

• Select from the set of nearest neighbors

• Where n is the neighborhood size and

• Perform a frequency count of the items

• The item-based approach works by

• Mean Absolute Error (MAE):

• Root Mean Square Error (RMSE):

• Where arij is the actual rating provided by user i for

• Coverage: the percentage of items for which

• Where m is the number of users, npi is the

• Popularity Bias (The Gray Sheep Problem): The

• In content-based recommendations the

• They usually use the vector-space

• Hybrid schemes attempt to combine user

• Based on content features additional

• The weights are assigned in one of two

User Profile Recommender 1

Removing no-go items Ordering and refinement

• What is the user doing when asking for

• Obtain sufficient and reliable data

• Intuition – Users tend to receive advice

• Vozalis, E., Margaritis, K., Analysis of

You might also like