Recommender System
Recommender System
Topics
Recommender System
▪ Introduction & Applications
▪ Types of Recommender System
▪ Concepts & Methods
▪ Evaluation Criteria
Introduction : Why?
▪ How do we buy things in our day-to-day lives?
▪ We ask our friends, research the product specifications,
compare the product with similar products on the Internet,
read the feedback from anonymous users, and then we
make decisions.
▪ How would it be if there is some mechanism that does all
these tasks automatically and recommends the products
best suited for you efficiently?
▪ The answer is a Recommender System
Introduction: What?
▪ “Friends you may know" - Facebook
▪ “People you may know” – LinkedIn
▪ What are these recommendations ?
▪ These features recommends a list of people whom you might
know
▪ who are similar to you based on your friends
▪ friends of friends in your close circle
▪ geographical location
▪ Skillsets
▪ Groups
▪ liked pages….
▪ recommendations are specific to you and differ from user to user.
INTRODUCTION: DEFINITION
▪ Recommender systems are the software tools and techniques that
provide suggestions, such as useful products on e-commerce
websites, videos on YouTube, friends' recommendations on
Facebook, book recommendations on Amazon, news
recommendations on online news websites, and the so on.
▪ The main goal of recommender systems is to provide suggestions
to online users to make better decisions from many alternatives
available over the Web.
▪ A better recommender system is directed more towards
personalized recommendations by taking into consideration the
available digital footprint of the user and information about a
product, such as specifications, feedback from the users,
comparison with other products, and so on, before making
recommendations.
▪ You can also apply apart from SVD, Deep Learning for Model-based
recommender systems
Types of Recommender system
▪ Content-based recommender systems
▪ Collaborative filtering recommender systems
Data pre-processing techniques
▪ Similarity measurements
▪ Euclidean distance
▪ Cosine distance
▪ Pearson coefficient
▪ Dimensionality-reduction techniques
▪ Principal component analysis (PCA)
▪ Singular Value Decomposition etc..
Similarity Measure - Euclidian distance
▪ (x, y) are two consecutive data points, and n is the number of attributes for the
dataset.
Similarity Measure - Cosine distance
Ram 1 5 4
Shyam 2 3 4
Seeta 4 5 2
Geeta 4 5
Collaborative filtering
▪ Item-based collaborative filtering: This
recommends to a user the items that are most
similar to the user's purchases
▪ User-based collaborative filtering: This
recommends to a user the items that are the most
preferred by similar users
Item-based collaborative filtering (IBCF)
▪ Collaborative filtering is a branch of recommendation that takes
account of the information about different users. The word
"collaborative" refers to the fact that users collaborate with each other
to recommend items. In fact, the algorithms take account of user
purchases and preferences.
▪ The starting point is a rating matrix in which rows correspond to users
and columns correspond to items.
▪ For each two items, measure how similar they are in terms of having
received similar ratings by similar users
▪ For each item, identify the similar items and store as item-item similarity
matrix
▪ For each user whose recommendations need to be generated, identify the
items that are most similar to this user‘s already purchased items.
▪ Item-item similarity values act as weight
▪ This is done by calculating new rated items as
▪ Pick the top-rated items.
User-based collaborative filtering (UBCF)
▪ Measure how similar each user is to the new one. Like IBCF,
popular similarity measures are correlation and cosine.
▪ Identify the most similar users.
▪ You can also take account of the users whose similarity is above a
defined threshold
▪ Rate the items purchased by the most similar users. The rating is
the average rating among similar users :
▪ Take Weighted average rating, using the similarities as weights
▪ Relative difference is applied to overcome user bias for poor raters and
generous raters