UNIT III
COLLABORATIVE FILTERING
A systematic approach, nearest-neighbour collaborative filtering (CF), user-based and item-based
CF, components of neighbourhood methods (rating normalization, similarity weight computation, and
neighbourhood selection).
Item-based Collaborative Filtering: The ratings of a group of similar items are used to make
recommendations for an item I. To predict the rating that a user U would give to I, we calculate the
weighted average of U's ratings of the k items most similar to I (its neighbors), where the weights are
determined by the similarity between I and each neighbor.
During this first phase, it is usual to precompute the similarity matrix to obtain good performance at
inference time. In the case of item-based models, an item-item similarity matrix is built by applying
the similarity metric to all pairs of items. Since the rating matrix is sparse, we only consider the set of
mutually rated pairs (users who rated both items) during the similarity computation. For instance, the
similarity between the items in columns 1 and 4 of the example rating matrix (not reproduced here) is
computed as the similarity between the vectors [4, 3, 5] and [5, 3, 4]. Due to the sparsity of the matrix,
a pair of items may have no co-ratings at all, resulting in an empty set; in that case, a similarity of 0 is
assigned to that pair. To improve computational efficiency, it is common to consider only the k
nearest neighbors of an item at inference time.
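To make this precomputation concrete, here is a minimal sketch in plain Python. It assumes ratings are stored as a dict mapping each item to its {user: rating} entries; the function name pearson_corated and the toy data (taken from the columns 1 and 4 example above) are illustrative, not from the original text.

# Toy slice of the rating matrix: item -> {user: rating}.
ratings = {
    "item1": {"u1": 4, "u2": 3, "u3": 5},
    "item4": {"u1": 5, "u2": 3, "u3": 4},
}

def pearson_corated(a, b):
    # Pearson similarity restricted to users who rated BOTH items.
    common = set(a) & set(b)
    if not common:
        return 0.0  # no co-ratings: assign 0 similarity
    xs = [a[u] for u in common]
    ys = [b[u] for u in common]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = (sum((x - mx) ** 2 for x in xs) ** 0.5
           * sum((y - my) ** 2 for y in ys) ** 0.5)
    return num / den if den else 0.0

# Offline: similarity for every item pair (here just one pair).
sims = {(i, j): pearson_corated(ratings[i], ratings[j])
        for i in ratings for j in ratings if i < j}
print(sims)  # {('item1', 'item4'): ...} -> similarity of about 0.5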
Let's say we want to predict how Madison rated the book Animal Farm, and we set k = 2 as the
number of nearest neighbors to consider during the calculation. To simplify the example, we manually
calculate only the similarities between the target item and the items from columns 2 and 4, because
they are this item's nearest neighbors. When calculating the mean rating during the similarity
computation, we consider only the set of ratings that the two items have in common (their co-ratings).
The figure below (not reproduced here) shows how the neighborhood is formed: the red circle is the
value we are trying to predict; the green squares are Madison's ratings that will be used to infer the
rating for the target item; the two other ratings, marked with an X, are not considered because k = 2.
The orange rectangles mark the set of co-ratings between the target item and the item from column 2,
while the blue rectangles mark the co-ratings between the target item and the item from column 4.
These are the common ratings between the target item (item 3) and its first neighbor (item 2):
[4, 3, 3] and [4, 4, 3]. The first step is to calculate the mean of each set:

mean(item 3) = (4 + 3 + 3) / 3 ≈ 3.33
mean(item 2) = (4 + 4 + 3) / 3 ≈ 3.67

The Pearson similarity formula centers the ratings by their means, so we transform the two vectors
into [0.67, -0.33, -0.33] and [0.33, 0.33, -0.67] and plug the results into the equation:

sim(3, 2) = Σ (r3 - 3.33)(r2 - 3.67) / ( sqrt(Σ (r3 - 3.33)²) · sqrt(Σ (r2 - 3.67)²) )
          ≈ 0.33 / (0.82 × 0.82) ≈ 0.5

The same calculation is done for the similarity between items 3 and 4, over their own set of
co-ratings (the full rating matrix is not reproduced here).

Next, we calculate the mean of each item, this time considering all of the item's ratings. Then we
plug those means, the two similarities, and Madison's ratings for items 2 and 4 (1 and 3,
respectively) into the mean-centered prediction equation:

prediction(Madison, item 3) = mean(item 3)
    + [ sim(3, 2) × (1 - mean(item 2)) + sim(3, 4) × (3 - mean(item 4)) ]
      / ( |sim(3, 2)| + |sim(3, 4)| )

Evaluating this expression yields the predicted rating.
Since ratings are discrete numbers, we round this value to 2. It’s important to note that in a real-world
setting, it’s often recommended to use neighborhood methods only when k is above a certain
threshold because, when the number of neighbors is small, the predictions are usually not precise. An
alternative would be to use Content-based filtering when we do not have enough data about the user-
item relationship.
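Putting the whole prediction step into code makes the procedure easier to follow. Below is a small illustrative sketch of the mean-centered weighted average used above; the function and variable names (predict_item_based, item_means) are hypothetical, not from the original text.

def predict_item_based(user_ratings, item_means, target, neighbors, sims):
    # user_ratings: the target user's known ratings, item -> rating
    # item_means: each item's mean over ALL of its ratings
    # neighbors: the k items most similar to `target` that the user has rated
    # sims: (item_a, item_b) -> precomputed similarity
    def sim(i, j):
        return sims.get((min(i, j), max(i, j)), 0.0)
    num = sum(sim(target, j) * (user_ratings[j] - item_means[j]) for j in neighbors)
    den = sum(abs(sim(target, j)) for j in neighbors)
    # Fall back to the item's own mean when the neighborhood carries no weight.
    return item_means[target] + num / den if den else item_means[target]

# Shape of the Madison example (the means and sims come from the full matrix):
# round(predict_item_based({"item2": 1, "item4": 3}, means, "item3",
#                          ["item2", "item4"], sims))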
Online and Offline Phases
Neighborhood-based methods separate the computations into two phases: offline, where the model is
fitted; and online, where inferences are made. In the offline phase, the user-user (or item-item)
similarity values are precomputed, and the k most similar users or items are predetermined. These pre-
computed values are leveraged to make fast predictions during the online phase.
Although good online performance is a major benefit of these methods, there is also a known
disadvantage: the similarity matrix can become huge depending on the number of users/items in the
system, so the offline phase does not scale well. In addition, because the methods do not adapt to
change on their own, the precomputed similarities and nearest neighbors must be updated to account
for new users and items, which makes the retraining process even more challenging.
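The two-phase split can be sketched as follows; this is an illustrative toy, with made-up similarity values and a hypothetical helper table (neighbor_table).

import heapq

# Toy precomputed similarities from an offline run (illustrative values).
sims = {("item1", "item3"): 0.5, ("item1", "item4"): 0.5, ("item3", "item4"): 0.2}
items = ["item1", "item3", "item4"]

def sim(i, j):
    return sims.get((min(i, j), max(i, j)), 0.0)

# Offline phase: rank every other item by similarity once, keep the top k.
neighbor_table = {i: heapq.nlargest(2, [j for j in items if j != i],
                                    key=lambda j, i=i: sim(i, j))
                  for i in items}

# Online phase: answering a request is now a dictionary lookup plus O(k) work,
# instead of a scan over all items.
print(neighbor_table["item3"])  # ['item1', 'item4']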
The Cold-Start Problem
As previously stated, the effectiveness of Neighborhood-based CF algorithms depends on user-item
interaction data. However, the user-item matrix is often sparse, with many missing values, as most
users only interact with a small fraction of the items. This sparsity can lead to inaccurate
recommendations due to the small neighborhood size.
This challenge is known as the cold-start problem, and it is most apparent when new users or items
enter the system. In this setting, the algorithm does not have enough data to form the nearest
neighbors, so the system cannot make useful recommendations.
Another important property of the user-item matrix is that the distribution of ratings among items
follows a long-tail pattern: a small subset of items receives a significant number of ratings and is
considered popular, while most items receive few or no ratings at all. As a result, it is difficult to
make precise predictions for items in the long tail with these methods, which can be a problem
because less-rated items may carry large profit margins, a point explored by Chris Anderson in his
book "The Long Tail". The same skew can also reduce the diversity of recommendations, because the
algorithm will usually recommend only popular items.
To address these limitations to some extent, alternative algorithms can be used, such as matrix
factorization and hybrid algorithms, which combine CF with content-based filtering methods.
User-based and Item-based CF
Collaborative Filtering (CF) is a popular technique in recommendation systems that helps predict a
user's preferences by leveraging the preferences of other users or items. There are two main types of
collaborative filtering: user-based and item-based.
1. User-Based Collaborative Filtering:
Idea: This approach relies on the assumption that users who have agreed in the past
tend to agree again in the future. In other words, it recommends items to a user based
on the preferences of users with similar tastes.
Workflow:
1. Similarity Calculation: Measure the similarity between users based on their
historical interactions or preferences. Common similarity metrics include
cosine similarity, Pearson correlation, or Jaccard similarity.
2. Neighborhood Selection: Identify a subset of users (neighborhood) who are
most similar to the target user.
3. Prediction: Predict the target user's preference for a particular item by
aggregating the preferences of the selected neighborhood, for example by
taking a similarity-weighted average of their ratings (a sketch of this
workflow follows the challenges list below).
Advantages:
Intuitive approach based on the idea of finding like-minded users.
Easily interpretable.
Challenges:
Cold-start problem for new users.
Scalability issues with a large user base.
Sparsity of data can lead to unreliable predictions.
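A minimal sketch of the three steps above, assuming the user-user similarities were precomputed offline; predict_user_based and the data layout are illustrative, not a fixed API.

import heapq

def predict_user_based(target, item, ratings_by_user, user_sims, k=10):
    # ratings_by_user: user -> {item: rating}
    # user_sims: (user_a, user_b) -> precomputed similarity
    def sim(u, v):
        return user_sims.get((min(u, v), max(u, v)), 0.0)
    # Steps 1 and 2: neighborhood = the k most similar users who rated the item.
    raters = [v for v in ratings_by_user
              if v != target and item in ratings_by_user[v]]
    hood = heapq.nlargest(k, raters, key=lambda v: sim(target, v))
    # Step 3: similarity-weighted average of the neighbors' ratings.
    num = sum(sim(target, v) * ratings_by_user[v][item] for v in hood)
    den = sum(abs(sim(target, v)) for v in hood)
    return num / den if den else None  # empty neighborhood: no prediction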
2. Item-Based Collaborative Filtering:
Idea: This approach focuses on the similarity between items rather than users. It
recommends items to a user based on the similarity between the items the user has
liked or interacted with in the past.
Workflow:
1. Similarity Calculation: Measure the similarity between items based on the
users who have interacted with them. Similarity metrics are typically the
same as those used in user-based CF.
2. Neighborhood Selection: Identify a subset of items that are most similar to
the target item.
3. Prediction: Predict the target user's preference for a particular item based on
their historical preferences for similar items. This is done by aggregating the
ratings of the selected neighborhood.
Advantages:
Item-item similarities are typically more stable over time than user-user
similarities, so the precomputed model needs less frequent updating.
Often scales better when the system has many more users than items.
Challenges:
Cold-start problem for new items.
Can reduce diversity, since recommended items tend to resemble items the
user has already consumed.
Components of Neighborhood Methods
1. Rating Normalization:
Users interpret the rating scale differently: some rate generously, others harshly. Rating
normalization transforms the raw ratings onto a common footing before similarities and predictions
are computed.
Mean Centering: Subtract the user's (or item's) mean rating from each raw rating.
Formula: Normalized Rating (r') = Rating (r) - Mean Rating of the User
Benefits:
1. Removes Rating Bias: Accounts for users who consistently rate higher or lower than
the average.
2. Comparability: Puts the ratings of different users (or items) on a common scale so
they can be compared directly.
3. Improved Model Performance: Normalized ratings provide a more consistent basis for
measuring similarity between users or items, leading to more accurate predictions.
Z-Score Normalization: In addition to mean centering, another normalization technique is Z-score
normalization. This involves scaling the ratings by the standard deviation of a user's (or item's)
ratings.
Formula: Normalized Rating (r') = (Rating (r) - Mean Rating of the User) / Standard
Deviation of Ratings of the User
When to Use:
Z-score normalization is particularly useful when dealing with users who have a wide range
of rating scales or exhibit extreme rating behaviors.
Example:
If a user tends to give ratings that are consistently higher or lower than the average, Z-score
normalization will scale those ratings based on how much they deviate from the mean.
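Both normalizations follow directly from the formulas above; this is a small illustrative sketch (the function names are hypothetical).

def mean_center(user_ratings):
    # r' = r - mean of the user's ratings
    mu = sum(user_ratings.values()) / len(user_ratings)
    return {item: r - mu for item, r in user_ratings.items()}

def z_score(user_ratings):
    # r' = (r - mean) / standard deviation of the user's ratings
    mu = sum(user_ratings.values()) / len(user_ratings)
    sd = (sum((r - mu) ** 2 for r in user_ratings.values())
          / len(user_ratings)) ** 0.5
    # A user who rates everything identically has sd = 0; map those ratings to 0.
    return {item: (r - mu) / sd if sd else 0.0
            for item, r in user_ratings.items()}

print(z_score({"a": 5, "b": 1, "c": 3}))  # {'a': 1.22..., 'b': -1.22..., 'c': 0.0}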
2. Similarity Weight Computation:
Once ratings are normalized, a similarity weight is computed between every pair of users (or items).
Common metrics:
1. Cosine Similarity:
Formula: sim(x, y) = (x · y) / (||x|| · ||y||)
Measures the cosine of the angle between two vectors representing user or item
preferences.
Appropriate for scenarios where the orientation of the preference vectors matters
more than their magnitude, which cosine similarity normalizes away.
2. Pearson Correlation:
Formula: sim(x, y) = Σ (xi - x̄)(yi - ȳ) / ( sqrt(Σ (xi - x̄)²) · sqrt(Σ (yi - ȳ)²) )
Measures the linear correlation between two vectors, centering each by its mean
before comparing direction and relative spread.
Suitable for situations where users or items differ in their rating scales, since the
centering removes this bias.
3. Jaccard Similarity:
Formula: J(A, B) = |A ∩ B| / |A ∪ B|
Measures the overlap between two sets of interactions, ignoring rating values.
Suitable for implicit or binary feedback, where only the presence of an interaction
matters.
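The three metrics side by side, as a short illustrative sketch; note that Pearson correlation is just cosine similarity applied to mean-centered vectors.

def cosine(x, y):
    # cos(x, y) = (x . y) / (||x|| * ||y||)
    dot = sum(a * b for a, b in zip(x, y))
    nx = sum(a * a for a in x) ** 0.5
    ny = sum(b * b for b in y) ** 0.5
    return dot / (nx * ny) if nx and ny else 0.0

def pearson(x, y):
    # Pearson correlation = cosine of the mean-centered vectors.
    mx, my = sum(x) / len(x), sum(y) / len(y)
    return cosine([a - mx for a in x], [b - my for b in y])

def jaccard(a, b):
    # Overlap of two interaction sets; rating values are ignored.
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0

print(pearson([4, 3, 3], [4, 4, 3]))  # ~0.5, matching the worked example above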
3. Neighborhood Selection:
Neighborhood selection is a critical step in collaborative filtering algorithms, where the goal is to
identify a subset of users or items (the neighborhood) that are most similar to the target user or item.
This selected subset is then used to make predictions or recommendations for the target user. The
neighborhood selection process involves deciding which users or items to include in the neighborhood
and, in some cases, setting a limit on the number of neighbors to consider.
Methods of Neighborhood Selection:
1. Top-N Neighbors:
Objective: Select the N most similar users or items based on the computed similarity
weights.
Process: Rank all potential neighbors based on their similarity weights and select the
top N for inclusion in the neighborhood.
Example: If N is set to 10, the top 10 most similar users to the target user form the
neighborhood for making recommendations.
Purpose: Focuses on the most similar entities, ensuring a balance between accuracy
and computational efficiency.
2. Threshold-Based Selection:
Objective: Include only those users or items with a similarity score above a certain
threshold.
Process: Set a similarity threshold, and include users or items in the neighborhood
only if their similarity score surpasses this threshold.
Example: If the similarity threshold is set to 0.8, only users or items with a similarity
score of 0.8 or higher are included in the neighborhood.
Purpose: Provides more flexibility in the size of the neighborhood and can be used to
filter out less relevant or less similar entities.
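Both selection strategies can live behind one small helper; a sketch with hypothetical names, where sims maps each candidate to its similarity with the target.

def select_neighbors(candidates, sims, n=None, threshold=None):
    # Rank candidates by similarity to the target, most similar first.
    ranked = sorted(candidates, key=lambda c: sims[c], reverse=True)
    if threshold is not None:
        # Threshold-based selection: keep only sufficiently similar entities.
        ranked = [c for c in ranked if sims[c] >= threshold]
    if n is not None:
        # Top-N selection: cap the neighborhood size.
        ranked = ranked[:n]
    return ranked

# Top-10 neighbors:              select_neighbors(users, sims, n=10)
# Similarity of 0.8 or higher:   select_neighbors(users, sims, threshold=0.8)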
Considerations in Neighborhood Selection:
1. Computational Complexity:
Selecting too many neighbors can lead to increased computational complexity,
especially in large datasets.
Balancing the number of neighbors to include is crucial for achieving a trade-off
between accuracy and efficiency.
2. Sparsity of Data:
In sparse datasets, where users have only interacted with a small fraction of items, it
may be challenging to find a sufficient number of neighbors.
Threshold-based methods can be useful in such scenarios.
3. Impact on Cold Start:
Neighborhood selection methods should account for the "cold start" problem, where
new users or items have limited interaction history.
In such cases, alternative methods like content-based recommendations or hybrid
models may be employed.
Example:
Let's consider a user-based collaborative filtering scenario. If User A is the target user, the
neighborhood selection process may involve computing similarity weights with all other users and
selecting the top 10 most similar users as neighbors for User A.
Purpose of Neighborhood Selection:
1. Relevance: Ensures that the selected neighbors are the most relevant and similar entities to
the target user or item.
2. Computational Efficiency: Manages computational complexity by limiting the number of
neighbors, balancing accuracy with efficiency.
3. Personalization: Allows the recommendation system to tailor recommendations based on the
preferences of a select group of similar users or items.