Recommender System
A recommender system, or recommendation system (sometimes replacing 'system' with a
synonym such as platform or engine), is a subclass of information filtering system that seeks to
predict the "rating" or "preference" a user would give to an item.
Recommender systems are used in a variety of areas, with commonly recognised examples
taking the form of playlist generators for video and music services, product recommenders for
online stores, or content recommenders for social media platforms and open web content
recommenders. These systems can operate using a single input, like music, or multiple inputs
within and across platforms like news, books, and search queries. There are also popular
recommender systems for specific topics like restaurants and online applications. Recommender
systems have also been developed for exploring research articles, experts, collaborators, and
financial services.
In all of these problems, the common thread is that they aim to increase customer satisfaction
and in turn drive business in the form of increased commissions, greater sales, etc. Whatever the
use case may be, the data typically records, for each interaction: the user, the item, the rating or
interaction between them, and any other features like the details of the product or demographics of the
customer.
In this section we introduce a model for recommendation systems, based on a utility matrix of
preferences.
In a recommendation-system application there are two classes of entities, which we shall refer to
as users and items. Users have preferences for certain items, and these preferences must be
teased out of the data. The data itself is represented as a utility matrix, giving for each user-item
pair a value that represents what is known about the degree of preference of that user for that
item. Values come from an ordered set, e.g., integers 1-5 representing the number of stars that
the user gave as a rating for that item. We assume that the matrix is sparse, meaning that most
entries are "unknown". An unknown rating implies that we have no explicit information about the
user's preference for the item.
Example: In Fig. 9.1 we see an example utility matrix, representing users' ratings of movies on a
1-5 scale, with 5 the highest rating. Blanks represent the situation where the user has not rated
the movie. The movie names are HP1, HP2, and HP3 for Harry Potter I, II, and III; TW for
Twilight; and SW1, SW2, and SW3 for Star Wars episodes 1, 2, and 3. The users are represented
by capital letters A through D.
Notice that most user-movie pairs have blanks, meaning the user has not rated the movie. In
practice, the matrix would be even sparser, with the typical user rating only a tiny fraction of all
available movies.
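As a concrete sketch, such a utility matrix can be represented as a NumPy array with NaN standing in for blanks; the ratings below are illustrative values on the 1-5 scale, not the exact entries of Fig. 9.1:

```python
import numpy as np

users = ["A", "B", "C", "D"]
movies = ["HP1", "HP2", "HP3", "TW", "SW1", "SW2", "SW3"]
blank = np.nan  # an unrated entry: unknown, not zero

# Illustrative ratings on a 1-5 scale; most entries are blank (sparse).
utility = np.array([
    [4,     blank, blank, 5,     1,     blank, blank],  # user A
    [5,     5,     4,     blank, blank, blank, blank],  # user B
    [blank, blank, blank, 2,     4,     5,     blank],  # user C
    [blank, blank, 3,     blank, blank, blank, 3],      # user D
])

known = ~np.isnan(utility)
print(known.sum(), "known ratings out of", utility.size)
```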
The goal of a recommendation system is to predict the blanks in the utility matrix. For example,
would user A like SW2? There is little evidence from the tiny matrix in Fig. 9.1. We might design
our recommendation system to take into account properties of movies, such as their producer,
director, stars, or even the similarity of their names. If so, we might then note the similarity
between SW1 and SW2, and then conclude that since A did not like SW1, they were unlikely to
enjoy SW2 either. Alternatively, with much more data, we might observe that the people who
rated both SW1 and SW2 tended to give them similar ratings. Thus, we could conclude that A
would also give SW2 a low rating, similar to A's rating of SW1.
Methods
There are two basic architectures for a recommendation system:
1. Content-Based systems focus on the properties of items. Similarity of items is determined by
measuring the similarity of their properties.
2. Collaborative-Filtering systems focus on the relationship between users and items. Similarity
of items is determined by the similarity of the ratings of those items by the users who have rated
both items.
Consider an example of recommending news articles to users. Let's say we have 100 articles
and a vocabulary of size N. We first compute the tf-idf score of each word for every
article. Then we construct 2 vectors:
1. Item vector: This is a vector of length N. It contains 1 for words that have a high tf-idf
score in that article, otherwise 0.
2. User vector: Again a 1xN vector. For every word, we store the probability of the word occurring
(i.e., having a high tf-idf score) in articles that the user has consumed. Note here that the user
vector is based on the attributes of the item (tf-idf scores of words in this case).
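A minimal sketch of constructing these two profile vectors, assuming the tf-idf matrix has already been computed (random stand-in scores and a hypothetical "high tf-idf" threshold of 0.8 below):

```python
import numpy as np

rng = np.random.default_rng(0)
n_articles, vocab_size = 100, 50

# Stand-in for a precomputed tf-idf matrix (articles x vocabulary).
tfidf = rng.random((n_articles, vocab_size))

# Item vectors: 1 where the word's tf-idf score is "high", else 0.
item_vectors = (tfidf > 0.8).astype(float)

# User vector: for each word, the fraction of the user's consumed articles
# in which that word had a high tf-idf score.
consumed = [3, 17, 42]  # hypothetical indices of articles the user has read
user_vector = item_vectors[consumed].mean(axis=0)
```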
Once we have these profiles, we compute similarities between the users and the items. The
items recommended are the ones that 1) the user has the highest similarity with, or 2)
have the highest similarity with the other items the user has read. There are multiple ways of doing
this. Let's look at 2 common methods:
1. Cosine Similarity:
To compute similarity between the user and item, we simply take the cosine similarity between
the user vector and the item vector. This gives us user-item similarity.
To recommend items that are most similar to the items the user has bought, we compute cosine
similarity between the articles the user has read and other articles. The ones that are most
similar are recommended. Thus this is item-item similarity.
Cosine similarity is best suited when you have high dimensional features, especially in
information retrieval and text mining.
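A minimal cosine-similarity sketch; the user and item vectors below are hypothetical stand-ins for the tf-idf-based profiles described above:

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors; 0 if either is all-zero."""
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom else 0.0

user = np.array([0.9, 0.1, 0.0, 0.4])   # hypothetical user profile
items = np.array([[1, 0, 0, 1],          # article 0's binary item vector
                  [0, 1, 1, 0]])         # article 1's binary item vector

# User-item similarity: recommend the article most similar to the user.
scores = [cosine_similarity(user, item) for item in items]
best = int(np.argmax(scores))
```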
2. Jaccard similarity:
Also known as intersection over union, it is computed for two sets A and B as
J(A, B) = |A ∩ B| / |A ∪ B|.
This is used for item-item similarity. We compare item vectors with each other and return the
items that are most similar.
Jaccard similarity is useful only when the vectors contain binary values. If they have rankings or
ratings that can take on multiple values, Jaccard similarity is not applicable.
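A small sketch of Jaccard similarity over binary item vectors (the vectors below are hypothetical):

```python
def jaccard(a, b):
    """Intersection over union of two binary vectors given as 0/1 sequences."""
    intersection = sum(x and y for x, y in zip(a, b))
    union = sum(x or y for x, y in zip(a, b))
    return intersection / union if union else 0.0

item_x = [1, 1, 0, 1, 0]
item_y = [1, 0, 0, 1, 1]
sim = jaccard(item_x, item_y)  # 2 shared words / 4 total words = 0.5
```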
In addition to the similarity methods, for content-based recommendation, we can treat
recommendation as a simple machine learning problem. Here, regular machine learning
algorithms like random forest, XGBoost, etc., come in handy.
This method is useful when we have a whole lot of 'external' features, like weather conditions,
market factors, etc., which are not a property of the user or the product and can be highly
variable. For example, the previous day's opening and closing prices play an important role in
determining the profitability of investing in a particular stock. This comes under the class of
supervised problems where the label is whether the user liked/clicked on a product or not (0/1),
or the rating the user gave that product, or the number of units the user bought.
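As a sketch of this supervised framing, the snippet below fits a minimal hand-rolled logistic-regression classifier to synthetic user/product/external features with a 0/1 clicked label; in practice an off-the-shelf model like random forest or XGBoost would fill the same role. All data here is synthetic:

```python
import numpy as np

rng = np.random.default_rng(1)

# Each row: [user feature, product feature, external feature, e.g. the
# prior day's price change]; label = 1 if the user clicked/bought, else 0.
X = rng.normal(size=(200, 3))
true_w = np.array([1.5, -2.0, 0.8])          # hidden "true" preference weights
y = (X @ true_w + rng.normal(scale=0.3, size=200) > 0).astype(float)

# Gradient descent on the mean logistic loss.
w = np.zeros(3)
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-np.clip(X @ w, -30, 30)))  # clip avoids overflow
    w -= 0.1 * X.T @ (p - y) / len(y)

accuracy = (((X @ w) > 0) == (y == 1)).mean()
```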
Collaborative Filtering
Collaborative filtering is based on the assumption that people who agreed in the past will agree in
the future, and that they will like similar kinds of items as they liked in the past. The system
generates recommendations using only information about rating profiles for different users or
items. By locating peer users/items with a rating history similar to the current user or item, it
generates recommendations using this neighborhood.
The underlying assumption of the collaborative filtering approach is that if A and B buy similar
products, A is more likely to buy a product that B has bought than a product which a random
person has bought. Unlike content-based filtering, there are no features corresponding to users or items
here. All we have is the utility matrix. This is what it looks like:
A, B, C, D are the users, and the columns represent movies. The values represent ratings (1-5)
a user has given a movie. In other cases, these values could be 0/1 depending on whether the
user watched the movie or not.
When building a model from a user's behavior, a distinction is often made between explicit
forms of data collection (e.g., asking a user to rate an item) and implicit forms (e.g., observing
the items a user views or purchases). However it is collected, collaborative filtering faces
several well-known challenges:
Cold start: For a new user or item, there isn't enough data to make accurate
recommendations.
Scalability: In many of the environments in which these systems make
recommendations, there are millions of users and products. Thus, a large amount of
computation power is often necessary to calculate recommendations.
Sparsity: The number of items sold on major e-commerce sites is extremely large.
The most active users will only have rated a small subset of the overall database.
Thus, even the most popular items have very few ratings.
One of the most famous examples of collaborative filtering is item-to-item collaborative filtering
(people who buy x also buy y), an algorithm popularized by Amazon.com's recommender
system.
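A small sketch of the item-to-item idea: treat each item's column of ratings as a vector (unrated = 0) and compare columns with cosine similarity, so items rated similarly by the same users come out similar. The ratings below are hypothetical:

```python
import numpy as np

# Rows: users, columns: items; 0 marks an unrated entry (hypothetical data).
R = np.array([[4, 5, 0, 0],
              [5, 5, 1, 0],
              [0, 0, 4, 5],
              [1, 0, 5, 4]], dtype=float)

def item_similarity(R, x, y):
    """Cosine similarity between the rating columns of items x and y."""
    cx, cy = R[:, x], R[:, y]
    denom = np.linalg.norm(cx) * np.linalg.norm(cy)
    return float(cx @ cy / denom) if denom else 0.0

sim_01 = item_similarity(R, 0, 1)  # items liked by the same users: high
sim_02 = item_similarity(R, 0, 2)  # items liked by disjoint users: low
```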
There are 2 broad categories that collaborative filtering can be split into: memory-based
(neighborhood) methods and model-based methods, such as matrix factorization.
In the memory-based approach, let mean_i be the mean rating that user i has given all the
movies he/she has rated. Using this, we estimate user i's rating of movie k as follows:
rating(i, k) = mean_i + [ Σ_a sim(a, i) × (r(a, k) − mean_a) ] / Σ_a |sim(a, i)|
where the sums run over users a who have rated movie k. Similarity between users a and i can
be computed using any method like cosine similarity, Jaccard similarity, Pearson's correlation
coefficient, etc.
These results are very easy to create and interpret, but once the data becomes too sparse,
performance becomes poor.
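A sketch of this neighborhood (user-based) prediction, using a Pearson-style similarity computed over the movies two users have both rated; the small utility matrix is hypothetical:

```python
import numpy as np

# Utility matrix (users x movies), NaN = unknown rating; hypothetical data.
R = np.array([[4,      np.nan, 1,      np.nan],
              [5,      4,      np.nan, 2],
              [4,      5,      1,      1]])

def predict(R, i, k):
    """Mean rating of user i plus similarity-weighted deviations of the
    other users' ratings of movie k from their own means."""
    means = np.array([np.nanmean(row) for row in R])
    num = den = 0.0
    for a in range(len(R)):
        if a == i or np.isnan(R[a, k]):
            continue
        both = ~np.isnan(R[i]) & ~np.isnan(R[a])  # movies both have rated
        if both.sum() < 2:
            continue
        u, v = R[i, both] - means[i], R[a, both] - means[a]
        denom = np.linalg.norm(u) * np.linalg.norm(v)
        if denom == 0:
            continue
        sim = u @ v / denom
        num += sim * (R[a, k] - means[a])
        den += abs(sim)
    return means[i] + num / den if den else means[i]

pred = predict(R, 0, 1)  # estimate user 0's rating of movie 1
```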
In the model-based approach, our utility matrix decomposes into U and V, where U represents
the users and V represents the movies in a low-dimensional space. This can be achieved by using matrix
decomposition techniques like SVD or PCA, or by learning the 2 embedding matrices using
neural networks with the help of some optimizer like Adam, SGD, etc.
For a user i and every movie j, we just need to compute the predicted rating y(i, j) — the dot
product of user i's vector and movie j's vector — and recommend the movies
with the highest predicted rating. This approach is most useful when we have a ton of data and it
has high sparsity. Matrix factorization helps by reducing the dimensionality, hence making
computation faster. One disadvantage of this method is that we tend to lose interpretability, as we
do not know what exactly the elements of the user/item vectors mean.
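A minimal matrix-factorization sketch: learn U and V by gradient descent on the squared error over only the observed entries of a synthetic low-rank ratings matrix (all values below are synthetic, and the learning rate and iteration count are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(2)
n_users, n_movies, k = 20, 30, 4

# Synthetic low-rank "true" ratings; only ~30% of entries are observed.
true_R = rng.normal(size=(n_users, k)) @ rng.normal(size=(n_movies, k)).T
mask = rng.random((n_users, n_movies)) < 0.3

# U: user embeddings, V: movie embeddings, both in a k-dimensional space.
U = rng.normal(scale=0.1, size=(n_users, k))
V = rng.normal(scale=0.1, size=(n_movies, k))

def observed_rmse(U, V):
    err = (U @ V.T - true_R) * mask
    return float(np.sqrt((err ** 2).sum() / mask.sum()))

before = observed_rmse(U, V)
lr = 0.01
for _ in range(2000):
    err = (U @ V.T - true_R) * mask      # error on observed entries only
    U, V = U - lr * err @ V, V - lr * err.T @ U

after = observed_rmse(U, V)
pred = U @ V.T   # predicted rating for every (user, movie) pair
```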