Module4-RecommenderSystem
• Content-Based Filtering
• Collaborative Filtering
• Hybrid Systems
• Knowledge-Based Systems
• Deep Learning-Based Systems
1. Content-Based Filtering
Content-based filtering recommends items similar to those the user has interacted with in
the past by analyzing the item's attributes. It matches item features with user preferences
and uses similarity metrics to suggest items.
Advantages:
• Does not depend on other users' data; works from a single user's history.
• Can recommend new or niche items as long as their attributes are known.
Challenges:
• Requires well-structured item attributes (feature engineering).
• Tends to over-specialize, rarely recommending items unlike past choices.
Example: Recommending movies with similar genres, actors, or directors to those already
watched.
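The matching idea above can be shown in a few lines of Python. This is a minimal sketch, not a production recommender: the movie titles, feature sets, and the choice of Jaccard similarity are illustrative assumptions.

```python
# Content-based filtering sketch: match item feature sets against a
# user profile built from watched items (all data here is made up).

def jaccard(a, b):
    """Jaccard similarity between two feature sets."""
    return len(a & b) / len(a | b) if a | b else 0.0

# Item attributes: each movie is described by a set of features.
movies = {
    "Movie A": {"action", "sci-fi", "director_x"},
    "Movie B": {"action", "sci-fi", "director_y"},
    "Movie C": {"romance", "drama", "director_z"},
}

# User profile: the union of features from movies already watched.
watched = ["Movie A"]
profile = set().union(*(movies[m] for m in watched))

# Rank unwatched movies by similarity to the user profile.
scores = {m: jaccard(profile, feats)
          for m, feats in movies.items() if m not in watched}
best = max(scores, key=scores.get)
print(best)  # "Movie B" shares 'action' and 'sci-fi' with the profile
```

Replacing Jaccard with cosine similarity over weighted feature vectors is a common refinement when features have different importance.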
2. Collaborative Filtering
Collaborative filtering relies on the preferences and actions of other users to make
recommendations. It includes user-based and item-based approaches.
Advantages:
• No need to understand item attributes
• Handles diverse datasets well
Challenges:
• Cold start: new users and new items have no interaction history.
• Sparse interaction data and the cost of computing similarities at scale.
3. Hybrid Systems
Hybrid systems combine multiple approaches (e.g., content-based and collaborative
filtering) to improve accuracy. They mitigate weaknesses of individual systems and are
more robust.
4. Knowledge-Based Systems
Knowledge-based systems leverage specific knowledge about how certain item features
satisfy user requirements. They do not rely on user history and work well for new users and
items.
Challenges: Requires detailed item and user information and can be less adaptive.
Example: A travel booking system recommending trips based on budget, preferences, and
travel dates.
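The travel example can be sketched as simple constraint filtering: recommendations come from explicit requirements, not interaction history. The trip catalogue and constraint fields below are hypothetical.

```python
# Knowledge-based filtering sketch: filter a catalogue by the user's
# stated constraints (budget, theme, trip length). Data is illustrative.

trips = [
    {"name": "Beach Week",  "price": 800,  "theme": "beach",  "days": 7},
    {"name": "City Break",  "price": 400,  "theme": "city",   "days": 3},
    {"name": "Alpine Trek", "price": 1200, "theme": "hiking", "days": 10},
]

def recommend(trips, budget, theme=None, max_days=None):
    """Return names of trips satisfying every stated constraint."""
    results = []
    for t in trips:
        if t["price"] > budget:
            continue
        if theme is not None and t["theme"] != theme:
            continue
        if max_days is not None and t["days"] > max_days:
            continue
        results.append(t["name"])
    return results

print(recommend(trips, budget=1000))                # ['Beach Week', 'City Break']
print(recommend(trips, budget=1000, theme="city"))  # ['City Break']
```

Because no user history is involved, this works for brand-new users, which is exactly the cold-start strength noted above.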
1. Itemset
A collection of one or more items.
2. Support
Measures how frequently an itemset appears in the dataset.
**Formula**: Support(A) = (Transactions containing A) / (Total transactions)
3. Confidence
Measures the likelihood that item B is purchased when item A is purchased.
**Formula**: Confidence(A → B) = Support(A ∪ B) / Support(A)
Example: If 50% of transactions with {milk} also include {bread}, confidence is 0.5 (50%).
4. Lift
Indicates the strength of a rule by comparing the observed support to the expected support
if A and B were independent.
**Formula**: Lift(A → B) = Confidence(A → B) / Support(B) = Support(A ∪ B) / (Support(A) × Support(B))
Example: If {milk} and {bread} are purchased together more often than expected by chance,
the lift is greater than 1.
Example: {milk} → {bread} means 'If a customer buys milk, they are likely to buy bread.'
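These three metrics can be computed directly from raw transactions. The sketch below uses a made-up four-transaction dataset to illustrate the definitions:

```python
# Computing support, confidence, and lift for {milk} -> {bread}.
# The tiny transaction list is illustrative only.

transactions = [
    {"milk", "bread"},
    {"milk", "bread", "eggs"},
    {"milk"},
    {"bread", "eggs"},
]
n = len(transactions)

def support(itemset):
    """Fraction of transactions containing every item in itemset."""
    return sum(itemset <= t for t in transactions) / n

def confidence(a, b):
    return support(a | b) / support(a)

def lift(a, b):
    return confidence(a, b) / support(b)

milk, bread = {"milk"}, {"bread"}
print(support(milk | bread))    # 2/4 = 0.5
print(confidence(milk, bread))  # 0.5 / 0.75 ≈ 0.667
print(lift(milk, bread))        # 0.667 / 0.75 ≈ 0.889
```

Here the lift is below 1, so despite a reasonable confidence, milk and bread co-occur slightly less often than independence would predict in this toy data.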
Steps to Generate Association Rules
1. Choose minimum support and confidence thresholds.
2. Find all frequent itemsets that meet the support threshold (e.g., with the Apriori algorithm).
3. From each frequent itemset, generate candidate rules A → B and keep those that meet the confidence threshold.
4. Optionally rank the surviving rules by lift.
Advantages
• Easy to understand and interpret.
• Applicable to a wide range of industries.
Limitations
• May generate too many rules, making it hard to analyze.
• Does not capture time-based patterns or sequences.
Collaborative Filtering
Collaborative filtering is a popular technique in recommendation systems that predicts a
user's interest in items by analyzing their past behavior and the behavior of other users. It is
based on the assumption that if users have agreed in the past, they are likely to agree in the
future.
1. User-Based Collaborative Filtering
Finds users whose past ratings resemble the target user's and recommends items those neighbors liked.
Example:
Alice and Bob have similar preferences. If Bob likes a new movie, Alice might like it too.
Advantages:
• Intuitive and easy to explain.
Challenges:
• User similarities must be recomputed as preferences change, which scales poorly with many users.
2. Item-Based Collaborative Filtering
Recommends items that are frequently liked or bought together with items the user has already chosen.
Example:
If many users who bought a smartphone also bought a case, the system recommends a case
when a smartphone is purchased.
Advantages:
• More scalable than user-based filtering.
Challenges:
• Requires item similarity computation, which can be intensive for large datasets.
3. Matrix Factorization
Decomposes the user-item interaction matrix into latent factors representing user and item
characteristics. This is often implemented using techniques like Singular Value
Decomposition (SVD).
Advantages:
• Handles sparse interaction matrices better than neighborhood methods.
• Captures latent structure (e.g., hidden genres) shared across users and items.
Challenges:
• Cold start for new users and items; latent factors can be hard to interpret.
1. Interaction Matrix
A matrix where rows represent users and columns represent items. Example: If a user rates
a movie, the matrix records the rating at the corresponding row and column.
2. Similarity Metrics
Measures how similar users or items are to one another. Common metrics include Cosine
similarity, Pearson correlation, and Jaccard similarity.
3. Prediction
Predicts the likelihood of a user liking an item based on the preferences of similar users or
items.
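The three concepts above fit together as follows: cosine similarity is computed over rows of the interaction matrix, and a prediction is a similarity-weighted average of neighbors' ratings. A minimal sketch with hypothetical ratings (0 meaning "not rated", treated as-is for simplicity):

```python
import numpy as np

# User-based prediction sketch: cosine similarity over an interaction
# matrix, then a similarity-weighted average of neighbors' ratings.

R = np.array([
    [5, 4, 0, 1],   # user 0 (item 2 not yet rated)
    [4, 5, 2, 1],   # user 1 (similar taste to user 0)
    [1, 1, 5, 4],   # user 2 (different taste)
], dtype=float)

def cosine(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

def predict(R, user, item):
    """Predict R[user, item] from the other users who rated the item."""
    num = den = 0.0
    for other in range(R.shape[0]):
        if other == user or R[other, item] == 0:
            continue
        s = cosine(R[user], R[other])
        num += s * R[other, item]
        den += abs(s)
    return num / den if den else 0.0

# The similar user 1 (rating 2) outweighs the dissimilar user 2
# (rating 5), so the estimate lands closer to 2 than to 5.
print(round(predict(R, user=0, item=2), 2))
```

A production system would mean-center ratings and mask unrated entries when computing similarity; this sketch omits those refinements to keep the three concepts visible.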
Advantages
• Works well when user and item metadata are unavailable.
• Captures complex relationships among users and items.
Limitations
• Cold Start Problem: Struggles to recommend items to new users or suggest new items
with no prior interactions.
• Data Sparsity: Many users interact with only a small subset of items, leading to sparse
interaction matrices.
• Scalability: Computing similarity in large datasets can be resource-intensive.
Surprise Library
Surprise is a Python library designed specifically for building and evaluating
recommendation systems. It is widely used for collaborative filtering, particularly matrix
factorization, and other algorithms that predict user preferences based on historical data.
Installation
Install the Surprise library using pip:
pip install scikit-surprise
Key Features
1. Algorithms: Matrix factorization (e.g., SVD) and neighborhood-based (KNN) methods.
2. Evaluation: Built-in cross-validation and accuracy metrics such as RMSE and MAE.
3. Similarity Metrics: Cosine similarity, Pearson correlation, and custom distance metrics.
from surprise import Dataset, Reader, SVD, accuracy
from surprise.model_selection import cross_validate, train_test_split

# Load data from a custom file (replace 'data.csv' with your file);
# the Reader describes the file layout
reader = Reader(line_format='user item rating', sep=',')
data = Dataset.load_from_file('data.csv', reader=reader)

# Perform cross-validation with an SVD model
algo = SVD()
cross_validate(algo, data, measures=['RMSE', 'MAE'], cv=5, verbose=True)

# Calculate RMSE on a held-out test set
trainset, testset = train_test_split(data, test_size=0.25)
algo.fit(trainset)
predictions = algo.test(testset)
accuracy.rmse(predictions)
Applications of Surprise
• Movie Recommendations: Recommending movies based on user ratings (e.g., Netflix).
• E-commerce: Suggesting products based on purchase behavior.
• Content Personalization: Tailoring recommendations for individual users.
Advantages
• Easy to use and well-documented.
• Extensive support for collaborative filtering techniques.
• Optimized for performance.
Limitations
• Primarily focused on collaborative filtering.
• Requires familiarity with Python for effective use.
Matrix Factorization
Matrix Factorization is a popular approach used in recommendation systems, particularly in
collaborative filtering, to predict user-item interactions. It decomposes a large, sparse
interaction matrix (e.g., user-item rating matrix) into smaller matrices, capturing latent
factors that represent user preferences and item characteristics.
Key Concepts
• 1. Interaction Matrix: A matrix where rows represent users, columns represent items,
and values represent interactions (e.g., ratings, clicks, purchases).
• 2. Latent Factors: Hidden dimensions that describe user preferences and item attributes
(e.g., genres for movies).
• 3. Decomposition: The interaction matrix R is decomposed into two matrices: User
Matrix (P) and Item Matrix (Q). R ≈ P × Q^T, where R is reconstructed using the dot
product of P and Q.
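The decomposition R ≈ P × Q^T can be learned by gradient descent on the observed entries. The sketch below is a minimal illustration, not a tuned implementation: the rating matrix, learning rate, regularization, and epoch count are all assumed values.

```python
import numpy as np

# Matrix factorization sketch: learn P (users x k) and Q (items x k)
# by SGD so that P @ Q.T approximates the observed entries of R.

rng = np.random.default_rng(0)
R = np.array([
    [5, 3, 0, 1],
    [4, 0, 0, 1],
    [1, 1, 0, 5],
    [0, 1, 5, 4],
], dtype=float)                      # 0 = unobserved

k, lr, reg, epochs = 2, 0.01, 0.02, 2000
P = rng.normal(scale=0.1, size=(R.shape[0], k))
Q = rng.normal(scale=0.1, size=(R.shape[1], k))

observed = [(u, i) for u in range(R.shape[0])
            for i in range(R.shape[1]) if R[u, i] > 0]

for _ in range(epochs):
    for u, i in observed:
        err = R[u, i] - P[u] @ Q[i]             # prediction error
        P[u] += lr * (err * Q[i] - reg * P[u])  # gradient step on user factors
        Q[i] += lr * (err * P[u] - reg * Q[i])  # gradient step on item factors

# Reconstructed matrix: observed entries should be close to R,
# and previously empty cells now hold predictions.
R_hat = P @ Q.T
print(np.round(R_hat, 2))
```

The zeros in R are skipped during training, which is what lets the model fill them in afterwards; libraries like Surprise implement the same idea (plus biases) in their SVD algorithm.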
Limitations
• Cold Start Problem: Cannot recommend for new users or items with no prior
interactions.
• Scalability: Large datasets can be computationally expensive.
• Interpretability: Latent factors are abstract and may not have clear meanings.
from surprise import Dataset, SVD, accuracy
from surprise.model_selection import train_test_split

# Load the built-in MovieLens 100k dataset
data = Dataset.load_builtin('ml-100k')
trainset, testset = train_test_split(data, test_size=0.25)

# Apply SVD
algo = SVD()
algo.fit(trainset)

# Predict on the test set and calculate RMSE
predictions = algo.test(testset)
accuracy.rmse(predictions)
Conclusion
Matrix Factorization is a powerful technique for recommendation systems, offering
personalized and scalable solutions by leveraging latent factors. However, it requires
adequate data and computational resources to achieve effective results.