Data Mining For Web Personalization
Data Mining For Web Personalization
Introduction
Paper titled: Data Mining for Web Personalization Author: Bamshad Mobasher
Web Personalization
Delivery of dynamic content, such as text, links tailored to a particular user or segments of user
Set of n items:
I = {in, in, , in}
Session
A sequence of pageviews by a single user
All sessions belonging to a user can be aggregated to create the profile for that user
This sessionized data can be used as the input for a variety of data mining algorithms or further abstracted
Data Cleaning:
Tasks such as:
Removing extraneous references to embedded objects Removing references due to spider navigations
Episode Identification:
Episode is a subsequence of a session comprised of related pageviews
(1) Clustering
Clustering divides data into groups where:
Inter-cluster similarities are minimised Intra-cluster similarities are maximised
Generalization to Web usage mining User-based vs. Item-based clustering Efficiency and scalability improvements
Distance issues
Consider web-transactions as sequences
Originally for mining supermarket basket data Discovering Association Rules involves:
1)Discovering frequent itemsets
Satisfying a minimum support threshold
(2) Recommendations
Matching rule antecedents with target user profiles
Sliding window solution Naive approach Frequent Itemset Graph
Recommendation Value
Confidence of corresponding association rule
(2) Recommendations
i1,i2 =>
{i1,i2,i3}
(3) Recommendations
Trie-structure (aggregate tree)
Each node is an item, root is the empty sequence
Recommendation Generation
Found in O(s) by traversing the tree
s = the length of the current user transaction deemed to be useful in recommending the next set of items
Sliding window w Maximum depth of tree therefore is |w|+1 Controlling the size of the tree
Have more recently become popular as a modelling approach in web usage mining Two commonly used LVMs
Finite Mixture Models (FMM) Factor Analysis (FA)
The Primary Goal of this section is to evaluate the accuracy and effectiveness of web personalization models
Why Evaluate?
More complex web-based applications and more complex user interaction requires the selection of more sophisticated models Need to further explore the impact of recommended model on user behaviour There are several different modelling approaches to web personalization Evaluating personalized models is an inherently challenging task firstly, because different models require different evaluation metrics, secondly, the required personalization actions may be quite different depending on the underlying domain, relevant data and intended application Finally, there is also a lack of consensus among researchers as to what factors affect quality of service in personalized systems and of what elements contribute to user satisfaction
Conclusions
Web personalisation is viewed as an application of data mining which dynamically serves customized content (pages, products, recommendations, etc.) to users based on their profiles, preferences, or expected interests of data available to personalization systems, the modelling approaches employed and the current approaches to evaluating these systems We have also discussed the various sources of data available to personalization systems, the modelling approaches employed and the current approaches to evaluating these systems Recent user studies have found that a number of issues can affect the perceived usefulness of personalization systems including, trust in the system, transparency of the recommendation logic, ability for a user to refine the system generated profile and diversity of recommendations Most personalization systems tend to use a static profile of the user. However user interests are not static, changing with time and context. Few systems have attempted to handle the dynamics within the user profile.
Any Questions?