0% found this document useful (0 votes)

19 views76 pages

Week9 Decision Support in E-Business Web Analytics

Uploaded by

Palyam Rohith

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views76 pages

Week9 Decision Support in E-Business Web Analytics

Uploaded by

Palyam Rohith

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 76

E-BUSINESS

PROF. MAMATA JENAMANI

DEPARTMENT OF INDUSTRIAL AND SYSTEMS ENGINEERING
IIT KHARAGPUR

1
Week 9: Lecture1

DECISION SUPPORT CONCEPTS

We are going to learn
• Concepts related to decision support
• Applications

3
Decision support systems
• Provide interactive ad hoc support for the decision-making
processes of managers and other business professionals.
• Support nonroutine decision making
– Example: What is impact on production schedule if December sales
doubled?
• Model driven DSS,
• Data driven DSS
• Serve middle management
• Examples: product pricing, profitability forecasting, and risk analysis systems.
Components
of a typical
DSS
Analytical Models
• Optimization
• Simulation
• Decision analysis
• Static vs. dynamic models
• Deterministic Vs. Stochastic models

6
Statistical Models
• Descriptive statistics
• Outlier analysis
• Univariate predictive models
• Multi variate predictive models

7
Data mining Models
• Predictive:
– Regression
– Classification
– Collaborative Filtering
• Descriptive:
– Clustering / similarity matching
– Association rules and variants

8
Text mining
• Natural Language processing
• Discover patterns

9
Term Time frame Specific meaning
Decision support 1970–1985 Use of data analysis to support decision making

Executive support 1980–1990 Focus on data analysis for decisions by senior

executives
Online analytical 1990–2000 Software for analyzing multidimensional data tables
processing (OLAP)
Business intelligence 1989–2005 Tools to support data driven decisions, with emphasis
on reporting
Analytics 2005–2010 Focus on statistical and mathematical analysis for
decisions
Big data 2010–present Focus on very large, unstructured, fast-moving data

10
11
Week 9: Lecture2

UNDERSTANDING THE WEB LOG

We are going to learn
• How web logs are generated
• Structure of the access log
• Pre-processing
• Session identification

13
Review of
HTTP

http://www.highteck.net/EN/Application/Application_Layer_Functionality_and_Protocols.html
Review of
HTTP

http://www.doc.ic.ac.uk/~nd/surprise_97/journal/vol2/pcg1/
Review of HTTP

http://www.cisco.com/c/en/us/about/press/internet-protocol-journal/back-issues/table-contents-47/131-aggregation.html
Review
of HTTP

http://seo-advisors.com/searchegnies-web-crawlers/
Collecting site navigation data
• Server Log Files
– Access log
• Common log format
• Extended (Combined) log format
– Error log

http://httpd.apache.org/docs/1.3/logs.html
What’s in a typical Web server log …
<ip_addr> - - <date><method><file><protocol><code><bytes><referrer><user_agent>
203.30.5.145 - - [01/Jun/1999:03:09:21 -0600] "GET /Calls/OWOM.html HTTP/1.0" 200 3942 "http://www.lycos.com/cgi-
bin/pursuit?query=advertising+psychology-&maxhits=20&cat=dir" "Mozilla/4.5 [en] (Win98; I)"
203.30.5.145 - - [01/Jun/1999:03:09:23 -0600] "GET /Calls/Images/earthani.gif HTTP/1.0" 200 10689 "http://www.acr-
news.org/Calls/OWOM.html" "Mozilla/4.5 [en] (Win98; I)"
203.30.5.145 - - [01/Jun/1999:03:09:24 -0600] "GET /Calls/Images/line.gif HTTP/1.0" 200 190 "http://www.acr-
news.org/Calls/OWOM.html" "Mozilla/4.5 [en] (Win98; I)"
203.252.234.33 - - [01/Jun/1999:03:12:31 -0600] "GET / HTTP/1.0" 200 4980 "" "Mozilla/4.06 [en] (Win95; I)"
203.252.234.33 - - [01/Jun/1999:03:12:35 -0600] "GET /Images/line.gif HTTP/1.0" 200 190 "http://www.acr-news.org/"
"Mozilla/4.06 [en] (Win95; I)"
203.252.234.33 - - [01/Jun/1999:03:12:35 -0600] "GET /Images/red.gif HTTP/1.0" 200 104 "http://www.acr-news.org/"
"Mozilla/4.06 [en] (Win95; I)"
203.252.234.33 - - [01/Jun/1999:03:12:35 -0600] "GET /Images/earthani.gif HTTP/1.0" 200 10689 "http://www.acr-
news.org/" "Mozilla/4.06 [en] (Win95; I)"
203.252.234.33 - - [01/Jun/1999:03:13:11 -0600] "GET /CP.html HTTP/1.0" 200 3218 "http://www.acr-news.org/"
"Mozilla/4.06 [en] (Win95; I)“
203.30.5.145 - - [01/Jun/1999:03:13:25 -0600] "GET /Calls/AWAC.html HTTP/1.0" 200 104 "http://www.acr-
news.org/Calls/OWOM.html" "Mozilla/4.5 [en] (Win98; I)"
Field Data Description

Date date The date that the activity occurred

Time time The time that the activity occurred
Client IP address c-ip The IP address of the client that accessed your server
The name of the autheticated user who access your server, anonymous
User Name cs-username users are represented by -
Service Name s-sitename The Internet service and instance number that was accessed by a client
Server Name s-computername The name of the server on which the log entry was generated
Server IP Address s-ip The IP address of the server that accessed your server
Server Port s-port The port number the client is connected to
Method cs-method The action the client was trying to perform
URI Stem cs-uri-stem The resource accessed
URI Query cs-uri-query The query, if any, the client was trying to perform
Protocol Status sc-status The status of the action, in HTTP or FTP terms
Win32 Status sc-win32-status The status of the action, in terms used by Microsoft Windows
Bytes Sent sc-bytes The number of bytes sent by the server
Bytes Received cs-bytes The number of bytes received by the server
Time Taken time-taken The duration of time, in milliseconds, that the action consumed
Protocol Version cs-version The protocol (HTTP, FTP) version used by the client
Host cs-host Display the content of the host header
User Agent cs(User Agent) The browser used on the client
Cookie cs(Cookie) The content of the cookie sent or received, if any
The previous site visited by the user. This site provided a link to the current
Referrer cs(Referrer) site
s = server actions cs = client-to-server actions
W3C Extended Log File Format c = client actions sc = server-to-client actions
Extended Log: Few Important Fields with a closer look
• IP Address (ISP Provided) • Status
– 144.16.192.247 – HTTP status code
• User name • Size
– determined by HTTP – Total number of bytes transferred
authentication by the server to the client
• Time • Referrer
• Method/URL/Protocol: – The name of the URL from which
– Method of transaction such as GET the request originated
or POST
– URL • Agent
– Version of the HTTP Protocol used – Name and version of the browser
by the server making the request
127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700]
"GET /apache_pb.gif HTTP/1.0" 200 2326 "http://www.example.com/start.html"
"Mozilla/4.08 [en] (Win98; I ;Nav)"
Web usage
mining
process
Week 9: Lecture3

UNDERSTANDING THE WEB LOG-II

We are going to learn
• How web logs are generated
• Structure of the access log
• Preprocessing
• Session identification

26
Problems using the access log
• Caching
• Dynamic Address Allocation by the ISP
• Stateless nature of the HTTP protocol
• Crawler Activity
• Tedious preprocessing and cleaning steps
– Other Approaches for session tracking
• Cookies
• URL rewriting
• Hidden form fields
Filtering Merged Access
• Filtering the entries for embedded requests
Log Files
– Image, video and audio files
– HTML files within a frame
• Filtering robot entries
– Not human like trials
• Searching all the links in an HTML document
• Requests only for the text documents
– Analyzing user agent fields
• Tracing popular well-behaved robots
– Robot.txt
• Using a table of web pages
• Filtering consumes 80% of the effort in log analysis
Detecting Robots: http://www.cs.princeton.edu/~kyoungso/papers/robot-usenix.pdf, http://ieeexplore.ieee.org/iel5/7101/19134/00884534.pdf,
http://caltechlib.library.caltech.edu/73/01/Report-2004-NOV.pdf
Popular Robots: http://www.pgts.com.au/pgtsj/pgtsj0502d.html
Data
preparation

29
Pre-processing of web usage data

30
Data Preprocessing (1)
Data cleaning
– remove irrelevant references and fields in server logs
– remove references due to spider navigation
– remove erroneous references
– add missing references due to caching (done after sessionization)
Data integration
– synchronize data from multiple server logs
– Integrate semantics, e.g.,
• meta-data (e.g., content labels)
• e-commerce and application server data
– integrate demographic / registration data
Data Preprocessing (2)
Data Transformation
– user identification
– sessionization / episode identification
– pageview identification
• a pageview is a set of page files and associated objects that contribute to a single display in a Web
Browser
Data Reduction
– sampling and dimensionality reduction (ignoring certain pageviews /
items)
– Identifying User Transactions (i.e., sets or sequences of pageviews
possibly with associated weights)
Why sessionize?
• Quality of the patterns discovered depends on the quality of the data on
which mining is applied.
• In Web usage analysis, these data are the sessions of the site visitors: the
activities performed by a user from the moment she enters the site until
the moment she leaves it.
• Difficult to obtain reliable usage data due to proxy servers and
anonymizers, dynamic IP addresses, missing references due to caching,
and the inability of servers to distinguish among different visits.
• Cookies and embedded session IDs produce the most faithful
approximation of users and their visits, but are not used in every site, and
not accepted by every user.
• Therefore, heuristics are needed that can sessionize the available access
data.
Mechanisms for User Identification

Examples: page tags (use javascript), some browser plugins

Examples of “software agents“

Page tagging with Javascript: see also

http://www.bruceclay.com/analytics/disadvantages.htm
Sessionization strategies: Sessionization heuristics

These heuristics are quite accurate! (see Spiliopoulou et al., 2003)

Path Completion
• Refers to the problem of inferring missing user
references due to caching.
• Effective path completion requires extensive
knowledge of the link structure within the site
• Referrer information in server logs can also be
used in disambiguating the inferred paths.
• Problem gets much more complicated in frame-
based sites.
Why integrate semantics?
Basic idea: associate each requested page with one or more
domain concepts, to better understand the process of navigation
/ Web usage
Example: a shopping site
From ...
p3ee24304.dip.t-dialin.net - - [19/Mar/2002:12:03:51 +0100]
"GET /search.html?l=ostsee%20strand&syn=023785&ord=asc HTTP/1.0" 200 1759
p3ee24304.dip.t-dialin.net - - [19/Mar/2002:12:05:06 +0100]
"GET /search.html?l=ostsee%20strand&p=low&syn=023785&ord=desc HTTP/1.0" 200 8450
p3ee24304.dip.t-dialin.net - - [19/Mar/2002:12:06:41 +0100]
"GET /mlesen.html?Item=3456&syn=023785 HTTP/1.0" 200 3478
To ...
Refine search Choose item
Search by Look at individual product
Search by category Category+title
From URLs to topics / concepts: Basics of
semantic session modelling
– 1 request  1 concept or n concepts
– Concepts can concern content or service
– Concepts can be part of an ontology (simple case:
concept hierarchy)
– Session = set / sequence / tree / graph of requests
 also possible: n requests  1 concept
Resulting format: if the request is the instance
Usually flat file (format like Web server log) or database
Resulting format: If a session is the instance
– What features can a session have?
– Refer again to the example:
p3ee24304.dip.t-dialin.net - - [19/Mar/2002:12:03:51 +0100]
"GET /search.html?l=ostsee%20strand&syn=023785&ord=asc HTTP/1.0" 200 1759
p3ee24304.dip.t-dialin.net - - [19/Mar/2002:12:05:06 +0100]
"GET /search.html?l=ostsee%20strand&p=low&syn=023785&ord=desc HTTP/1.0" 200 8450
p3ee24304.dip.t-dialin.net - - [19/Mar/2002:12:06:41 +0100]
"GET /mlesen.html?Item=3456&syn=023785 HTTP/1.0" 200 3478,

Refine search Choose item

Search by category Search by Category+title Look at individual product
42
Week 9: Lecture4
USING THE WEB LOG: WEB USAGE MINING
We are going to learn
• Framework for analysing the Web Log
• Types of Analysis

44
Site Basic Framework for Web Log Data Analysis
Content
Web Usage and E-Business Analytics
Content Data Cleaning / Integrated Session Analysis /
Analysis Sessionization Sessionized Static Aggregation
Module Module Data

OLAP
Data E-Commerce Tools
Web/Application Integration Data Mart
Server Logs OLAP
Module Analysis

Data Cube

– Static Aggregation and Statistics

– Session Analysis
– OLAP
– Data Mining
Static Aggregation
Most common form of analysis.
(Reports)
Data aggregated by predetermined units such as days or sessions.
Advantages:
– Gives quick overview of how a site is being used.
– Minimal disk space or processing power required.
Drawbacks:
– No ability to “dig deeper” into the data.
Page Number of Average View Count
View Sessions per Session
Home Page 50,000 1.5
Catalog Ordering 500 1.1
Shopping Cart 9000 2.3
Session Analysis
Simplest form of analysis: examine individual or groups of server
sessions and e-commerce data.

Advantages:
– Gain insight into typical customer behaviors.
– Trace specific problems with the site.

Drawbacks:
– LOTS of data.
– Difficult to generalize.
Online Analytical Processing (OLAP)
Allows changes to aggregation level for multiple dimensions.
Generally associated with a Data Warehouse.
Advantages & Drawbacks
– Very flexible
– Requires significantly more resources than static reporting.
Page Number of Average View Count
View Sessions per Session
Kid's Stuff Products 2,000 5.9
Page Number of Average View Count
View Sessions per Session
Kid's Stuff Products
Electronics
Educational 63 2.3
Radio-Controlled 93 2.5
Web Log Analytics
• The measurement, collection, analysis and reporting of internet
data for purposes of understanding and optimizing web usage
• Tools
– Webalizer
Level of Processing
– Sawmill
Static Aggregation and Statistics
– WebTrends
Session Analysis
– AWStats
– WWWStat
– Apache Logs Viewer
– Google analytics
UCSF School of Medicine,
Office of the Dean, Information
Services Unit
Few Definitions
• Hits
– A request for a file from the web server. Available only in log analysis
• Page Views
– A request for a file whose type is defined as a page
• Visits/Sessions
– A series of requests from the same uniquely identified client with a set
timeout, often 30 minutes. A visit contains one or more page views
• Click Paths
– the sequence of hyperlinks one or more website visitors follows on a
given site

UCSF School of Medicine,

Office of the Dean, Information
Services Unit
Page Tagging

UCSF School of Medicine,

Office of the Dean, Information
Services Unit
Google

https://communicators.ucsf.edu/resources/files/web_analytics.ppt
What Numbers Say
• About Navigation
• About Content
• About Users

UCSF School of Medicine,

Office of the Dean, Information
Services Unit
Markov
Prediction of next event chains
Sequence
Discovery of associated events or mining
Going deeper
Data Mining:
application objects Association rules

Discovery of visitor groups with

common properties and interests Clustering

Discovery of visitor groups with Session

common behaviour Clustering

Characterization of visitors with

respect to a set of predefined classes Classification

Card fraud detection

Mining Navigation Patterns
• Each session induces a user trail through the site
• A trail is a sequence of web pages followed by a user
during a session, ordered by time of access.
• A pattern in this context is a frequent trail.
• Co-occurrence of web pages is important, e.g. shopping-
basket and checkout.
– Association rule mining
– Markov chain model.
Trails inferred from Log data
(Each session results in a trail)
ID Trail
1 A1 > A2 > A3 Association based
Approach
2 A1 > A2 > A3
3 A1 > A2 > A3 > A4
4 A5 > A2 > A4
5 A5 > A2 > A4 > A6
6 A5 > A2 > A3 > A6
Association Rule Mining-The Idea
Given a set of transactions, find rules that will predict the occurrence of
an item based on the occurrences of other items in the transaction

Market-Basket transactions
Example of Association Rules
TID Items
1 Bread, Milk {Diaper}  {Beer},
2 Bread, Diaper, Beer, Eggs {Milk, Bread}  {Eggs,Coke},
3 Milk, Diaper, Beer, Coke {Beer, Bread}  {Milk},
4 Bread, Milk, Diaper, Beer Implication means co-occurrence, not
5 Bread, Milk, Diaper, Coke causality!
Applications
• Pre-fetching and caching web pages
• Web site reorganisation
• Personalisation
• Recommendation of links and products
Applications
• Calibration of a Web server:
– Prediction of the next page invocation over a group of
concurrent Web users under certain constraints
• Sequence mining, Markov chains
• Cross-selling of products:
– Mapping of Web pages/objects to products
– Discovery of associated products
• Association rules, Sequence Mining
– Placement of associated products on the same page
Applications
Sophisticated cross-selling and up-selling of products:
– Mapping of pages/objects to products of different price groups
– Identification of Customer Groups
• Clustering, Classification
– Discovery of associated products of the same/different price
categories
• Association rules, Sequence Mining
– Formulation of recommendations to the end-user
• Suggestions on associated products
• Suggestions based on the preferences of similar users
Summary
• Web usage mining has emerged as the essential tool for
realizing more personalized, user-friendly and business-
optimal Web services.
• The key is to use the user-clickstream data for many mining
purposes.
• Traditionally, Web usage mining is used by e-commerce
sites to organize their sites and to increase profits.
• It is now also used by search engines to improve search
quality and to evaluate search results, etc, and by many
other applications.

62
63
Week 9: Lecture 5

USER BEHAVIOR MODELING FROM WEB LOG

We are going to learn
• A model of browsing behaviour
• Interpreting the model outcome

65
Probabilistic models of browsing behavior
• Useful to build models that describe the
browsing behavior of users
• Can generate insight into how users use the
website
• Provide mechanism for making predictions
• Can help in pre-fetching and personalization

66
Markov models for understanding user behavior
• General approach is to use a finite-state Markov chain
– Each state can be a specific Web page or a category of Web
pages
– If only interested in the order of visits (and not in time), each
new request can be modeled as a transition of states
• Issues
– Self-transition
– Time-independence

67
Discrete – Time Markov Chains
Many real-world systems contain uncertainty and
evolve over time.

Stochastic processes (and Markov chains)

are probability models for such systems.
A discrete-time stochastic process
is a sequence of random variables
X0, X1, X2, . . . typically denoted by { Xn }.
Modeling A Website
• State: A functional area in the website
– A page or a group of pages representing the functional area
• Two dummy states
– entry and exit
– Customer is assumed to stay in the entry state before entering
into the site
– Customer is assumed to stay in the exit state.
• Customer behavior model graph
– Static part
– Dynamic part
Building a User Behavior Graph
• Static Part
– Determine the set of functions provided by the e-
commerce site.
• States
• Group of web pages
– Determine all possible transitions between states
• From site layout
• Dynamic Part
– Transition probability matrix
– Average transition time matrix
Determining transition probability matrix
• Count frequency of transitions from one state to another
Browse 8

Add to cart 4 Select 1

Browse 7
• Calculate probability
Browse 8/20

Add to cart 4/20 Select 1/20

Browse 7/20
Customer’s think time
Client Server
t0 Reque
st for
Page A
t1
Customer’s

Time
A
t2 ends Page
Think time Ser ver S
t3
t4

t5 Reques
t for Pag
eB
t6
Finding average think time
• Total think time from all the visits from one
state to the other/frequency of visit
Browse 10
5
Add to cart 15 Select

Browse 12
Browsing Behaviour as a Markov Chain
Properties of the transition probability matrix of a CBMG
• pi1 = 0 2≤i≤n-1
– No transition can be made to the Entry state from any state
other than the Exit state.
• p1n = 0
– No transition can be made from the Entry state to the Exit state.
• pnj = 0 2≤j≤n-1
– No transition can be made from the Exit state to any state other
than the Entry state.
• pnn + pn1= 1
– A transition from the Exit state to itself or to the Entry state.
End of Week 9

Speedy Notes - Information Gathering
No ratings yet
Speedy Notes - Information Gathering
9 pages
Our Topic:: Web Usage Mining
No ratings yet
Our Topic:: Web Usage Mining
51 pages
Week5 - Internet and Web Based System
No ratings yet
Week5 - Internet and Web Based System
104 pages
Web Mining PPT 4121
No ratings yet
Web Mining PPT 4121
18 pages
A Study of Web Traffic Analysis
No ratings yet
A Study of Web Traffic Analysis
8 pages
Analysis of Web Server Log Files
No ratings yet
Analysis of Web Server Log Files
8 pages
Unit 10 - Week 9
No ratings yet
Unit 10 - Week 9
4 pages
Hands-On Ethical Hacking and Network Defense
No ratings yet
Hands-On Ethical Hacking and Network Defense
47 pages
Proc. US Nat'l Science Foundation Workshop On Next-Generation Data Mining (NGDM)
0% (1)
Proc. US Nat'l Science Foundation Workshop On Next-Generation Data Mining (NGDM)
1 page
Web Usage Mining Chris Yang3114
No ratings yet
Web Usage Mining Chris Yang3114
32 pages
An Optimized K-Harmonic Mean Based Clustering User Navigation Patterns
No ratings yet
An Optimized K-Harmonic Mean Based Clustering User Navigation Patterns
4 pages
IJEE Paper
No ratings yet
IJEE Paper
7 pages
Analysis of User Identification Methods in Web Usage Mining: Abstract
No ratings yet
Analysis of User Identification Methods in Web Usage Mining: Abstract
9 pages
Classn 439
No ratings yet
Classn 439
6 pages
9-Advanced Preprocessing Using Distinct User
No ratings yet
9-Advanced Preprocessing Using Distinct User
5 pages
Week 4
No ratings yet
Week 4
34 pages
Site Analytics in IBM WebSphere Portal
100% (1)
Site Analytics in IBM WebSphere Portal
11 pages
Web Analytics Tutorial
No ratings yet
Web Analytics Tutorial
29 pages
Unit 3
No ratings yet
Unit 3
38 pages
Enhancing Virtual Intranet Server Full Doc (Editing)
No ratings yet
Enhancing Virtual Intranet Server Full Doc (Editing)
56 pages
Footprinting
No ratings yet
Footprinting
43 pages
Verzeo Minor Project
No ratings yet
Verzeo Minor Project
25 pages
CEH IT Security Penetration Testing Step
67% (3)
CEH IT Security Penetration Testing Step
42 pages
Web Traffic Logs: An Imperfect Resource For Evaluation
No ratings yet
Web Traffic Logs: An Imperfect Resource For Evaluation
11 pages
Module - 2 (Footprinting)
No ratings yet
Module - 2 (Footprinting)
94 pages
21778-1 Information Gathering
No ratings yet
21778-1 Information Gathering
232 pages
Sigproc SP
No ratings yet
Sigproc SP
7 pages
Web Traffic Analysis
No ratings yet
Web Traffic Analysis
10 pages
Web Analytics Tutorial
No ratings yet
Web Analytics Tutorial
29 pages
Lecture 5
No ratings yet
Lecture 5
46 pages
User Profiling: Web Usage Mining
No ratings yet
User Profiling: Web Usage Mining
4 pages
Algorithm For Tracing Visitors' On-Line Behaviors
No ratings yet
Algorithm For Tracing Visitors' On-Line Behaviors
7 pages
Reconnaissance Process
No ratings yet
Reconnaissance Process
13 pages
Clustering and Classification
No ratings yet
Clustering and Classification
1 page
Unit 5 DM
No ratings yet
Unit 5 DM
61 pages
Module 2 Web Usage Mining
No ratings yet
Module 2 Web Usage Mining
34 pages
CH 01
No ratings yet
CH 01
41 pages
INE Web Application Penetration Testing Course File
No ratings yet
INE Web Application Penetration Testing Course File
70 pages
Web Analytics, Web Mining, and Social Analytics
No ratings yet
Web Analytics, Web Mining, and Social Analytics
53 pages
CH 01
No ratings yet
CH 01
41 pages
Analysis of Web Mining Types and Weblogs
No ratings yet
Analysis of Web Mining Types and Weblogs
4 pages
INE Web Application Penetration Testing Web Fingerprinting and Enumeration
No ratings yet
INE Web Application Penetration Testing Web Fingerprinting and Enumeration
60 pages
Collecting Information
No ratings yet
Collecting Information
37 pages
Ijca PDF
No ratings yet
Ijca PDF
9 pages
Unit 2
No ratings yet
Unit 2
49 pages
Internet Activity Analysis
No ratings yet
Internet Activity Analysis
31 pages
Google Hack Basic
No ratings yet
Google Hack Basic
57 pages
An Enhanced Pre-Processing Research Framework For Web Log Data
No ratings yet
An Enhanced Pre-Processing Research Framework For Web Log Data
7 pages
Hacking Steps
No ratings yet
Hacking Steps
29 pages
2nd Project Report Pse12april
No ratings yet
2nd Project Report Pse12april
11 pages
Phase 1
No ratings yet
Phase 1
27 pages
Math Fractions Brian Tracy
No ratings yet
Math Fractions Brian Tracy
49 pages
Reconnaissance
No ratings yet
Reconnaissance
11 pages
Aaaa Internet Services CH 2 Final 10d
No ratings yet
Aaaa Internet Services CH 2 Final 10d
79 pages
ECH Guide by Shibs 30-June-2022
100% (2)
ECH Guide by Shibs 30-June-2022
107 pages
Hacking Module 02
100% (1)
Hacking Module 02
28 pages
Unit 2
No ratings yet
Unit 2
71 pages
Lab 03: Part 1: Name: Lê Minh Hoàng Student ID: SE130425
No ratings yet
Lab 03: Part 1: Name: Lê Minh Hoàng Student ID: SE130425
18 pages
CS Lab Workbook Om Rawat 21csu335
No ratings yet
CS Lab Workbook Om Rawat 21csu335
36 pages

Week9 Decision Support in E-Business Web Analytics

Uploaded by

Week9 Decision Support in E-Business Web Analytics

Uploaded by

E-BUSINESS

PROF. MAMATA JENAMANI

DECISION SUPPORT CONCEPTS

Executive support 1980–1990 Focus on data analysis for decisions by senior

UNDERSTANDING THE WEB LOG

Date date The date that the activity occurred

UNDERSTANDING THE WEB LOG-II

Examples: page tags (use javascript), some browser plugins

Page tagging with Javascript: see also

These heuristics are quite accurate! (see Spiliopoulou et al., 2003)

Refine search Choose item

Site Map customers

– Static Aggregation and Statistics

UCSF School of Medicine,

UCSF School of Medicine,

UCSF School of Medicine,

Discovery of visitor groups with

Discovery of visitor groups with Session

Characterization of visitors with

Card fraud detection

USER BEHAVIOR MODELING FROM WEB LOG

Stochastic processes (and Markov chains)

Add to cart 4 Select 1

Add to cart 4/20 Select 1/20

You might also like