Web Mining For BI - Part 2
Web Mining For BI - Part 2
Usage
discovering
useful discovering understanding
information or and modeling user access
knowledge the hyperlink patterns from
from web page structure of the Web usage logs
contents web pages
Web Mining techniques
The crawler
◼ digs through individual web pages,
◼ pulls out keywords and then
◼ adds the pages to the search engine's database
Steps in Web Content Mining
Collect
• Fetch the content from the web
Parse
• Extract usable data from formatted data (HTML, PDF,)
Analyze
Produce
consists of
◼ Web pages as nodes, and
◼ hyperlinks as edges connecting between two related pages.
Web Structure Mining
Structure of the Web
Web graph
Web
Analytics
Voice of
Customer
Customer Experience
Management
Example: Amazon.com
All books on software at Amazon.com
(Database)
Example: Amazon.com
Book cover images on Amazon.com
(File Base)
Web Mining Tools
Product Name URL
Angoss Knowledge WebMiner angoss.com
ClickTracks clicktracks.com
LiveStats from DeepMetrix deepmetrix.com
Megaputer WebAnalyst megaputer.com
MicroStrategy Web Traffic Analysis microstrategy.com
SAS Web Analytics sas.com
SPSS Web Mining for Clementine spss.com
WebTrends webtrends.com
XML Miner scientio.com
Web Mining vs Text Mining vs Data Mining
Web mining
use of data mining techniques to automatically
discover and extract information from Web
documents and services
Text mining
Deal with textual data
Data mining
Text Web
Mining Mining
Data
Mining
Web Mining Process
46
END OF PART 2