0% found this document useful (0 votes)

77 views51 pages

1 DMiningKuliah 1 Introduction

Data mining involves analyzing large amounts of data to discover hidden patterns and relationships. It has the potential to help organizations understand their data better and make more informed decisions. The data mining process involves cleaning and preparing data, applying data mining algorithms to discover patterns, and evaluating and presenting the results. Common data mining techniques include classification, estimation, prediction, clustering, and association rule mining.

Uploaded by

Ricky Chandra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

77 views51 pages

1 DMiningKuliah 1 Introduction

Uploaded by

Ricky Chandra

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 51

Data Mining

Data Mining 1
Introduction
■ Why data mining?

■ What is Data Mining / Knowledge Data Discovery?

■ Origins of Data Mining

■ Potential Applications

■ Data Mining: On what kind of data?

■ Data Mining Functionalities

■ OLAP Mining System

Data Mining 2
Why Data Mining:
Trends leading to Data Flood
More data is generated:
■ Bank, telecom, other
business transactions
...
■ Scientific data:
astronomy, biology, etc
■ Web, text, and
e-commerce

Data Mining 3
Scale Of Data

Data Mining 4
Data Growth Rate
■ Twice as much information was created in
2002 as in 1999 (~30% growth rate)
■ Other growth rate estimates even higher
■ And THE PROBLEM IS:
■ Very little data will ever be looked at by a
human
■ We are drowning in data, but starving for
knowledge
■ Knowledge Discovery is NEEDED to make
sense and use of data.
Data Mining 5
Why Mine Data?
■ There is often information “hidden” in the data that is not readily
evident
■ Human analysts may take weeks to discover useful information
■ Much of the data is never analyzed at all

Data Mining 6
Why Mine Data?

Data Mining 7
What Is Data Mining:
Many Names of Data Mining
■ Data Fishing, Data Dredging: 1960-
■ used by Statistician (as a bad name)
■ Data Mining :1990-
■ used DB, business
■ in 2003 – bad image because of TIA
■ Knowledge Discovery in Databases: 1989-
■ used by AI, Machine Learning Community
■ also Data Archaeology, Information Harvesting,
Information Discovery, Knowledge Extraction, ...

Currently: Data Mining and Knowledge Discovery in

Databases (KDD) are used interchangeably
Data Mining 8
Knowledge Data Discovery (KDD)
■ Knowledge Discovery in Data
is the non-trivial process of
identifying
■ valid
■ novel
■ potentially useful
■ and ultimately
understandable patterns in
data.
from Advances in Knowledge Discovery and Data
Mining, Fayyad, Piatetsky-Shapiro, Smyth, and
Uthurusamy, (Chapter 1), AAAI/MIT Press 1996
Data Mining 9
What is (not) Data Mining?
What is not Data What is Data Mining?
Mining?
– Look up phone number – Certain names are more
in phone directory prevalent in certain US
locations (O’Brien,
O’Rurke, O’Reilly… in
Boston area)
– Query a Web search – Group together similar
engine for information documents returned by
about “Amazon” search engine according to
their context (e.g. Amazon
rainforest, Amazon.com,
etc)
Data Mining 10
Origins of Data Mining
■ Draws ideas from
machine
learning/AI,
pattern
recognition,
statistics,
and
database
systems

Data Mining 11
Data Mining: Confluence of Multiple Disciplines

Database
Statistics
Technology

Machine
Learning
Data Mining Visualization

Information Other
Science Disciplines
Data Mining 12
What is Data Mining: A KDD Process

Data mining: the core of

Knowledge Data Discovery
process. Pattern Evaluation

Data Mining
Task-relevant
Data
Selection
Data
Warehouse
Data
Cleaning

Data Integration
Databases
Data Mining 13
Steps of a KDD Process
1. Learning the application domain
■ relevant prior knowledge and goals of application
2. Creating a target data set → data selection
3. Data cleaning and preprocessing (may take 60% of effort!)
4. Data reduction and transformation
■ Find useful features, dimensionality/variable reduction,
invariant representation.
5. Choosing functions of data mining
■ summarization, classification, regression, association,
clustering.
6. Choosing the mining algorithm(s)
7. Data mining → search for patterns of interest
8. Pattern evaluation and knowledge presentation
■ visualization, transformation, removing redundant patterns,
etc.
9. Use of discovered knowledge
Data Mining 14
Data Mining and Business Intelligence
Increasing potential
to support
business decisions End User
Making
Decisions

Data Presentation Business

Analyst
Visualization Techniques
Data Mining
Information Discovery
Data
Data Exploration Analyst
Statistical Analysis, Querying and Reporting
Data Warehouses / Data Marts
OLAP, MDA
DBA
Data Sources
Paper, Files, Information Providers,
Data Mining Database Systems, OLTP 15
Architecture of a Typical Data
Mining System
Graphical user interface

Pattern evaluation

Data mining engine

(Database / data Knowledge-base

warehouse) server
Data cleaning & data integration Filtering

Data
Databases Warehouse
Data Mining 16
What Tasks Can Data Mining
Accomplish?

The most common data mining tasks.

■ Description
■ Classification
■ Estimation
■ Prediction
■ Clustering
■ Association

Data Mining 17
Task 1: Description
■ Find ways to describe patterns and trends
lying within data.
■ For example:
■ A pollster can uncover evidence that those who
have been laid off are less likely to support the
present incumbent in the presidential election.
■ From descriptions of patterns and trends we knew
that they are now less well off financially than
before the incumbent was elected, and so would
tend to prefer an alternative.

Data Mining 18
Task 1: Description
■ The models should be as transparent as
possible.
■ High-quality description can often be
accomplished by exploratory data
analysis , a graphical method of
exploring data in search of patterns and
trends.

Data Mining 19
Task 2: Classification
The data mining model examines a large set of records, each record
containing information on the target variable as well as a set of
input or predictor variables.
■ For example, consider the excerpt data set.

■ After “learns” the data, the algorithm can classify new records,
for which no information about income bracket is available.

Data Mining 20
Task 2: Classification
Examples of classification tasks in business and research include:
■ Determining whether a particular credit card transaction is
fraudulent
■ Placing a new student into a particular track with regard to
special needs
■ Assessing whether a mortgage application is a good or bad
credit risk
■ Diagnosing whether a particular disease is present
■ Determining whether a will was written by the actual deceased,
or fraudulently by someone else
■ Classifying type of drug a patient should be prescribed, based
on certain patient characteristics.
■ Etc.

Data Mining 21
Task 2: Classification
■ Common data mining methods
used for classification are:
■ k -nearest neighbor
■ decision tree
■ neural network

Data Mining 22
Task 3: Estimation
■ Similar to classification except that the target
variable is numerical rather than categorical.
■ Models are built using “complete ” records,
which provide the value of the target variable
as well as the predictors.
■ Then, for new observations, estimates of the
value of the target variable are made, based
on the values of the predictors.

Data Mining 23
Task 3: Estimation
Examples of estimation tasks in business and research include:
■ Estimating the amount of money a randomly chosen family of
four will spend for back-to-school shopping this fall.
■ Estimating the percentage decrease in rotary-movement
sustained by a National Football League running back with a
knee injury.
■ Estimating the number of points per game that Patrick Ewing will
score when double-teamed in the playoffs.
■ Estimating the grade-point average (GPA) of a graduate student,
based on that student ’s undergraduate GPA.
■ Estimating person yearly incomes based on the description and
personal data, ie: age, jobs, home addresses, etc.
■ Etc.

Data Mining 24
Task 3: Estimation
■ Common data mining methods used for
estimation are:
■ Statistical analysis:
■ Point estimation
■ Confidence interval estimations
■ Simple linear regression
■ Multiple regression
■ Correlation
■ Neural networks

Data Mining 25
Task 4: Prediction
Similar to classification and estimation, except that for
prediction, the results lie in the future.
■ For example, predicting the price of a stock three
months in the future.

Data Mining 26
Task 4: Prediction
Examples of prediction tasks in business and research
include:
■ Predicting the price of a stock three months into the
future
■ Predicting the percentage increase in traffic deaths
next year if the speed limit is increased
■ Predicting the winner of this fall’s baseball World
Series, based on a comparison of team statistics
■ Predicting whether a particular molecule in drug
discovery will lead to a profitable new drug for a
pharmaceutical company
Data Mining 27
Task 4: Prediction
■ Any of the methods and techniques
used for classification and estimation
may also be used for prediction. These
include:
■ Statistical methods
■ Neural Networks
■ Decision tree
■ k-nearest neighbor
Data Mining 28
Task 5: Clustering
■ Grouping of records, observations, or cases into
classes of similar objects.
■ A cluster is a collection of records that are similar to
one another, and dissimilar to records in other
clusters.
■ The clustering task does not try to classify, estimate,
or predict the value of a target variable.
■ It seek to segment the entire data set into relatively
homogeneous subgroups or clusters.

Data Mining 29
Task 5: Clustering
■ For Example, PRIZM segmentation system, which
describes every U.S. zip code area in terms of
distinct lifestyle types.
■ For illustration, the clusters for zip code 90210,
Beverly Hills, California, are:
■ Cluster 01: Blue Blood Estates
■ Cluster 10: Bohemian Mix
■ Cluster 02: Winner ’s Circle
■ Cluster 07: Money and Brains
■ Cluster 08: Young Literati

Data Mining 30
Task 5: Clustering
Examples of clustering tasks in business and research
include:
■ Target marketing of a niche product for a
small-capitalization business that does not have a large
marketing budget
■ For accounting auditing purposes, to segment financial
behavior into benign and suspicious categories
■ As a dimension-reduction tool when the data set has
hundreds of attributes
■ For gene expression clustering, where very large
quantities of genes may exhibit similar behavior

Data Mining 31
Task 5: Clustering
Common data mining methods used for
clustering are:
■ Hierarchical clustering (AgNes, DiAna, etc)
■ Partitional clustering (K–means, PAM, etc)
■ DB-Scan
■ Kohonen networks

Data Mining 32
Task 6: Association
■ Finding which attributes “go together. ”
■ Most prevalent in the business world.
■ It is known as affinity analysis or
market basket analysis
■ The task of association seeks to
uncover rules for quantifying the
relationship between two or more
attributes.
Data Mining 33
Task 6: Association
■ For example, a particular supermarket may
find that of the 1000 customers shopping on a
Thursday night, 200 bought diapers, and of
those 200 who bought diapers, 50 bought
beer.
■ Thus, the association rule would be “If buy
diapers, then buy beer” with a support of
200/1000 = 20% and a confidence of 50/200
= 25%.

Data Mining 34
Task 6: Association
Examples of association tasks in business and research
include:
■ Examining the proportion of children whose parents read to
them who are themselves good readers
■ Predicting degradation in telecommunications networks
■ Finding out which items in a supermarket are purchased
together and which items are never purchased together
■ Determining the proportion of cases in which a new drug
will exhibit dangerous side effects
■ Cross-selling analysis of the products.
■ Optimize the performance of online banner advertisement,
which presents discount offers on various investment
products
Data Mining 35
Task 6: Association
Common data mining methods used for
association are:
■ Apriori Algorithm
■ FP-Tree
■ Generalized Rule Induction Method
■ Etc.

Data Mining 36
Potential Applications
■ Database analysis and decision support
■ Market analysis and management
■ target marketing, customer relation management, market
basket analysis, cross selling, market segmentation
■ Risk analysis and management
■ Forecasting, customer retention, improved underwriting,
quality control, competitive analysis
■ Fraud detection and management
■ Other Applications
■ Text mining (news, email, documents) and Web analysis.
■ Intelligent query answering

Data Mining 37
Market Analysis and Management (1)
■ The Data Sources
■ Sales transactions, credit card transactions, loyalty cards,
discount coupons, customer complaint calls, plus (public)
lifestyle studies
■ Target marketing
■ Find clusters of “model” customers who share the same
characteristics: interest, income level, spending habits, etc.
■ Determine customer purchasing patterns over time
■ Conversion of single to a joint bank account: marriage, etc.
■ Cross-market analysis
■ Associations/co-relations between product sales
■ Prediction based on the association information
Data Mining 38
Market Analysis and Management (2)
■ Customer profiling
■ data mining can tell you what types of customers buy what
products (clustering or classification)

■ Identifying customer requirements

■ identifying the best products for different customers
■ use prediction to find what factors will attract new customers
■ Provides summary information
■ various multidimensional summary reports
■ statistical summary information (data central tendency and
variation)
Data Mining 39
Corporate Analysis and Risk Management
■ Finance planning and asset evaluation:
■ cash flow analysis and prediction
■ claim analysis to evaluate assets
■ cross-sectional and time series analysis (financial-ratio, trend
analysis, etc.)
■ Resource planning:
■ summarize and compare the resources and spending
■ Competition:
■ monitor competitors and market directions
■ group customers into classes and a class-based pricing
procedure
■ set pricing strategy in a highly competitive market

Data Mining 40
Successful e-commerce – Case Study

Data Mining 41
Fraud Detection and Management (1)
■ Applications
■ widely used in health care, retail, credit card services,
telecommunications (phone card fraud), etc.
■ Approach
■ use historical data to build models of fraudulent behavior and
use data mining to help identify similar instances
■ Examples
■ auto insurance: detect a group of people who stage accidents
to collect on insurance
■ money laundering: detect suspicious money transactions (US
Treasury's Financial Crimes Enforcement Network)
■ medical insurance: detect professional patients and ring of
doctors and ring of references
Data Mining 42
Fraud Detection and Management (2)
■ Detecting inappropriate medical treatment
■ Australian Health Insurance Commission identifies that in many
cases blanket screening tests were requested (save Australian
$1m/yr).
■ Detecting telephone fraud
■ Telephone call model: destination of the call, duration, time of
day or week. Analyze patterns that deviate from an expected
norm.
■ British Telecom identified discrete groups of callers with
frequent intra-group calls, especially mobile phones, and broke
a multimillion dollar fraud.
■ Retail
■ Analysts estimate that 38% of retail shrink is due to dishonest
employees.
Data Mining 43
Other Applications
■ Sports
■ IBM Advanced Scout analyzed NBA game statistics (shots blocked,
assists, and fouls) to gain competitive advantage for New York Knicks
and Miami Heat
■ Astronomy
■ JPL and the Palomar Observatory discovered 22 quasars with the help
of data mining
■ Internet Web Surf-Aid
■ IBM Surf-Aid applies data mining algorithms to Web access logs for
market-related pages to discover customer preference and behavior
pages, analyzing effectiveness of Web marketing, improving Web site
organization, etc.
■ Detecting diseases, pendemic, epidemic, plagues spreading.
Data Mining 44
Data Mining: On What Kind of Data?
■ Relational databases
■ Data warehouses
■ Transactional databases
■ Advanced DB and information repositories
■ Object-oriented and object-relational databases
■ Spatial databases
■ Time-series data and temporal data
■ Text databases and multimedia databases
■ Heterogeneous and legacy databases
■ WWW
Data Mining 45
Data Mining Functionalities (1)

■ Concept description
■ Generalize, summarize, and contrast data characteristics, e.g.,
dry vs. wet regions

■ Association (correlation and causality)

■ Multi-dimensional vs. single-dimensional association
■ buys(x, "diapers") 🡪 buys(x, "beer") [0.5%, 60%]
■ age(X, “20..29”) ^ income(X, “20..29K”) 🡪 buys(X, “PC”) [support
= 2%, confidence = 60%]
■ contains(T, “computer”) 🡪 contains(x, “software”) [1%, 75%]

Data Mining 46
Data Mining Functionalities (2)
■ Classification and Prediction
■ Finding models (functions) that describe and distinguish classes
or concepts for future prediction
■ E.g., classify countries based on climate, or classify cars based

on gas mileage
■ Presentation: decision-tree, classification rule, neural network
■ Prediction: Predict some unknown or missing numerical values
■ Cluster analysis
■ Class label is unknown: Group data to form new classes, e.g.,
cluster houses to find distribution patterns
■ Clustering based on the principle: maximizing the intra-class
similarity and minimizing the interclass similarity
Data Mining 47
Data Mining Functionalities (3)
■ Outlier / Anomaly analysis
■ Outlier: a data object that does not comply with the general behavior of
the data
■ It can be considered as noise or exception but is quite useful in fraud
detection, rare events analysis
■ Trend and evolution analysis
■ Trend and deviation: regression analysis
■ Sequential pattern mining, periodicity analysis
■ Similarity-based analysis
■ Other pattern-directed or statistical analyses

Data Mining 48
OLAP Mining: An Integration of Data
Mining and Data Warehousing
■ Data mining systems, DBMS, Data warehouse
systems coupling
■ On-line analytical mining data
■ integration of mining and OLAP technologies
■ Interactive mining multi-level knowledge
■ Necessity of mining knowledge and patterns at different levels
of abstraction by drilling/rolling, pivoting, slicing/dicing, etc.
■ Integration of multiple mining functions
■ Characterized classification, first clustering and then
association
Data Mining 49
An OLAM Architecture
Mining query Mining result Layer4
User Interface
User GUI API
Layer3
OLAM OLAP
Engine Engine OLAP/OLAM

Data Cube API

Layer2
MDDB
MDDB
Meta Data

Filtering&Integration Database API Filtering

Layer1
Data cleaning Data
Databases Data
Data integration Warehouse
Data Mining 50
Repository
Thanks

Data Mining 51

Martin Luther's Legacy: Reforming Reformation Theology For The 21st Century
100% (8)
Martin Luther's Legacy: Reforming Reformation Theology For The 21st Century
369 pages
BKI - Vol 2 - Rules For Hull
67% (3)
BKI - Vol 2 - Rules For Hull
355 pages
Data Mining
No ratings yet
Data Mining
254 pages
Data Mining Merged PDF CS1 CS8
No ratings yet
Data Mining Merged PDF CS1 CS8
272 pages
The Drug That Obliterates 97% of Delhi Covid Cases Is IVERMECTIN
100% (1)
The Drug That Obliterates 97% of Delhi Covid Cases Is IVERMECTIN
10 pages
Surveys (Tunneling)
No ratings yet
Surveys (Tunneling)
66 pages
Handout - Chaldean Oracles, Divination and Theurgy
100% (1)
Handout - Chaldean Oracles, Divination and Theurgy
5 pages
DM - Unit I-Updated
No ratings yet
DM - Unit I-Updated
65 pages
Science Force and Friction Grade 5
No ratings yet
Science Force and Friction Grade 5
28 pages
Class Notes 1-5
No ratings yet
Class Notes 1-5
51 pages
Ma Theses-The Effectiveness of Project-Based Learning On Students Achievement and Motivation
No ratings yet
Ma Theses-The Effectiveness of Project-Based Learning On Students Achievement and Motivation
155 pages
MVH3K Datasheet ENG PDF
No ratings yet
MVH3K Datasheet ENG PDF
3 pages
02 DM BI Data Mining
No ratings yet
02 DM BI Data Mining
66 pages
DB 14
No ratings yet
DB 14
97 pages
Data Mining for Beginners: A Programmer’s Guide
From Everand
Data Mining for Beginners: A Programmer’s Guide
Agasti Khatri
No ratings yet
Data Mining and Its Branches
No ratings yet
Data Mining and Its Branches
37 pages
21 Reasons Kettlebells PDF
No ratings yet
21 Reasons Kettlebells PDF
4 pages
Tum Dersler Veri Madenciligi
No ratings yet
Tum Dersler Veri Madenciligi
123 pages
Inf 444e - Datamining N Advanced Databases Introduction 2019
No ratings yet
Inf 444e - Datamining N Advanced Databases Introduction 2019
32 pages
Unit 1
No ratings yet
Unit 1
59 pages
Lec 2-Week 1 - (Design of Sewer System)
No ratings yet
Lec 2-Week 1 - (Design of Sewer System)
19 pages
02-Introduction To Data Mining
No ratings yet
02-Introduction To Data Mining
40 pages
Lecture 2
No ratings yet
Lecture 2
66 pages
Data Mining
No ratings yet
Data Mining
88 pages
Datamining&warehousing
No ratings yet
Datamining&warehousing
65 pages
Physics FYUGP
No ratings yet
Physics FYUGP
57 pages
Lecture 1.1.1 1.1.2
No ratings yet
Lecture 1.1.1 1.1.2
32 pages
Data Mining Nostos
100% (1)
Data Mining Nostos
39 pages
1 - Introduction To DM
No ratings yet
1 - Introduction To DM
59 pages
Data Mining Mids
No ratings yet
Data Mining Mids
24 pages
Lecture 1428550844
No ratings yet
Lecture 1428550844
87 pages
Ce-1254 - Surveying Ii
No ratings yet
Ce-1254 - Surveying Ii
9 pages
DWDM LS1 Fall 24 25
No ratings yet
DWDM LS1 Fall 24 25
42 pages
DMiningKuliah 1 Introduction
No ratings yet
DMiningKuliah 1 Introduction
41 pages
Data Preprocessing: Data Cleaning Data Integration and Transformation
No ratings yet
Data Preprocessing: Data Cleaning Data Integration and Transformation
41 pages
System and Network Administration Assignment
No ratings yet
System and Network Administration Assignment
64 pages
Introduction
No ratings yet
Introduction
26 pages
01 Intro 1
No ratings yet
01 Intro 1
33 pages
Module 1
No ratings yet
Module 1
40 pages
362WH14C0
No ratings yet
362WH14C0
77 pages
TPO 57 Listening
No ratings yet
TPO 57 Listening
11 pages
Lecture 1-Introduction To Data Mining - M
No ratings yet
Lecture 1-Introduction To Data Mining - M
38 pages
01 Intro
No ratings yet
01 Intro
40 pages
Data Analysis-2
No ratings yet
Data Analysis-2
41 pages
Prameet (12a) (5728)
No ratings yet
Prameet (12a) (5728)
33 pages
Classification: Decision Tree Hunt's Algorithm ID3 Rule Based Classifier C4.5
No ratings yet
Classification: Decision Tree Hunt's Algorithm ID3 Rule Based Classifier C4.5
45 pages
01 Intro
No ratings yet
01 Intro
28 pages
Top Bar Beekeeping (Text)
No ratings yet
Top Bar Beekeeping (Text)
5 pages
Kruthika CV
No ratings yet
Kruthika CV
4 pages
Introduction Lecture1gghhhhh
No ratings yet
Introduction Lecture1gghhhhh
23 pages
Lecture 01 11jan
No ratings yet
Lecture 01 11jan
29 pages
Lecture 1-Introduction To Data Mining - M
No ratings yet
Lecture 1-Introduction To Data Mining - M
38 pages
01 Intro
No ratings yet
01 Intro
29 pages
Introduction To Data Mining
No ratings yet
Introduction To Data Mining
44 pages
Instructions: Meet DRU - The World's First Pizza Delivery Robot!
No ratings yet
Instructions: Meet DRU - The World's First Pizza Delivery Robot!
9 pages
01 - Data Mining Introduction
No ratings yet
01 - Data Mining Introduction
21 pages
CSM6404 DM L1
No ratings yet
CSM6404 DM L1
29 pages
01 - Introduction To Datamining
No ratings yet
01 - Introduction To Datamining
19 pages
Unit 1: Data Warehousing & Data Mining
No ratings yet
Unit 1: Data Warehousing & Data Mining
54 pages
Data Mining: July 18, 2019 1
No ratings yet
Data Mining: July 18, 2019 1
41 pages
01 Introduction
No ratings yet
01 Introduction
36 pages
Unit I
No ratings yet
Unit I
19 pages
Unit 3
No ratings yet
Unit 3
23 pages
01-Introduction To Data Mining
No ratings yet
01-Introduction To Data Mining
43 pages
Data Mining Unit 1
No ratings yet
Data Mining Unit 1
13 pages
Introduction To Data Mining-Week1
No ratings yet
Introduction To Data Mining-Week1
43 pages
Chapter - 1
No ratings yet
Chapter - 1
22 pages
Lecture 2 Data Mining Functions
No ratings yet
Lecture 2 Data Mining Functions
40 pages
Data Mining: Concepts and Techniques: - Chapter 1
No ratings yet
Data Mining: Concepts and Techniques: - Chapter 1
37 pages
IS414: Data Mining: DR - Waleed M.Ead
No ratings yet
IS414: Data Mining: DR - Waleed M.Ead
36 pages
GCRG International Conference
No ratings yet
GCRG International Conference
5 pages
Data Mining:: Concepts and Techniques
No ratings yet
Data Mining:: Concepts and Techniques
28 pages
Shutdown Isolation Procedures
No ratings yet
Shutdown Isolation Procedures
3 pages
Data Mining Concepts
No ratings yet
Data Mining Concepts
35 pages
Chapter 6 Data Mining
No ratings yet
Chapter 6 Data Mining
39 pages
Fundamentals of Data Mining: Dr. Jasim Saeed Jasim - Saeed@riphah - Edu.pk
No ratings yet
Fundamentals of Data Mining: Dr. Jasim Saeed Jasim - Saeed@riphah - Edu.pk
15 pages
Assignment Submission by Ahumuza Ivan
No ratings yet
Assignment Submission by Ahumuza Ivan
3 pages
Introduction To Data Mining & Business Intelligence
No ratings yet
Introduction To Data Mining & Business Intelligence
25 pages
Archlinux - Grub
No ratings yet
Archlinux - Grub
15 pages
2 DMiningKuliah 2A DPreparation
No ratings yet
2 DMiningKuliah 2A DPreparation
32 pages
Python Class 11 Test Gen 002
No ratings yet
Python Class 11 Test Gen 002
6 pages
Lecture 1
No ratings yet
Lecture 1
17 pages
1 Intro
No ratings yet
1 Intro
33 pages
01 Intro
No ratings yet
01 Intro
23 pages
Classification: Rule Based Classification 0R Holte 1R Holte Decision Tree
No ratings yet
Classification: Rule Based Classification 0R Holte 1R Holte Decision Tree
24 pages
DM Introduction-SSM
No ratings yet
DM Introduction-SSM
6 pages
Tax Quizzer
No ratings yet
Tax Quizzer
3 pages
Introduction To Data Mining: Dr. Dipti Chauhan Assistant Professor SCSIT, SUAS Indore
No ratings yet
Introduction To Data Mining: Dr. Dipti Chauhan Assistant Professor SCSIT, SUAS Indore
16 pages
Essay Structure and Paragraphing
No ratings yet
Essay Structure and Paragraphing
3 pages
Brand Personality
No ratings yet
Brand Personality
3 pages
Advantage of Using Virtual Reality
No ratings yet
Advantage of Using Virtual Reality
3 pages
Deloitte Mergers Aquisitons Tax
No ratings yet
Deloitte Mergers Aquisitons Tax
1 page
Miraña Genus Aeromonas
No ratings yet
Miraña Genus Aeromonas
1 page
Big Data: Statistics, Data Mining, Analytics, And Pattern Learning
From Everand
Big Data: Statistics, Data Mining, Analytics, And Pattern Learning
Rob Botwright
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet

1 DMiningKuliah 1 Introduction

Uploaded by

1 DMiningKuliah 1 Introduction

Uploaded by

Data Mining

■ What is Data Mining / Knowledge Data Discovery?

■ Origins of Data Mining

■ Data Mining: On what kind of data?

■ Data Mining Functionalities

■ OLAP Mining System

Currently: Data Mining and Knowledge Discovery in

Data mining: the core of

Data Presentation Business

Data mining engine

(Database / data Knowledge-base

The most common data mining tasks.

■ Identifying customer requirements

■ Association (correlation and causality)

Data Cube API

Filtering&Integration Database API Filtering

You might also like