3analysing Important Trend

Chapter 13 of 'Data Mining: Concepts and Techniques' discusses trends and research frontiers in data mining, including methodologies for mining complex data types such as sequence, graph, and multimedia data. It highlights various applications across industries like finance, retail, and science, emphasizing the importance of statistical methods and visualization techniques in data mining. The chapter also addresses the societal implications and evolving trends in the field.

Uploaded by

Kuruthum Katongo

Data Mining: Concepts and Techniques (3rd ed.)

— Chapter 13 —

Jiawei Han, Micheline Kamber, and Jian Pei

University of Illinois at Urbana-Champaign &
Simon Fraser University
©2011 Han, Kamber & Pei. All rights reserved.
Chapter 13: Data Mining Trends and Research Frontiers
 Mining Complex Types of Data
 Other Methodologies of Data Mining
 Data Mining Applications
 Data Mining and Society
 Data Mining Trends
 Summary
Mining Complex Types of Data
 Mining Sequence Data
   Mining Time Series
   Mining Symbolic Sequences
   Mining Biological Sequences
 Mining Graphs and Networks
 Mining Other Kinds of Data
Mining Sequence Data
 Similarity Search in Time Series Data
   Subsequence match, dimensionality reduction, query-based similarity search, motif-based similarity search
 Regression and Trend Analysis in Time-Series Data
   Long-term + cyclic + seasonal variation + random movements
 Sequential Pattern Mining in Symbolic Sequences
   GSP, PrefixSpan, constraint-based sequential pattern mining
 Sequence Classification
   Feature-based vs. sequence-distance-based vs. model-based
 Alignment of Biological Sequences
   Pair-wise vs. multi-sequence alignment, substitution matrices, BLAST
 Hidden Markov Model for Biological Sequence Analysis
   Markov chain vs. hidden Markov models, forward vs. Viterbi vs. Baum-Welch algorithms
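To make the Viterbi item concrete, here is a minimal sketch of Viterbi decoding for a hidden Markov model. The two-state model (H for a GC-rich region, L for a GC-poor region) and all of its probabilities are hypothetical values chosen for illustration; they do not come from the slides.

```python
def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (probability, path) of the most likely hidden-state sequence."""
    # best[t][s] = (probability of the best path ending in s at step t, previous state)
    best = [{s: (start_p[s] * emit_p[s][observations[0]], None) for s in states}]
    for obs in observations[1:]:
        prev = best[-1]
        column = {}
        for s in states:
            # Pick the predecessor r that maximizes the path probability into s.
            column[s] = max(
                (prev[r][0] * trans_p[r][s] * emit_p[s][obs], r) for r in states
            )
        best.append(column)
    # Trace back from the most probable final state.
    last = max(states, key=lambda s: best[-1][s][0])
    prob = best[-1][last][0]
    path = [last]
    for column in reversed(best[1:]):
        path.append(column[path[-1]][1])
    path.reverse()
    return prob, path


# Hypothetical toy model: H = GC-rich region, L = GC-poor region.
states = ("H", "L")
start_p = {"H": 0.5, "L": 0.5}
trans_p = {"H": {"H": 0.5, "L": 0.5}, "L": {"H": 0.4, "L": 0.6}}
emit_p = {
    "H": {"A": 0.2, "C": 0.3, "G": 0.3, "T": 0.2},
    "L": {"A": 0.3, "C": 0.2, "G": 0.2, "T": 0.3},
}

prob, path = viterbi("GGCA", states, start_p, trans_p, emit_p)
print(path, prob)
```

The forward algorithm has the same structure with `max` replaced by a sum over predecessors; Baum-Welch uses such passes to re-estimate the model parameters.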
Mining Graphs and Networks
 Graph Pattern Mining
   Frequent subgraph patterns, closed graph patterns, gSpan vs. CloseGraph
 Statistical Modeling of Networks
   Small world phenomenon, power law (long-tail) distribution, densification
 Clustering and Classification of Graphs and Homogeneous Networks
   Clustering: Fast Modularity vs. SCAN
   Classification: model vs. pattern-based mining
 Clustering, Ranking and Classification of Heterogeneous Networks
   RankClus, RankClass, and meta path-based, user-guided methodology
 Role Discovery and Link Prediction in Information Networks
   PathPredict
 Similarity Search and OLAP in Information Networks: PathSim, GraphCube
 Evolution of Social and Information Networks: EvoNetClus
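As an illustration of meta path-based similarity search, the sketch below computes a PathSim-style score under a symmetric meta path such as author-venue-author: twice the number of path instances between x and y, divided by the number of path instances from x to itself plus those from y to itself. The author names and publication counts are made up for the example.

```python
def pathsim(counts, x, y):
    """PathSim-style similarity under the symmetric meta path object-feature-object.

    counts[obj][feature] is the number of links between obj and feature,
    e.g. the number of papers an author published in a venue.
    """
    def path_count(a, b):
        # Number of path instances a -> feature -> b.
        return sum(counts[a].get(f, 0) * counts[b].get(f, 0) for f in counts[a])

    denom = path_count(x, x) + path_count(y, y)
    return 2 * path_count(x, y) / denom if denom else 0.0


# Hypothetical author-venue publication counts.
pubs = {
    "ann": {"KDD": 2, "ICDM": 1},
    "bob": {"KDD": 2, "ICDM": 1},
    "cat": {"KDD": 20, "ICDM": 10},
}

print(pathsim(pubs, "ann", "bob"))  # identical profiles
print(pathsim(pubs, "ann", "cat"))  # same venues, very different volume
```

The normalization favors peers with comparable visibility: "ann" scores far higher against "bob" than against the much more prolific "cat", although all three publish in the same venues.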
Mining Other Kinds of Data
 Mining Spatial Data
   Spatial frequent/co-located patterns, spatial clustering and classification
 Mining Spatiotemporal and Moving Object Data
   Spatiotemporal data mining, trajectory mining, periodica, swarm, …
 Mining Cyber-Physical System Data
   Applications: healthcare, air-traffic control, flood simulation
 Mining Multimedia Data
   Social media data, geo-tagged spatial clustering, periodicity analysis, …
 Mining Text Data
   Topic modeling, i-topic model, integration with geo- and networked data
 Mining Web Data
   Web content, web structure, and web usage mining
 Mining Data Streams
   Dynamics, one-pass, patterns, clustering, classification, outlier detection
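The "one-pass" constraint on data streams means each item is seen once and cannot be stored. As a small sketch of what that looks like in practice, the class below maintains a running mean and variance with Welford's update; the class name and the sample values are illustrative assumptions, not part of the slides.

```python
class RunningStats:
    """One-pass (streaming) mean and sample variance via Welford's update."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self):
        return self._m2 / (self.n - 1) if self.n > 1 else 0.0


stats = RunningStats()
for value in [2, 4, 4, 4, 5, 5, 7, 9]:  # stands in for an unbounded stream
    stats.update(value)
print(stats.mean, stats.variance)
```

Memory use is constant regardless of stream length, which is the property stream clustering, classification, and outlier detection methods also have to preserve.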

Other Methodologies of Data Mining
 Statistical Data Mining
 Views on Data Mining Foundations
 Visual and Audio Data Mining
Major Statistical Data Mining Methods
 Regression
 Generalized Linear Model
 Analysis of Variance
 Mixed-Effect Models
 Factor Analysis
 Discriminant Analysis
 Survival Analysis
Statistical Data Mining (1)
 There are many well-established statistical techniques for data analysis, particularly for numeric data
   Applied extensively to data from scientific experiments and data from economics and the social sciences
 Regression
   Predict the value of a response (dependent) variable from one or more predictor (independent) variables, where the variables are numeric
   Forms of regression: linear, multiple, weighted, polynomial, nonparametric, and robust
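The simplest of the regression forms listed above, linear regression with one predictor, can be sketched with the closed-form least-squares solution. The function name and the data points are illustrative assumptions.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one numeric predictor."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx  # assumes the xs are not all identical
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data lying exactly on y = 2x + 1.
slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(slope, intercept)
```

Multiple, weighted, and polynomial regression generalize the same least-squares idea to more predictors, per-point weights, and powers of the predictor.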
Scientific and Statistical Data Mining (2)
 Generalized linear models
   Allow a categorical response variable (or some transformation of it) to be related to a set of predictor variables
   Similar to the modeling of a numeric response variable using linear regression
   Include logistic regression and Poisson regression
 Mixed-effect models
   For analyzing grouped data, i.e., data that can be classified according to one or more grouping variables
   Typically describe relationships between a response variable and some covariates in data grouped according to one or more factors
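Logistic regression is the generalized linear model most often met in practice: a linear predictor is passed through the logistic function to model a binary response. The sketch below fits one predictor by plain gradient descent. The learning rate, epoch count, and the study-hours data are assumptions for illustration; a statistics package would normally fit this by iteratively reweighted least squares instead.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, epochs=5000):
    """Fit P(y = 1 | x) = sigmoid(w * x + b) by gradient descent on the log-loss."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        errors = [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]
        grad_w = sum(e * x for e, x in zip(errors, xs)) / n
        grad_b = sum(errors) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Hypothetical data: hours studied vs. exam passed (1) or failed (0).
hours = [0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0]
passed = [0, 0, 0, 0, 1, 1, 1, 1]

w, b = fit_logistic(hours, passed)
print(sigmoid(w * 1.0 + b), sigmoid(w * 3.5 + b))
```

Poisson regression follows the same pattern with a log link and count-valued responses.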
Scientific and Statistical Data Mining (3)
 Regression trees
   Binary trees used for classification and prediction
   Similar to decision trees: tests are performed at the internal nodes
   In a regression tree, the mean of the objective attribute is computed and used as the predicted value
 Analysis of variance
   Analyze experimental data for two or more populations described by a numeric response variable and one or more categorical variables (factors)
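To show how a regression tree uses the mean as its prediction, here is a sketch of a single split (a one-level tree, or "stump"): it tries each threshold on one numeric attribute, keeps the split with the smallest squared error, and predicts the mean of the objective attribute on each side. The function names and data are illustrative; a full tree would apply the same step recursively.

```python
def mean(values):
    return sum(values) / len(values)


def best_split(xs, ys):
    """Return (threshold, left_mean, right_mean) minimizing the sum of squared errors."""
    pairs = sorted(zip(xs, ys))
    best = None
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate equal attribute values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        left_mean, right_mean = mean(left), mean(right)
        sse = sum((y - left_mean) ** 2 for y in left)
        sse += sum((y - right_mean) ** 2 for y in right)
        if best is None or sse < best[0]:
            threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
            best = (sse, threshold, left_mean, right_mean)
    if best is None:
        raise ValueError("need at least two distinct attribute values")
    return best[1:]


def predict(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean


# Hypothetical data with two clearly separated groups.
stump = best_split([1, 2, 3, 10, 11, 12], [5, 6, 7, 20, 21, 22])
print(stump, predict(stump, 2.5))
```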
Statistical Data Mining (4)
 Factor analysis
   Determine which variables are combined to generate a given factor
   E.g., in much psychiatric data, the factor of interest cannot be measured directly, but one can measure other quantities (such as test scores) that reflect it
 Discriminant analysis
   Predict a categorical response variable; commonly used in the social sciences
   Attempts to determine several discriminant functions (linear combinations of the independent variables) that discriminate among the groups defined by the response variable
www.spss.com/datamine/factor.htm
Statistical Data Mining (5)
 Time series: many methods, such as autoregression, ARIMA (autoregressive integrated moving-average modeling), and long-memory time-series modeling
 Quality control: displays group summary charts
 Survival analysis
   Predicts the probability that a patient undergoing a medical treatment would survive at least to time t (life span prediction)
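One standard way to estimate the survival probability described above is the Kaplan-Meier estimator, sketched here as an assumption about how the slide's idea might be implemented (the slides do not name a specific method). At each event time, the running survival estimate is multiplied by the fraction of at-risk patients who did not have the event. The patient data are hypothetical; `observed` is False for patients who left the study before an event was recorded (censored).

```python
def kaplan_meier(durations, observed):
    """Return [(t, S(t))] at each distinct event time, where S(t) estimates P(survive past t)."""
    event_times = sorted({t for t, e in zip(durations, observed) if e})
    survival = 1.0
    curve = []
    for t in event_times:
        at_risk = sum(1 for d in durations if d >= t)
        deaths = sum(1 for d, e in zip(durations, observed) if d == t and e)
        survival *= 1 - deaths / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical follow-up times in months; False marks a censored patient.
durations = [2, 3, 3, 5, 8]
observed = [True, True, False, True, False]

for t, s in kaplan_meier(durations, observed):
    print(t, round(s, 3))
```

Censored patients still count in the at-risk set up to the time they leave, which is what distinguishes this from simply counting survivors.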
Views on Data Mining Foundations (I)
 Data reduction
   Basis of data mining: Reduce data representation
   Trades accuracy for speed in response
 Data compression
   Basis of data mining: Compress the given data by encoding in terms of bits, association rules, decision trees, clusters, etc.
 Probability and statistical theory
   Basis of data mining: Discover joint probability distributions of random variables
Views on Data Mining Foundations (II)
 Microeconomic view
   A view of utility: Finding patterns that are interesting only to the extent that they can be used in the decision-making process of some enterprise
 Pattern Discovery and Inductive databases
   Basis of data mining: Discover patterns occurring in the database, such as associations, classification models, sequential patterns, etc.
   Data mining is the problem of performing inductive logic on databases
   The task is to query the data and the theory (i.e., patterns) of the database
   Popular among many researchers in database systems

Visual Data Mining
 Visualization: Use of computer graphics to create visual images which aid in the understanding of complex, often massive representations of data
 Visual Data Mining: discovering implicit but useful knowledge from large data sets using visualization techniques
[Figure: Visual Data Mining at the intersection of Computer Graphics, Multimedia Systems, Human Computer Interfaces, Pattern Recognition, and High Performance Computing]
Visualization
 Purpose of Visualization
   Gain insight into an information space by mapping data onto graphical primitives
   Provide qualitative overview of large data sets
   Search for patterns, trends, structure, irregularities, relationships among data
   Help find interesting regions and suitable parameters for further quantitative analysis
   Provide a visual proof of computer representations derived
Visual Data Mining & Data Visualization
 Integration of visualization and data mining
   Data visualization
   Data mining result visualization
   Data mining process visualization
   Interactive visual data mining
 Data visualization
   Data in a database or data warehouse can be viewed
     at different levels of abstraction
     as different combinations of attributes or dimensions
   Data can be presented in various visual forms
Data Mining Result Visualization
 Presentation of the results or knowledge obtained from data mining in visual forms
 Examples
   Scatter plots and boxplots (obtained from descriptive data mining)
   Decision trees
   Association rules
   Clusters
   Outliers
   Generalized rules
[Figure: Boxplots from Statsoft: Multiple Variable Combinations]
[Figure: Visualization of Data Mining Results in SAS Enterprise Miner: Scatter Plots]
[Figure: Visualization of Association Rules in SGI/MineSet 3.0]
[Figure: Visualization of a Decision Tree in SGI/MineSet 3.0]
[Figure: Visualization of Cluster Grouping in IBM Intelligent Miner]
Data Mining Process Visualization
 Presentation of the various processes of data mining in visual forms so that users can see
   Data extraction process
   Where the data is extracted
   How the data is cleaned, integrated, preprocessed, and mined
   Method selected for data mining
   Where the results are stored
   How they may be viewed
[Figure: Visualization of Data Mining Processes by Clementine. Callouts: "See your solution discovery process clearly"; "Understand variations with visualized data"]
Interactive Visual Data Mining
 Using visualization tools in the data mining process to help users make smart data mining decisions
 Example
   Display the data distribution in a set of attributes using colored sectors or columns (depending on whether the whole space is represented by a circle or a set of columns)
   Use the display to decide which sector should first be selected for classification and where a good split point for this sector may be
[Figure: Perception-Based Classification (PBC)]
Audio Data Mining
 Uses audio signals to indicate the patterns of data or the features of data mining results
 An interesting alternative to visual mining
 The inverse of mining audio (such as music) databases, where the task is to find patterns in audio data
 Visual data mining may disclose interesting patterns using graphical displays, but requires users to concentrate on watching patterns
 Instead, transform patterns into sound and music, and listen to pitches, rhythms, tune, and melody in order to identify anything interesting or unusual

Data Mining Applications
 Data mining: A young discipline with broad and diverse applications
   There still exists a nontrivial gap between generic data mining methods and effective and scalable data mining tools for domain-specific applications
 Some application domains (briefly discussed here)
   Data Mining for Financial Data Analysis
   Data Mining for Retail and Telecommunication Industries
   Data Mining in Science and Engineering
   Data Mining for Intrusion Detection and Prevention
   Data Mining and Recommender Systems
Data Mining for Financial Data Analysis (I)
 Financial data collected in banks and financial institutions are often relatively complete, reliable, and of high quality
 Design and construction of data warehouses for multidimensional data analysis and data mining
   View the debt and revenue changes by month, by region, by sector, and by other factors
   Access statistical information such as max, min, total, average, trend, etc.
 Loan payment prediction/consumer credit policy analysis
   Feature selection and attribute relevance ranking
   Loan payment performance
   Consumer credit rating
Data Mining for Financial Data Analysis (II)
 Classification and clustering of customers for targeted marketing
   Multidimensional segmentation by nearest-neighbor, classification, decision trees, etc. to identify customer groups or associate a new customer to an appropriate customer group
 Detection of money laundering and other financial crimes
   Integration of data from multiple DBs (e.g., bank transactions, federal/state crime history DBs)
   Tools: data visualization, linkage analysis, classification, clustering tools, outlier analysis, and sequential pattern analysis tools (find unusual access sequences)
Data Mining for Retail & Telecomm. Industries (I)
 Retail industry: huge amounts of data on sales, customer shopping history, e-commerce, etc.
 Applications of retail data mining
   Identify customer buying behaviors
   Discover customer shopping patterns and trends
   Improve the quality of customer service
   Achieve better customer retention and satisfaction
   Enhance goods consumption ratios
   Design more effective goods transportation and distribution policies
 Telecomm. and many other industries: Share many of the goals and expectations of retail data mining
Data Mining Practice for Retail Industry
 Design and construction of data warehouses
 Multidimensional analysis of sales, customers, products, time, and region
 Analysis of the effectiveness of sales campaigns
 Customer retention: Analysis of customer loyalty
   Use customer loyalty card information to register sequences of purchases of particular customers
   Use sequential pattern mining to investigate changes in customer consumption or loyalty
   Suggest adjustments on the pricing and variety of goods
 Product recommendation and cross-reference of items
 Fraud analysis and the identification of unusual patterns
 Use of visualization tools in data analysis
Data Mining in Science and Engineering
 Data warehouses and data preprocessing
   Resolving inconsistencies or incompatible data collected in diverse environments and different periods (e.g., eco-system studies)
 Mining complex data types
   Spatiotemporal, biological, diverse semantics and relationships
 Graph-based and network-based mining
   Links, relationships, data flow, etc.
 Visualization tools and domain-specific knowledge
 Other issues
   Data mining in social sciences and social studies: text and social media
   Data mining in computer science: monitoring systems, software bugs, network intrusion
Data Mining for Intrusion Detection and Prevention
 The majority of intrusion detection and prevention systems use
   Signature-based detection: use signatures, i.e., attack patterns that are preconfigured and predetermined by domain experts
   Anomaly-based detection: build profiles (models of normal behavior) and detect behaviors that deviate substantially from the profiles
 How data mining can help
   New data mining algorithms for intrusion detection
   Association, correlation, and discriminative pattern analysis help select and build discriminative classifiers
   Analysis of stream data: outlier detection, clustering, model shifting
   Distributed data mining
   Visualization and querying tools
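Anomaly-based detection, as described above, comes down to two steps: build a profile of normal behavior, then flag observations that deviate substantially from it. The sketch below uses the simplest possible profile, a mean and standard deviation, with a three-sigma rule. The threshold, function names, and login counts are illustrative assumptions; real systems profile many features at once.

```python
from statistics import mean, stdev


def build_profile(normal_values):
    """Summarize normal behavior as (mean, standard deviation)."""
    return mean(normal_values), stdev(normal_values)


def is_anomalous(value, profile, threshold=3.0):
    """Flag values more than `threshold` standard deviations from the profile mean."""
    mu, sigma = profile
    return abs(value - mu) > threshold * sigma


# Hypothetical logins per hour observed during normal operation.
profile = build_profile([10, 12, 11, 9, 13, 10, 11, 12])

print(is_anomalous(12, profile))  # within the normal range
print(is_anomalous(40, profile))  # far outside the profile
```

Unlike signature-based detection, this can flag attack patterns nobody has seen before, at the cost of false alarms when normal behavior shifts.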
Data Mining and Recommender Systems
 Recommender systems: Personalization, making product recommendations that are likely to be of interest to a user
 Approaches: Content-based, collaborative, or their hybrid
   Content-based: Recommends items that are similar to items the user preferred or queried in the past
   Collaborative filtering: Considers a user's social environment, i.e., the opinions of other customers who have similar tastes or preferences
 Data mining and recommender systems
   Users C × items S: extrapolate from known to unknown ratings to predict user-item combinations
   Memory-based methods often use a k-nearest neighbor approach
   Model-based methods use a collection of ratings to learn a model (e.g., probabilistic models, clustering, Bayesian networks, etc.)
   Hybrid approaches integrate both to improve performance (e.g., using ensemble)
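The memory-based k-nearest neighbor approach can be sketched as follows: to predict a user's unknown rating for an item, find the k most similar users who did rate it and take a similarity-weighted average of their ratings. Cosine similarity over co-rated items is one common choice and is an assumption here, as are the user names and ratings.

```python
import math


def cosine(u, v):
    """Cosine similarity between two users over the items both have rated."""
    common = set(u) & set(v)
    if not common:
        return 0.0
    dot = sum(u[i] * v[i] for i in common)
    norm_u = math.sqrt(sum(u[i] ** 2 for i in common))
    norm_v = math.sqrt(sum(v[i] ** 2 for i in common))
    return dot / (norm_u * norm_v)


def predict_rating(ratings, user, item, k=2):
    """Similarity-weighted average of the k nearest neighbors' ratings for `item`."""
    neighbors = [
        (cosine(ratings[user], other_ratings), other_ratings[item])
        for other, other_ratings in ratings.items()
        if other != user and item in other_ratings
    ]
    top = sorted(neighbors, reverse=True)[:k]  # highest similarity first
    total = sum(sim for sim, _ in top)
    if total == 0:
        return None  # no usable neighbors
    return sum(sim * rating for sim, rating in top) / total


# Hypothetical user -> {item: rating} table; "ann" has not rated item "c".
ratings = {
    "ann": {"a": 5, "b": 3},
    "bob": {"a": 5, "b": 3, "c": 4},
    "cat": {"a": 1, "b": 5, "c": 2},
}

print(predict_rating(ratings, "ann", "c", k=2))
```

The prediction lands closer to "bob"'s rating than to "cat"'s because "bob" agrees with "ann" on every co-rated item.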

Data Mining and Society
Ubiquitous and Invisible Data Mining
 Ubiquitous Data Mining
   Data mining is used everywhere, e.g., online shopping
   Ex. Customer relationship management (CRM)
 Invisible Data Mining
   Invisible: Data mining functions are built into daily life operations
   Ex. Google search: Users may be unaware that they are examining results returned by data mining
   Invisible data mining is highly desirable
   Invisible mining needs to consider efficiency and scalability, user interaction, incorporation of background knowledge and visualization techniques, finding interesting patterns, real-time, …
   Further work: Integration of data mining into existing business and scientific technologies to provide domain-specific data mining tools
Privacy, Security and Social Impacts of Data Mining
 Many data mining applications do not touch personal data
   E.g., meteorology, astronomy, geography, geology, biology, and other scientific and engineering data
   Many DM studies develop scalable algorithms to find general or statistically significant patterns, without touching individuals
 The real privacy concern: unconstrained access to individual records, especially privacy-sensitive information
 Method 1: Removing sensitive IDs associated with the data
 Method 2: Data security-enhancing methods
   Multi-level security model: permit access only at the authorized level
   Encryption: e.g., blind signatures, biometric encryption, and anonymous databases (personal information is encrypted and stored at different locations)
 Method 3: Privacy-preserving data mining methods
Privacy-Preserving Data Mining
 Privacy-preserving (privacy-enhanced or privacy-sensitive) mining:
   Obtaining valid mining results without disclosing the underlying sensitive data values
   Often needs a trade-off between information loss and privacy
 Privacy-preserving data mining methods:
   Randomization (e.g., perturbation): Add noise to the data in order to mask some attribute values of records
   K-anonymity and l-diversity: Alter individual records so that they cannot be uniquely identified
     k-anonymity: Any given record maps onto at least k other records
     l-diversity: enforcing intra-group diversity of sensitive values
   Distributed privacy preservation: Data partitioned and distributed either horizontally, vertically, or a combination of both
   Downgrading the effectiveness of data mining: The output of data mining may violate privacy
     Modify data or mining results, e.g., hiding some association rules or …
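The two checks named above can be sketched on a small generalized table. This sketch assumes the common formulation in which a table is k-anonymous when every combination of quasi-identifier values is shared by at least k records, and l-diverse when each such group contains at least l distinct sensitive values. The column names and records are hypothetical.

```python
from collections import Counter, defaultdict


def is_k_anonymous(records, quasi_identifiers, k):
    """True if every quasi-identifier combination occurs in at least k records."""
    groups = Counter(tuple(r[q] for q in quasi_identifiers) for r in records)
    return all(count >= k for count in groups.values())


def is_l_diverse(records, quasi_identifiers, sensitive, l):
    """True if every quasi-identifier group has at least l distinct sensitive values."""
    groups = defaultdict(set)
    for r in records:
        groups[tuple(r[q] for q in quasi_identifiers)].add(r[sensitive])
    return all(len(values) >= l for values in groups.values())


# Hypothetical table after generalizing age and ZIP code.
records = [
    {"age": "20-29", "zip": "476**", "disease": "flu"},
    {"age": "20-29", "zip": "476**", "disease": "cold"},
    {"age": "30-39", "zip": "479**", "disease": "flu"},
    {"age": "30-39", "zip": "479**", "disease": "flu"},
]

print(is_k_anonymous(records, ["age", "zip"], k=2))
print(is_l_diverse(records, ["age", "zip"], "disease", l=2))
```

The example table passes the k-anonymity check for k = 2 but fails l-diversity for l = 2: everyone in the second group has the same disease, so group membership alone reveals the sensitive value.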
Trends of Data Mining
 Application exploration: Dealing with application-specific problems
 Scalable and interactive data mining methods
 Integration of data mining with Web search engines, database systems, data warehouse systems, and cloud computing systems
 Mining social and information networks
 Mining spatiotemporal, moving objects and cyber-physical systems
 Mining multimedia, text and web data
 Mining biological and biomedical data
 Data mining with software engineering and system engineering
 Visual and audio data mining
 Distributed data mining and real-time data stream mining
 Privacy protection and information security in data mining

Summary
 We present a high-level overview of mining complex data types
 Statistical data mining methods, such as regression, generalized linear models, analysis of variance, etc., are popularly adopted
 Researchers also try to build theoretical foundations for data mining
 Visual/audio data mining has been popular and effective
 Application-based mining integrates domain-specific knowledge with data analysis techniques and provides mission-specific solutions
 Ubiquitous data mining and invisible data mining are penetrating our daily lives
 Privacy and data security are important issues in data mining, and privacy-preserving data mining has been developed recently
 Our discussion on trends in data mining shows that data mining is a promising, young field with great strategic importance
