kerDataMining - 2010 8 19PAstPResentFuture PDF

Data mining has evolved over the past few decades from machine learning and statistics. It involves techniques like pattern extraction, data clustering, and classification that are used to discover hidden patterns and insights from large datasets. Common techniques include association rule mining to find frequent patterns in transactional data and K-means clustering to group similar data points. While data mining of structured data is now well-established, current research also focuses on mining non-traditional data types like text, images, and networks.

Uploaded by

Jaydwin Labiano

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views5 pages

kerDataMining - 2010 8 19PAstPResentFuture PDF

Uploaded by

Jaydwin Labiano

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

The Knowledge Engineering Review, Vol. 00:0, 1–24.

c 2004, Cambridge University Press

DOI: 10.1017/S000000000000000 Printed in the United Kingdom

Data Mining: Past, Present and Future

FRANS COENEN
Department of Computer Science, The University of Liverpool, Liverpool, L693BX, UK,
E-mail: [email protected]

Abstract
Data mining has become a well established discipline within the domain of Artificial Intelligence
(AI) and Knowledge Engineering (KE). It has its roots in machine learning and statistics, but
encompasses other areas of computer science. It has received much interest over the last decade
as advances in computer hardware have provided the processing power to enable large scale data
mining to be conducted. Unlike other innovations in AI and KE, data mining can be argued to
be an application rather then a technology and thus can be expected to remain topical for the
foreseeable future. This paper presents a brief review of the history of data mining, up to the
present day, and some insights into future directions.

1 Introduction
Data Mining has become an established discipline within the scope of computer science. The
origins of data mining can be traced back to the late 80s when the term began to be used, at
least within the research community. In the early days there was little agreement on what the
term data mining encompassed, and it can be argued that in some sense this is still the case.
Broadly data mining can be defined as as set of mechanisms and techniques, realised in software,
to extract hidden information from data. The word hidden in this definition is important; SQL
style querying, however sophisticated, is not data mining. Also the term information should be
interpreted in its widest sense. By the early 1990s data mining was commonly recognised as a
sub-process within a larger process called Knowledge Discovery in Databases or KDD (although
in the modern context of data mining Knowledge Discovery in Data would be more apt, as we
are no longer preoccupied solely by databases). The most commonly used definition of KDD is
that attributed to Fayyad et al. : “The nontrivial process of identifying valid, novel, potentially
useful, and ultimately understandable patterns in data” (Fayyad et al. 1996). As such data
mining should be viewed as the sub-process, within the overall KDD process, concerned with the
discovery of “hidden information”. Other sub-processes that form part of the KDD process are
data preparation (warehousing, data cleaning, pre-processing, etc) and the analysis/visualisation
of results. For may practical purposes KDD and data mining are seen as synonymous, but
technically one is a sub-process of the other.
The data that data mining techniques were originally directed at was tabular data and, given
the processing power available at the time, computational efficiency (and particular the number
databases accesses) was of significant concern. As the amount of processing power generally
available increased, processing time (although still an issue) became less of a concern and was
replaced with a desire for accuracy and a desire to mine ever larger data collections. Today, in
the context of tabular data, we have a well established range of data mining techniques available.
It is well within the capabilities of many commercial enterprises and researchers to mine tabular
data, using software such as SPSS clementine or Weka, on standard desktop machines. However,
the amount of electronic data collected by all kinds of institutions and commercial enterprises,
2 f. coenen

year on year, continues to grow and thus there is still a need for effective mechanisms to mine
ever larger data sets. A second current focus of the data mining community is the application
of data mining to non-standard data sets (i.e. non-tabular data sets). Examples include: image
sets, document collections, video, multimedia data of all kinds, and graph and network data.
The popularity of data mining increased significantly in the 1990s, notably with the estab-
lishment of a number of dedicated conferences; the ACM SIGKDD annual conference in 1995,
and the European PKDD and the Pacific/Asia PAKDD conferences in 1997 (The IEEE ICDM
conference was not introduced till 2001 as was the first SIAM conference on data mining). This
increase in popularity can be attributed to advances in technology; the computer processing
power and data storage capabilities available meant that the processing of large volumes of data
using desk top machines was a realistic possibility. It became common place for commercial
enterprises to maintain data in computer readable form, in most cases this was primarily to
support commercial activities, the idea that this data could be mined often came second. The
1990s also saw the introduction of customer loyalty cards (particularly with respect to large
super market chains) that allowed enterprises to record customer purchases, the resulting data
could then be mined to identify customer purchasing patterns. The popularity of data mining has
continued to grow over the last decade with a particular current emphasis on mining non-standard
data (i.e. non-tabular data).

2 Data Mining Mechanism and Techniques

The mechanisms and techniques within the remit of of data mining can be described as an
amalgamation of approaches to machine learning and statistics; from this perspective data mining
can be said to have “grown” out of the disciplines of machine learning and statistics. Indeed the
data mining community is dominated by a mix of computer scientists and statisticians. The
European Conference on Machine Learning (ECML) and the European Conference on Principles
and practice of Knowledge Discovery in Databases (PKDD) came together on 2001 and have
stayed together ever since. There is, however, a distinction between data mining and machine
learning. Data mining is focused on data (in all its formats) and as such can be viewed as
an application domain; while machine learning, at least in its traditional form, is focussed on
mechanisms whereby computers can learn (for example, one focus of early work on machine
learning was computer programmes that could learn to play chess). Machine learning can thus
be viewed as a technology, whereas data mining, and by extension KDD, as an application.
Traditionally data mining techniques can very broadly be categorised as being directed as
either: (i) pattern extraction/identification, (ii) data clustering or (iii) classification/ categorisa-
tion. Each is briefly considered in more detail in the following subsections. Within the current data
mining literature we can also find reference to many other techniques that have been adopted from
fields such as statistics and mathematics, for example linear regression and Principal Component
Analysis (PCA).

2.1 Pattern extraction

Throughout its history data mining has had a substantial focus on finding patterns in data.
These patterns can take many forms, we have already mentioned customer purchasing patterns;
alternative patterns my be trends in temporal or longitudinal data, frequently occurring
subgraphs in graph data and so on. A patten is any frequently occurring combination of entities,
events, objects, etc. The exemplar pattern mining technique is Association Rule Mining (ARM)
as first proposed by Agrawal et al. in the context of super market basket analysis (Agrawal et
al., 1993). The aim here was to identify frequent occurring patterns in the data and then, from
these patterns, extract Association Rules (ARs). An AR is a probabilistic rule that states that
if some set of data attributes occur together then some other (disjoint) set of attributes is also
likely to occur. The fundamental challenge of ARM is that given a data set with N attributes
(field-value pairs), there are 2N − 1 candidate patterns. ARM has attracted much attention from
Data Mining: Past, Present and Future 3

the data mining community over the years. Many extensions have been proposed such as weighted
and utility ARM, spatio-temporal ARM, incremental ARM, fuzzy ARM etc. Frequent pattern
mining remains a common area of investigation within the domain of data mining. Resent work on
frequent pattern mining has been directed at recommender systems (people who bought x also
bought y). The most popular current frequent pattern mining algorithm is arguably Frequent
Pattern (FP) growth (Han et al., 2000).

2.2 Clustering

Clustering is concerned with the grouping of data into categories. This is particularly desirable in
the context of customer data where it is useful to group similar customers together for the purpose
of (say) targeted advertising. For many concerns clustering is an exploratory activity. Typically
we wish to cluster data into either a specified number of clusters, as in the case of the well known
K-means algorithm (MacQueen, 1967); or according to some proximity threshold, as in the case of
the well established KNN algorithm (Hastie and Tibshirani, 1996). An alternative approach is to
adopt some form of hierarchical clustering where the data is iteratively partitioned to form a set of
clusters. The most frequently sighted hierarchical clustering algorithm is arguably BIRCH (Zhang
et al., 1996).The “goodness” of a cluster configuration is usually measured in terms of intra-cluster
cohesion and inter-cluster separation. The issues with established clustering algorithms, such as
K-means and KNN, are that the generated clusters are represented as hyper-spheres when this
may not be the ideal shape. Further issues are: the frequently encountered high dimensionality
of the input data, and the treatment of noise (outliers) and categorical data. Clustering is a well
established data mining (and before that machine learning) technique. Interestingly there is no
“best” clustering algorithm applicable to all data; instead, for reasons that are not entirely clear,
some algorithms work better on some data sets than others.

2.3 Classification

Classification is concerned with the construction of “classifiers” that can be applied to “unseen”
data so as to categorise that data into groups (classes). As such classification has parallels with
clustering. The distinction, however, is that classification requires pre-labelled training data from
which the classifiers can be built. As such classification is sometimes referred to as supervised
learning while clustering is considered to represent unsupervised learning. The desired classifiers,
can take many forms: decision trees, Support Vector Machines (SVMs) as first proposed by Vapnik
(1995), rules, etc. Decision trees are the simplest. The most influential decision tree generation
algorithm with respect to data mining is Quinlan’s C4.5 algorithm (Quinlan, 1993). The advantage
of rule based classifiers is that they offer a ready explanation to end users. In the context of rule
base classifiers classification rules can be considered to be a special form of AR and as such
ARM techniques (see above) can be used to generate such rules. The most frequently referenced
classification ARM algorithm is arguably the CBA algorithm (Liu et al., 1998). Other notable
classification techniques include regression, for example the CART algorithm (Breiman et al.
1984), and Naive Bayes (Hand and Yu 2001). Classifiers can be either: (i) binary classifiers (select
between two alternatives), (ii) multi-class classifiers (select between more than two alternatives);
or (iii) multi-labelled (assign unseen data to one or more classes). Binary classifiers are the
simplest to generate. The quality of a generated classifier is usually measured in terms of accuracy,
sensitivity and specificity. To an extent similarities can be drawn between classification and
Case Based Reasoning, both operate using previous cases/knowledge. Classification continuous
to receive attention from the data mining community. One extension is the concept of ordinal
classifiers where the possible classes are ordered in some way. There is also significant interest in
dynamic classification, for example classifying video sequences.
4 f. coenen

3 Applications
From the foregoing the original focus of data mining was tabular data; an extremely effective set
of techniques has been established directed at the mining of tabular data, however data miners
wish to mine everything! This section briefly reviews some current applications of the technology
beyond simple tabular mining. There are, of course, many more.

3.1 Text Mining

A Natural next step from traditional tabular data mining was text mining. A typical application
is to build classifiers to categorise or cluster large document collections (news articles are a
popular example, another is web pages). Another application is opinion or questionnaire ming
where the objective is to obtain useful informations, i.e. “opinions”, from the free text element of
questionnaire style data. A further application is text summarisation, an application that starts
to “blur” into the domain of information retrieval. In the context of text classification SVMs
operate well (but offer no explanation of resulting classifications). Generally speaking the issue
with text mining is how best to represent textual data so as to allow the application of data mining
techniques. The most common representation is the bag-of-words representation where documents
are represented in terms of a collection of key words. The question then is what keywords to
include? These can be defined by experts, or extracted using other data mining techniques or
Natural language Processing (NLP) techniques. An alternative to the bag-of-words representation
is the bag-of-phrases representation. However, in both cases, the ordering of words/phrases is lost.
Alternative techniques attempt to maintain this knowledge, however this entails a significant
increase in computational complexity. Text mining, in all its forms, continues to be a popular
data mining activity.

3.2 ImageMining
There are many large collections of digital images that have been generated with respect to many
applications. As in the case of text mining, image mining is concerned with the representation of
images (both 2D and 3D) so that mining techniques may be applied. For this purpose images can
be represented in many different ways, popular techniques include the generation of histograms
or trees/graphs (one per image). Alternatively, we can attempt to represent images in terms of
sets of objects identified using segmentation and registration techniques. Image segmentation
techniques have limited success, depending on the nature of the images, and are the subject of
continuing research within the image analysis community. Image analysis remains a challenging
research topic (we are still unable to get a machine to distinguish between a cat and a dog with
any degree reliability). In certain fields, such as medical image mining, where the problem can be
scoped in a specific way, image mining has had some successes. Examples include the classification
of retina image data and Magnetic Resonance Imaging (MRI) scan data to identify disorders.
Another popular area of application is satellite image mining. Current research in image mining
continues to be focused on how best to represent images so that data mining techniques can be
applied. In this respect it is worth observing that for the application of data mining techniques
we do not need to have a representation that is interpretable by humans, as long as the data
mining works (for example we do not necessarily need precise segmentation techniques).

3.3 GraphMining
Graph (and tree) mining is essentially an extension of frequent pattern mining (see above), what
we are interested in is frequently occurring sub-graphs. Graph mining practitioners argue that
everything can be represented as a graph. Indeed it is straight forward to see how entities such
as documents, emails and images can be represented in this form. A common application area
is chemical compound analysis. At a high level we can identify two forms of the problem: (i)
Data Mining: Past, Present and Future 5

frequent sub-graphs that occur across a collection of graphs, and (ii) frequent sub-graphs that
occur in one very large graph. We can also distinguish between graph mining and tree mining; tree
mining is more tractable as advantage can be taken of the inherent features of a tree (no cycles,
etc). Graph (and tree) mining require some canonical form with which to represent the graphs;
much early work was focussed on this. The main current issues with graph mining are candidate
subgraph generation and sub-graph isomorphism testing. The most influential frequent sub-graph
mining algorithm is arguably gSpan (Yan and Han, 2002). A popular extension of graph mining
is social network mining. The motivation here is the popularity of social networking sites such as
Facebook, and the consequent desire to identify groupings (communities) within these networks.
However, there are many other forms of social networks, such as transport and co-authoring
(bibliographic) networks, to which social network mining techniques can be applied.

4 Conclusions
Data mining has come to prominence over the last two decades as a discipline in its own right
which offers benefits with respect to many domains, both commercial and academic. Broadly
data mining can be viewed as a application domain, as opposed to a technology. The increasing
ability of institutions to collect electronic data, facilitated by advanced in computer processing,
means that the desire to “mine” data is likely to expend. The data mining community has a well
established set of techniques available which we are seeking to apply to an ever greater variety of
data. Generally speaking the actual data mining processes, in many cases, are readily available.
Current issues are more concerned with the processing of data so that data mining techniques
can be applied, and the post-processing (e.g. visualisation, explanation generation, etc.) of the
end result. Thus, although we are very good at the actual data mining, the “end-to-end” process
of data mining still requires significant research input. Another driver for research in data mining
is the ever increasing size of the data we wish to work with. We are therefore also interested in
techniques to mine ever larger data sets (and an ever greater variety of data).

References
Agrawal, R, Imielinski, T. and Swami, A. (1993). Mining association rules between sets of items in large
databases. Proc. ACM SIGMOD International Conference on Management of Data (SIGMOD’93),
ACM Press, pp207-216.
Breiman, L., Friedman, Y., Olshen, R. and Stone, C. (1984). Classification and Regression Trees.
Wadsworth, Belmont, CA, 1984.
Fayyad, U., Piatetsky-Shapiro, H. and Smyth, P. (1996). The KDD process for extracting useful
knowledge from volumes of data. Communications of the ACM, 39 (11), pp27 - 34.
Han, J., Pei, J., and Yin, Y. (2000). Mining frequent patterns without candidate generation. Proc. ACM
SIGMOD Conference on Management of Data (SIGMOD ’00). ACM Press, pp1-12.
Hand, D.J. and Yu, K. (2001). Idiot’s Bayes: Not So Stupid After All? Internat. Statist. Rev. 69, pp385-
398.
Hastie, T. and Tibshirani, R. (1996). Discriminant Adaptive Nearest Neighbor Classification. IEEE
Transaction on Pattern Analysis and Machibe Intelligence, 18(6), pp607-616.
Liu, B., Hsu, W. and Ma, Y. M. (1998). Integrating classification and association rule mining. Proc
KDD-98, ACM press, pp.80-86.
MacQueen, J. B. (1967). Some methods for classification and analysis of multivariate observations. Proc.
5th Berkeley Symp. Mathematical Statistics and Probability, University of California Press, pp281-
297.
Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc.
Vapnik, V. N. (1995). The Nature of Statistical Learning Theory. Springer-Verlag.
Yan, X. and Han, J. (2002). gSpan: Graph-Based Substructure Pattern Mining. Proc. IEEE International
Conference on Data Mining (ICDM ’02), IEEE, pp721-724.
Zhang, T., Ramakrishnan, R., and Livny, M. (1996). BIRCH: an efficient data clustering method for very
large databases. Proc. ACM SIGMOD international Conference on Management of Data, ACM Press,
pp103-114.

Research On Uber Eats Design (Autosaved)
No ratings yet
Research On Uber Eats Design (Autosaved)
22 pages
Fbi Crime Analysis and Prediction Using Machine Learning
No ratings yet
Fbi Crime Analysis and Prediction Using Machine Learning
8 pages
Introduction To Software Security Concepts
No ratings yet
Introduction To Software Security Concepts
10 pages
User Guide Xpr7550e Xpr7580e Color Display
No ratings yet
User Guide Xpr7550e Xpr7580e Color Display
1,149 pages
Hubbert Smith - Data Center Storage - Cost-Effective Strategies, Implementation, and Management (2011, Auerbach Publications)
No ratings yet
Hubbert Smith - Data Center Storage - Cost-Effective Strategies, Implementation, and Management (2011, Auerbach Publications)
363 pages
Industrial Automation - Technical Interview Questions
81% (21)
Industrial Automation - Technical Interview Questions
27 pages
Project of Networking
100% (1)
Project of Networking
5 pages
Mining
No ratings yet
Mining
7 pages
Knowledge Discovery in Databases (KDD) : An Overview
No ratings yet
Knowledge Discovery in Databases (KDD) : An Overview
4 pages
Data Mining and Its Techniques: A Review Paper: Maria Shoukat (MS Student)
No ratings yet
Data Mining and Its Techniques: A Review Paper: Maria Shoukat (MS Student)
7 pages
DMWH M1
No ratings yet
DMWH M1
25 pages
23 Vol 2 No 4
No ratings yet
23 Vol 2 No 4
5 pages
Datamining & Cluster Coputing
No ratings yet
Datamining & Cluster Coputing
16 pages
1st Slides
No ratings yet
1st Slides
60 pages
Introduction To Data Mining
No ratings yet
Introduction To Data Mining
8 pages
DM Module 1
No ratings yet
DM Module 1
11 pages
UNIT 2
No ratings yet
UNIT 2
60 pages
Data Mining and Data Warehousing
No ratings yet
Data Mining and Data Warehousing
25 pages
1.1 Introduction To Data Mining: 1.1.1 Moving Toward The Information Age
No ratings yet
1.1 Introduction To Data Mining: 1.1.1 Moving Toward The Information Age
14 pages
Background: Research and Evolution
No ratings yet
Background: Research and Evolution
6 pages
A Brief Study and Analysis of Data Mining Techniques
No ratings yet
A Brief Study and Analysis of Data Mining Techniques
4 pages
Screenshot 2023-10-19 at 11.36.57
No ratings yet
Screenshot 2023-10-19 at 11.36.57
27 pages
Unit 1 Datamining For Business Intelligence
No ratings yet
Unit 1 Datamining For Business Intelligence
101 pages
The KDD Process For From Volumes Of: Extracting Useful Knowledge Data
No ratings yet
The KDD Process For From Volumes Of: Extracting Useful Knowledge Data
12 pages
UNIT 2
No ratings yet
UNIT 2
58 pages
Data Mining Research
No ratings yet
Data Mining Research
4 pages
Unit II Data Mining
No ratings yet
Unit II Data Mining
8 pages
Data Mining Prologues: K.Sankar Lecturer / M.E., (P.HD) ., D.V.Rajkumar M.C.A., M.Phil Lecturer
No ratings yet
Data Mining Prologues: K.Sankar Lecturer / M.E., (P.HD) ., D.V.Rajkumar M.C.A., M.Phil Lecturer
4 pages
Data Mining A Conceptual Overview
No ratings yet
Data Mining A Conceptual Overview
32 pages
Data Mining Using Neural Networks: Miss. Mukta Arankalle
No ratings yet
Data Mining Using Neural Networks: Miss. Mukta Arankalle
36 pages
A Review of Data Mining Literature
No ratings yet
A Review of Data Mining Literature
6 pages
04cali 67
No ratings yet
04cali 67
8 pages
DM-unit 1
No ratings yet
DM-unit 1
22 pages
5104 - 07.S. L. Nalawade1
No ratings yet
5104 - 07.S. L. Nalawade1
5 pages
Sheenaz Project
No ratings yet
Sheenaz Project
22 pages
Introduction To Data Mining Techniques: Dr. Rajni Jain
No ratings yet
Introduction To Data Mining Techniques: Dr. Rajni Jain
11 pages
Web Intelligence: What Is Webintelligence?
No ratings yet
Web Intelligence: What Is Webintelligence?
25 pages
Unit I DM
No ratings yet
Unit I DM
27 pages
The Survey of Data Mining Applications and Feature Scope
No ratings yet
The Survey of Data Mining Applications and Feature Scope
16 pages
Application Based, Advantageous K-Means Clustering Algorithm in Data Mining - A Review
No ratings yet
Application Based, Advantageous K-Means Clustering Algorithm in Data Mining - A Review
6 pages
M.sc. SE IVth Yr ISEE84 Data Warehouse and Mining
No ratings yet
M.sc. SE IVth Yr ISEE84 Data Warehouse and Mining
110 pages
Yihao Final Paper CCSC For Submission
No ratings yet
Yihao Final Paper CCSC For Submission
6 pages
: - -: What The Data Mining?: عوضوملا
No ratings yet
: - -: What The Data Mining?: عوضوملا
6 pages
DATA MINING-Knowledge Discovery in Databases
No ratings yet
DATA MINING-Knowledge Discovery in Databases
6 pages
Data Mining: Encyclopedic Style Neutral
No ratings yet
Data Mining: Encyclopedic Style Neutral
12 pages
V3N2 121 PDF
No ratings yet
V3N2 121 PDF
4 pages
DWM 4
No ratings yet
DWM 4
23 pages
Absract:: Data, Information, and Knowledge
No ratings yet
Absract:: Data, Information, and Knowledge
7 pages
Unit III
No ratings yet
Unit III
101 pages
First Page PDF
No ratings yet
First Page PDF
1 page
Kinds of Data: 1. Data Bases Data 2.data Warehouses Data 3. Transactional Data
No ratings yet
Kinds of Data: 1. Data Bases Data 2.data Warehouses Data 3. Transactional Data
24 pages
Data Mining and Its Applications
No ratings yet
Data Mining and Its Applications
60 pages
Module1 DataMining Ktustudents - in
No ratings yet
Module1 DataMining Ktustudents - in
24 pages
Unit 1
No ratings yet
Unit 1
21 pages
Unit 2
No ratings yet
Unit 2
20 pages
Data Structures: Notes For Lecture 12 Introduction To Data Mining by Samaher Hussein Ali
No ratings yet
Data Structures: Notes For Lecture 12 Introduction To Data Mining by Samaher Hussein Ali
4 pages
1.1 Data and Information Mining
No ratings yet
1.1 Data and Information Mining
24 pages
Data Mining and Warehousing-1
No ratings yet
Data Mining and Warehousing-1
43 pages
Data Warehousing & Data Mining Syllabus Subject Code:56055 L:4 T/P/D:0 Credits:4 Int. Marks:25 Ext. Marks:75 Total Marks:100
No ratings yet
Data Warehousing & Data Mining Syllabus Subject Code:56055 L:4 T/P/D:0 Credits:4 Int. Marks:25 Ext. Marks:75 Total Marks:100
52 pages
DM - UNIT I
No ratings yet
DM - UNIT I
58 pages
Unit-1 Notes
No ratings yet
Unit-1 Notes
24 pages
Data Mining Information
100% (1)
Data Mining Information
15 pages
Notes For DMDWH - Module1
No ratings yet
Notes For DMDWH - Module1
21 pages
Unit 1
No ratings yet
Unit 1
11 pages
Bi - Unit 3
No ratings yet
Bi - Unit 3
18 pages
SelectionStatement PDF
No ratings yet
SelectionStatement PDF
17 pages
Crime Analytics: Exploring Analysis of Crimes Through R Programming Language
No ratings yet
Crime Analytics: Exploring Analysis of Crimes Through R Programming Language
5 pages
Evolving Data Mining Algorithms On The Prevailing Crime Trend - An Intelligent Crime Prediction Model
No ratings yet
Evolving Data Mining Algorithms On The Prevailing Crime Trend - An Intelligent Crime Prediction Model
6 pages
Innovation in and The Diffusion Of: Technology
No ratings yet
Innovation in and The Diffusion Of: Technology
7 pages
The Ethical Dangers and Merits of Predictive Policing: Community Safety & Well-Being
No ratings yet
The Ethical Dangers and Merits of Predictive Policing: Community Safety & Well-Being
5 pages
Research Methodology: "Approaches To Research Methods"
No ratings yet
Research Methodology: "Approaches To Research Methods"
14 pages
Journal of Food Engineering: Avi Herbon, Avisahi Ceder (Avi)
No ratings yet
Journal of Food Engineering: Avi Herbon, Avisahi Ceder (Avi)
12 pages
JaydwinLabiano - WhatData MIne
No ratings yet
JaydwinLabiano - WhatData MIne
9 pages
Implementing Classification Techniques of Data Mining in Creating Model For Predicting Academic Marketing
No ratings yet
Implementing Classification Techniques of Data Mining in Creating Model For Predicting Academic Marketing
7 pages
SIPROTEC 7SA86 Profile
No ratings yet
SIPROTEC 7SA86 Profile
2 pages
Project Management
No ratings yet
Project Management
12 pages
Suman Resume
No ratings yet
Suman Resume
2 pages
Assignment
No ratings yet
Assignment
8 pages
PowerLogic PM8000 Series - METSEPM8210
No ratings yet
PowerLogic PM8000 Series - METSEPM8210
5 pages
Catalogo Productos Uniview 2021
No ratings yet
Catalogo Productos Uniview 2021
43 pages
Brochure ViewPoint6 EUEN Digital JB05708XE
No ratings yet
Brochure ViewPoint6 EUEN Digital JB05708XE
8 pages
TOS Q1 TLEICTCSS 9 - Jonalyn Ambrona
No ratings yet
TOS Q1 TLEICTCSS 9 - Jonalyn Ambrona
2 pages
Help For The W3C Markup Validation Service
No ratings yet
Help For The W3C Markup Validation Service
10 pages
Ndace Project 123
100% (1)
Ndace Project 123
42 pages
HPVM Cheat Sheet
No ratings yet
HPVM Cheat Sheet
4 pages
MQTC v2016 IIB Performance Final PDF
No ratings yet
MQTC v2016 IIB Performance Final PDF
143 pages
Virtual Campus Recruitment Guide
No ratings yet
Virtual Campus Recruitment Guide
19 pages
Samuel P. Tregelles - The Hope of Christ's Second Coming (1864)
No ratings yet
Samuel P. Tregelles - The Hope of Christ's Second Coming (1864)
97 pages
ProductPrice Oracle資料庫百寶箱
No ratings yet
ProductPrice Oracle資料庫百寶箱
8 pages
BPP - EHS - Incident Management Rev 040607 Final
100% (1)
BPP - EHS - Incident Management Rev 040607 Final
21 pages
C3425 - Heating Roller Temperature Sensor (TEMS2) - IH Power Supply (IHPU) - Noise Filter (NF1) - Printer Control Board (PRCB)
No ratings yet
C3425 - Heating Roller Temperature Sensor (TEMS2) - IH Power Supply (IHPU) - Noise Filter (NF1) - Printer Control Board (PRCB)
5 pages
Keys Soal Latihan TKM Diambil Dari Aspd 2021
No ratings yet
Keys Soal Latihan TKM Diambil Dari Aspd 2021
6 pages
An_Empirical_Study_of_DevSecOps_Focused_on_Continuous_Security_Testing
No ratings yet
An_Empirical_Study_of_DevSecOps_Focused_on_Continuous_Security_Testing
8 pages
Interview Questions For QA Tester
No ratings yet
Interview Questions For QA Tester
27 pages
Description Features: PT6311 VFD Driver/Controller IC
No ratings yet
Description Features: PT6311 VFD Driver/Controller IC
22 pages
Incidente
No ratings yet
Incidente
25 pages
How To Create Your NFT Marketplace With An OpenSea Clone Script
No ratings yet
How To Create Your NFT Marketplace With An OpenSea Clone Script
6 pages
3G UMTS Architecture
No ratings yet
3G UMTS Architecture
48 pages

kerDataMining - 2010 8 19PAstPResentFuture PDF

Uploaded by

kerDataMining - 2010 8 19PAstPResentFuture PDF

Uploaded by

The Knowledge Engineering Review, Vol. 00:0, 1–24.

c 2004, Cambridge University Press

Data Mining: Past, Present and Future

2 Data Mining Mechanism and Techniques

2.1 Pattern extraction

3.1 Text Mining

You might also like