9 Data Mining - Classification & Prediction

The document discusses two primary forms of data analysis in data mining: classification and prediction. Classification involves predicting categorical labels, while prediction focuses on forecasting continuous values. Key processes include building classifiers, using them for classification, and addressing issues like data preparation, accuracy, speed, robustness, scalability, and interpretability.

Uploaded by

besongbryan5


3/10/2023 Data Mining - Classification & Prediction

Data Mining - Classification & Prediction

Two forms of data analysis can be used to extract models that describe important classes or to
predict future data trends. These two forms are as follows −

Classification
Prediction

Classification models predict categorical class labels, whereas prediction models predict
continuous-valued functions. For example, we can build a classification model to categorize bank loan
applications as either safe or risky, or a prediction model to predict the expenditures in dollars of
potential customers on computer equipment given their income and occupation.

What is classification?
The following are examples of cases where the data analysis task is classification −

A bank loan officer wants to analyze the data in order to know which customers (loan applicants)
are risky and which are safe.
A marketing manager at a company needs to determine whether a customer with a given profile will
buy a new computer.

In both of the above examples, a model or classifier is constructed to predict the categorical labels.
These labels are risky or safe for loan application data and yes or no for marketing data.

What is prediction?
The following is an example of a case where the data analysis task is prediction −

Suppose the marketing manager needs to predict how much a given customer will spend during a
sale at the company. In this example we need to predict a numeric value, so the data
analysis task is an example of numeric prediction. In this case, a model or predictor is
constructed that predicts a continuous-valued function, or ordered value.

Note − Regression analysis is a statistical methodology that is most often used for numeric
prediction.
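As a rough illustration of numeric prediction with regression, the sketch below fits a straight line by ordinary least squares. The income and spending figures, and the function name fit_line, are made up for this example; a real predictor would usually use more attributes and a statistics library.

```python
def fit_line(xs, ys):
    """Least-squares fit of y = slope * x + intercept for one input attribute."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance-like and variance-like sums (the 1/n factors cancel in the ratio).
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical data: income in thousands of dollars, spend during a sale in dollars.
incomes = [30, 40, 50, 60, 70]
spend = [200, 260, 310, 370, 420]

slope, intercept = fit_line(incomes, spend)

# Predict a continuous value for a new customer with an income of 55 (thousand).
predicted_spend = slope * 55 + intercept
print(f"spend = {slope:.2f} * income + {intercept:.2f}")
print(f"predicted spend: {predicted_spend:.2f}")
```

The output of the model is a number on a continuous scale rather than a class label, which is what separates this task from classification.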

How Does Classification Work?

https://www.tutorialspoint.com/data_mining/dm_classification_prediction.htm 1/4

With the help of the bank loan application that we have discussed above, let us understand the
working of classification. The Data Classification process includes two steps −

Building the Classifier or Model

Using Classifier for Classification

Building the Classifier or Model

This step is the learning step or the learning phase.
In this step the classification algorithms build the classifier.
The classifier is built from the training set made up of database tuples and their associated
class labels.
Each tuple in the training set is assumed to belong to a predefined category or class. These tuples
can also be referred to as samples, objects or data points.

Using Classifier for Classification

In this step, the classifier is used for classification. Here the test data is used to estimate the
accuracy of classification rules. The classification rules can be applied to the new data tuples if the
accuracy is considered acceptable.
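The two steps can be sketched with a deliberately simple classifier. This is only an illustration under assumptions: the single income attribute, the threshold rule, the sample tuples and the 0.7 acceptance level are all hypothetical and do not come from the text, which does not prescribe a particular algorithm.

```python
def build_classifier(training_set):
    """Learning step: choose the income threshold that classifies the most
    training tuples correctly. Rule: income >= threshold -> "safe", else "risky"."""
    candidates = sorted({income for income, _ in training_set})
    best_threshold, best_correct = candidates[0], -1
    for threshold in candidates:
        correct = sum(
            1 for income, label in training_set
            if classify(threshold, income) == label
        )
        if correct > best_correct:
            best_threshold, best_correct = threshold, correct
    return best_threshold


def classify(threshold, income):
    """Apply the learned rule to a single tuple."""
    return "safe" if income >= threshold else "risky"


def estimate_accuracy(threshold, test_set):
    """Classification step: fraction of test tuples labelled correctly."""
    hits = sum(1 for income, label in test_set if classify(threshold, income) == label)
    return hits / len(test_set)


# Hypothetical tuples: (income in thousands, class label).
training_set = [(20, "risky"), (25, "risky"), (32, "risky"),
                (45, "safe"), (50, "safe"), (62, "safe")]
test_set = [(22, "risky"), (40, "safe"), (48, "safe"), (70, "safe")]

ACCEPTABLE_ACCURACY = 0.7  # assumed cut-off for "acceptable"

threshold = build_classifier(training_set)
test_accuracy = estimate_accuracy(threshold, test_set)

# Only apply the rule to new data if the estimated accuracy is acceptable.
new_label = classify(threshold, 55) if test_accuracy >= ACCEPTABLE_ACCURACY else None
print(threshold, test_accuracy, new_label)
```

The test tuples are kept separate from the training tuples so that the accuracy estimate reflects how the rule behaves on data it was not built from.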


Classification and Prediction Issues

The major issue is preparing the data for Classification and Prediction. Preparing the data involves
the following activities −

Data Cleaning − Data cleaning involves removing noise and treating missing values.
Noise is removed by applying smoothing techniques, and the problem of missing values is
solved by replacing a missing value with the most commonly occurring value for that attribute.

Relevance Analysis − The database may also contain irrelevant attributes. Correlation analysis
is used to determine whether any two given attributes are related.

Data Transformation and reduction − The data can be transformed by any of the following
methods.

Normalization − The data is transformed using normalization. Normalization involves
scaling all values for a given attribute so that they fall within a small specified
range. Normalization is used when neural networks or methods involving distance
measurements are used in the learning step.

Generalization − The data can also be transformed by generalizing it to higher-level
concepts. For this purpose we can use concept hierarchies.
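The two transformations above can be sketched as follows. Min-max scaling is used here as one common form of normalization, and the city-to-country mapping is a hypothetical concept hierarchy invented for the example; neither choice is specified by the text.

```python
def min_max_normalize(values, new_min=0.0, new_max=1.0):
    """Scale values linearly so they fall within [new_min, new_max]."""
    old_min, old_max = min(values), max(values)
    if old_max == old_min:
        # All values are identical, so there is no spread to preserve.
        return [new_min for _ in values]
    return [
        new_min + (v - old_min) / (old_max - old_min) * (new_max - new_min)
        for v in values
    ]


# Hypothetical concept hierarchy: low-level concept (city) -> higher-level concept (country).
CITY_TO_COUNTRY = {"Mumbai": "India", "Delhi": "India", "Chicago": "USA"}


def generalize(values, hierarchy):
    """Replace each value by its higher-level concept; unknown values are left as-is."""
    return [hierarchy.get(v, v) for v in values]


incomes = [20000, 35000, 50000, 80000]
cities = ["Mumbai", "Chicago", "Delhi", "Oslo"]

normalized_incomes = min_max_normalize(incomes)
generalized_cities = generalize(cities, CITY_TO_COUNTRY)
print(normalized_incomes)
print(generalized_cities)
```

After scaling, an attribute with large raw values such as income no longer dominates attributes measured on smaller scales when distances between tuples are computed.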

Note − Data can also be reduced by some other methods such as wavelet transformation, binning,
histogram analysis, and clustering.
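The data cleaning and relevance analysis activities listed earlier can also be sketched in a few lines. Smoothing by bin means and the Pearson coefficient are assumptions made for this example (the text only says "smoothing techniques" and "correlation analysis"), and all sample values are invented.

```python
from collections import Counter
from math import sqrt


def fill_missing_with_mode(values, missing=None):
    """Replace missing entries with the most commonly occurring value."""
    present = [v for v in values if v is not missing]
    mode = Counter(present).most_common(1)[0][0]
    return [mode if v is missing else v for v in values]


def smooth_by_bin_means(values, bin_size):
    """Sort the values, split them into equal-size bins, and replace each
    value by the mean of its bin. Returns the smoothed values in sorted order."""
    ordered = sorted(values)
    smoothed = []
    for start in range(0, len(ordered), bin_size):
        bin_values = ordered[start:start + bin_size]
        mean = sum(bin_values) / len(bin_values)
        smoothed.extend([mean] * len(bin_values))
    return smoothed


def pearson(xs, ys):
    """Pearson correlation coefficient between two numeric attributes."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


occupations = ["clerk", None, "engineer", "clerk", None]
prices = [4, 8, 15, 21, 21, 24, 25, 28, 34]

cleaned_occupations = fill_missing_with_mode(occupations)
smoothed_prices = smooth_by_bin_means(prices, bin_size=3)
correlation = pearson([1, 2, 3, 4], [2, 4, 6, 8])
print(cleaned_occupations)
print(smoothed_prices)
print(correlation)
```

A coefficient close to 1 or -1 suggests that two attributes carry largely the same information, so one of them may be a candidate for removal before learning.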

Comparison of Classification and Prediction Methods

Here are the criteria for comparing the methods of Classification and Prediction −


Accuracy − The accuracy of a classifier refers to its ability to predict the class label
correctly. The accuracy of a predictor refers to how well it can guess the
value of the predicted attribute for new data.

Speed − This refers to the computational cost in generating and using the classifier or
predictor.

Robustness − It refers to the ability of the classifier or predictor to make correct predictions from
noisy data.

Scalability − Scalability refers to the ability to construct the classifier or predictor efficiently
given a large amount of data.

Interpretability − It refers to the extent to which the classifier or predictor can be understood.
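The accuracy criterion is the easiest of these to put into numbers. The sketch below measures classifier accuracy as the fraction of correct labels, and uses mean absolute error as one possible way to express how well a predictor guesses a numeric value; that error measure and the sample values are assumptions made for illustration.

```python
def classifier_accuracy(true_labels, predicted_labels):
    """Fraction of tuples whose predicted class label matches the true label."""
    matches = sum(1 for t, p in zip(true_labels, predicted_labels) if t == p)
    return matches / len(true_labels)


def mean_absolute_error(true_values, predicted_values):
    """Average absolute difference between predicted and true numeric values."""
    total = sum(abs(t - p) for t, p in zip(true_values, predicted_values))
    return total / len(true_values)


# Hypothetical outputs of a classifier and of a predictor on unseen data.
true_labels = ["safe", "risky", "safe", "safe"]
predicted_labels = ["safe", "risky", "risky", "safe"]

true_spend = [100, 200, 300]
predicted_spend = [110, 190, 330]

accuracy = classifier_accuracy(true_labels, predicted_labels)
error = mean_absolute_error(true_spend, predicted_spend)
print(f"classifier accuracy: {accuracy:.2f}")
print(f"predictor mean absolute error: {error:.2f}")
```

Higher is better for classifier accuracy, while lower is better for the error of a predictor, so the two figures cannot be compared with each other directly.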
