Classification Algorithm

Uploaded by

r9492046

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views51 pages

Classification Algorithm

Uploaded by

r9492046

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

DATA MINING

Data Mining

Data mining is most commonly defined as

the process of using computers and
automation to search large sets of data for
patterns and trends, turning those findings
into business insights and predictions.
Data Mining

Data mining goes beyond the search

process, as it uses data to evaluate
future probabilities and develop
actionable analyses.
What Are the Benefits
of Data Mining?

Since we live and work in a data-centric world, it’s

essential to get as many advantages as possible.
Data mining provides us with the means of
resolving problems and issues in this challenging
information age.
Data mining benefits include:
• It helps companies gather reliable
information.
• It helps businesses make profitable
production and operational adjustments
• It helps businesses make informed
decisions
Data mining benefits include:
• It helps data scientists quickly initiate automated
predictions of behaviors and trends and discover
hidden patterns.
• It helps detect credit risks and fraud
• It helps data scientists easily analyze enormous
amounts of data quickly.
• Data scientists can use the information to detect
fraud, build risk models, and improve product
safety
Questions that can be answered
through Data Mining
• What kind of customers should
a business target in its next ad
campaign?
• What patterns in behavior are
connected to financial fraud?
Questions that can be answered
through Data Mining
• What are the buying patterns of
customers based on their demographics?
• What are the factors influencing the
success of marketing campaigns?
DATA
ANALYST
Data Analyst
• A data analyst collects, cleans, and
interprets data sets in order to answer a
question or solve a problem. They work
in many industries, including business,
finance, criminal justice, science,
medicine, and government.
Data analysis can take different forms,
depending on the question you’re trying to
answer.
TYPES OF DATA ANALYSIS
-Descriptive analysis tells us what happened
-Diagnostic analysis tells us why it happened
-Predictive analytics forms projections about
the future
-Prescriptive analysis creates actionable
advice on what actions to take.
Phases / Steps in
Analyzing Data
•Identify the data you want to analyze
•Collect the data
•Clean the data in preparation for analysis
•Analyze the data
•Interpret the results of the analysis
Classification
Algorithm
Data Mining
Data Mining Algorithm
-Classification Algorithms.

• Naïve Bayes
• Support Vector Machine
• K-Nearest Neighbours
• Decision Tree
DATASET
• a collection of related sets of information
that is composed of separate elements but
can be manipulated as a unit by a computer:

• They are mostly used in fields like machine

learning, business, and government to gain
insights, make informed decisions, or train
algorithms.
• Datasets play a vital role in every facet of our lives. In
this modern day, all devices are made to collect data
and create datasets for advertisers/businesses to
personalize their advertisements to consumers. The
limitation is that as a result of over-reliance on datasets,
the mining techniques of data have become ethically
questionable with many social media applications and
websites getting criticism for data privacy issues, data
leaks, and so on. As a result, data is the currency and
many companies mine user information without the
user’s knowledge to create datasets.
Steps to Build a
Classification Model
Steps to Build a
Classification Model
Continuation in Building
a Classification Model
Continuation in Building
a Classification Model
Classification Algorithm
• The Classification algorithm is a Supervised
Learning technique that is used to identify the
category of new observations on the basis of
training data. In Classification, a program learns
from the given dataset or observations and then
classifies new observation into a number of
classes or groups. Such as, Yes or No, 0 or 1,
Spam or Not Spam, cat or dog, etc. Classes can
be called as targets/labels or categories.
• It is an important task in data mining
because it enables organizations to make
data-driven decisions. For example,
businesses can assign or classify
sentiments of customer feedback, reviews,
or social media posts to understand how
well their products or services are doing.
Classification Technique
Categories

Binary-Class Classification Multi-Class Classification

Classification Technique
Categories

• Classification techniques can be divided

into categories - binary classification and
multi-class classification. Binary
classification assigns labels to instances
into two classes, such as fraudulent or
non-fraudulent. Multi-class classification
assigns labels into more than two classes,
such as happy, neutral, or sad.
Types of Classification
Algorithm
Some Types of
Classification Algorithm

• Random Forest
• Naïve Bayes
Random Forest Algorithm
• Random Forest is a classifier that contains a number of
decision trees on various subsets of the given dataset and
takes the average to improve the predictive accuracy of
that dataset."

* The greater number of trees in the forest leads to higher

accuracy and prevents the problem of overfitting.
RANDOM FOREST DIAGRAM
Assumptions for Random Forest

• Since the random forest combines

multiple trees to predict the class of the
dataset, it is possible that some decision
trees may predict the correct output,
while others may not. But together, all
the trees predict the correct output.
Assumptions for Random Forest
Why Use Random Forest
Random Forest Applications
Advantages and
Disadvantages
Advantages
and Disadvantages
WEKA
Weka is a collection of machine learning
algorithms for solving real-world data mining
problems. It is written in Java / Python
programming language and runs on almost any
platform. The algorithms can either be applied
directly to a dataset or called from your own
Java or Python code.
• Naïve Bayes
• Support Vector Machine
• K-Nearest Neighbours
• Decision Tree
Naïve Bayes
Classification
Naïve Bayes and Data Mining /
Machine Learning
• Applying Bayes'theorem:

• P(Yes|Sunny)= P(Sunny|Yes)*P(Yes)/P(Sunny)
• P(Sunny|Yes)= 3/10= 0.3
• P(Sunny)= 0.35
• P(Yes)=0.71
• So P(Yes|Sunny) = 0.3*0.71/0.35= 0.60

• Hence on a Sunny day, Player can play the game.

Advantages and
Disadvantages
Where is Naïve Bayes used
Where is Naïve Bayes used
Thank You
• Definition
• Why Use That Algorithm
• Advantages and Disadvantages
• Real World Case Example
= Algorithm Execution

Data Mining
No ratings yet
Data Mining
20 pages
Data Mining
No ratings yet
Data Mining
254 pages
Combinepdf 1
No ratings yet
Combinepdf 1
74 pages
Data Mining at UVA: New Horizons in Teaching and Learning Conference
No ratings yet
Data Mining at UVA: New Horizons in Teaching and Learning Conference
19 pages
Unit 1 Data Mining
No ratings yet
Unit 1 Data Mining
15 pages
Data Mining
No ratings yet
Data Mining
20 pages
SQL BI Course for IT Professionals
No ratings yet
SQL BI Course for IT Professionals
43 pages
Introduction To Data Mining
No ratings yet
Introduction To Data Mining
48 pages
Week001-Module (1) Merged
No ratings yet
Week001-Module (1) Merged
122 pages
BI Unit 3 Part 1
No ratings yet
BI Unit 3 Part 1
51 pages
Data Mining and Visualization
No ratings yet
Data Mining and Visualization
8 pages
Unit 3
No ratings yet
Unit 3
22 pages
CSE2021 - MODULE 1ppt
No ratings yet
CSE2021 - MODULE 1ppt
62 pages
Data Mining
No ratings yet
Data Mining
15 pages
Presentation 1
No ratings yet
Presentation 1
28 pages
Classification in Data Mining
No ratings yet
Classification in Data Mining
14 pages
Dmi Unit 1 - 186 - N3
No ratings yet
Dmi Unit 1 - 186 - N3
12 pages
Data Mining Merged PDF CS1 CS8
No ratings yet
Data Mining Merged PDF CS1 CS8
272 pages
Data Mining
No ratings yet
Data Mining
21 pages
Introduction To Data Mining Unit1
100% (1)
Introduction To Data Mining Unit1
37 pages
Comp 6838
No ratings yet
Comp 6838
41 pages
Data Mining: Concepts and Applications
No ratings yet
Data Mining: Concepts and Applications
11 pages
Data Mining in Image and Video Analysis
No ratings yet
Data Mining in Image and Video Analysis
23 pages
An Introduction To Data Mining
No ratings yet
An Introduction To Data Mining
47 pages
Data Mining and Data Warehouse BY: Dept. of Computer Science Engineering
No ratings yet
Data Mining and Data Warehouse BY: Dept. of Computer Science Engineering
10 pages
Data Mining & BI Course Guide
No ratings yet
Data Mining & BI Course Guide
25 pages
Data Science Module 1 Notes
No ratings yet
Data Science Module 1 Notes
16 pages
Datamining: by Guan Hang Su Cs157A Section 2 Fall 2005
0% (1)
Datamining: by Guan Hang Su Cs157A Section 2 Fall 2005
31 pages
Introduction To Data Mining
No ratings yet
Introduction To Data Mining
38 pages
Study Material I
No ratings yet
Study Material I
140 pages
Data Mining
No ratings yet
Data Mining
8 pages
Busiess Analytics Data Mining Lecture 3
No ratings yet
Busiess Analytics Data Mining Lecture 3
52 pages
Classification Chapter 5
No ratings yet
Classification Chapter 5
26 pages
Data Mining
No ratings yet
Data Mining
24 pages
Data Mining Overview and Applications
No ratings yet
Data Mining Overview and Applications
125 pages
Understanding Data Mining Essentials
No ratings yet
Understanding Data Mining Essentials
5 pages
Module 7 Introduction To Data Mining
No ratings yet
Module 7 Introduction To Data Mining
14 pages
Data Mining
No ratings yet
Data Mining
31 pages
Introduction to Data Mining Concepts
No ratings yet
Introduction to Data Mining Concepts
27 pages
Major Issues in Data Mining
80% (5)
Major Issues in Data Mining
45 pages
Data Mining: Applications and Techniques
No ratings yet
Data Mining: Applications and Techniques
60 pages
Data Mining in Search Engine Analytics
No ratings yet
Data Mining in Search Engine Analytics
7 pages
LectureSlide 1
No ratings yet
LectureSlide 1
12 pages
Data Mining: Concepts and Applications
No ratings yet
Data Mining: Concepts and Applications
36 pages
Full and Correct Notes For FDS-6th Bca
No ratings yet
Full and Correct Notes For FDS-6th Bca
83 pages
4 Datamining
No ratings yet
4 Datamining
90 pages
Presentation Data Mining
No ratings yet
Presentation Data Mining
22 pages
IT in Society - Data Mining
No ratings yet
IT in Society - Data Mining
22 pages
Unit 1
No ratings yet
Unit 1
59 pages
Data Mining
No ratings yet
Data Mining
7 pages
Lecture 1 & 2 - Introduction To Data Mining2
No ratings yet
Lecture 1 & 2 - Introduction To Data Mining2
19 pages
DWM Unit 3 Final Notes
No ratings yet
DWM Unit 3 Final Notes
47 pages
Data Types
No ratings yet
Data Types
2 pages
Topic 3 Data Mining For Business Intelligence
No ratings yet
Topic 3 Data Mining For Business Intelligence
49 pages
Data Mining Note Sixth Semester ..
No ratings yet
Data Mining Note Sixth Semester ..
79 pages
Data Mining Concepts
No ratings yet
Data Mining Concepts
3 pages
Mining Temporal Attack Patterns From Cyberthreat Intelligence Reports
No ratings yet
Mining Temporal Attack Patterns From Cyberthreat Intelligence Reports
14 pages
ECOC
No ratings yet
ECOC
23 pages
Answer Key
No ratings yet
Answer Key
8 pages
ADL Exp File
No ratings yet
ADL Exp File
56 pages
File Word Tiểu Luận Nhóm 8
No ratings yet
File Word Tiểu Luận Nhóm 8
31 pages
Face Mask Detection Final Report Draft
No ratings yet
Face Mask Detection Final Report Draft
69 pages
Tweedie Regression for Insurance
No ratings yet
Tweedie Regression for Insurance
2 pages
Machine Learning Basics Infographic With Algorithm Examples
No ratings yet
Machine Learning Basics Infographic With Algorithm Examples
1 page
Applied Machine Learning Course Overview
100% (4)
Applied Machine Learning Course Overview
22 pages
Deep Learning A Visual Approach Glassner Available Any Format
No ratings yet
Deep Learning A Visual Approach Glassner Available Any Format
86 pages
A Classification of Quran Verses Using Deep Learning
No ratings yet
A Classification of Quran Verses Using Deep Learning
14 pages
DL Practical
No ratings yet
DL Practical
25 pages
Ai Unit 3
No ratings yet
Ai Unit 3
23 pages
UCI ML Datasets for Beginners
No ratings yet
UCI ML Datasets for Beginners
23 pages
Expert Systems With Applications: Marcin Michał Miro Nczuk, Jarosław Protasiewicz
No ratings yet
Expert Systems With Applications: Marcin Michał Miro Nczuk, Jarosław Protasiewicz
19 pages
Module 1
No ratings yet
Module 1
65 pages
Comparing Categorical Encoding Methods
No ratings yet
Comparing Categorical Encoding Methods
11 pages
Unit 3 - Supervise Learning Classification
No ratings yet
Unit 3 - Supervise Learning Classification
23 pages
Understanding Deep Learning
No ratings yet
Understanding Deep Learning
100 pages
Multi-Class Classification
No ratings yet
Multi-Class Classification
52 pages
Mcsl-228 Practical Viva Revision
No ratings yet
Mcsl-228 Practical Viva Revision
26 pages
Module 2
No ratings yet
Module 2
151 pages
KNN Updated
No ratings yet
KNN Updated
30 pages
7 Classification Algorithms in Python
No ratings yet
7 Classification Algorithms in Python
9 pages
Performance Assessment of Various Deep Learning Ba 240902 075649
No ratings yet
Performance Assessment of Various Deep Learning Ba 240902 075649
6 pages
ML PR-3
No ratings yet
ML PR-3
9 pages
Text Classification Quiz
0% (3)
Text Classification Quiz
12 pages
Visual Introduction Deep Learning v21-02
100% (6)
Visual Introduction Deep Learning v21-02
236 pages
Data Science Professional - 1z0-1110-23 - 55QA - New
No ratings yet
Data Science Professional - 1z0-1110-23 - 55QA - New
20 pages
A Course in Machine Learning
No ratings yet
A Course in Machine Learning
50 pages

Classification Algorithm

Uploaded by

Classification Algorithm

Uploaded by

DATA MINING

Data mining is most commonly defined as

Data mining goes beyond the search

Since we live and work in a data-centric world, it’s

• They are mostly used in fields like machine

Binary-Class Classification Multi-Class Classification

• Classification techniques can be divided

* The greater number of trees in the forest leads to higher

• Since the random forest combines

• Hence on a Sunny day, Player can play the game.

You might also like