Mod 3a
Mod 3a
MODULE-3
CLASSIFICATION
Classification
A form of data analysis that extracts models
describing important data classes.
Classifiers, predict categorical (discrete,
unordered) class labels.
Ex: classification model to categorize bank
loan applications as either safe or risky.
Applications: fraud detection, target
marketing, performance prediction,
manufacturing, and medical diagnosis.
Classifiers & class labels
Loan application: safe/risky
Marketing: yes/no
Medical: treatment A / treatment B / treatment C
Predictors
Continuous valued functions or ordered value
Ex: a customers spending capacity
Method: Regression analysis
Classification and numeric prediction are the
two major types of prediction problems.
Classification – 2 step process
classification rules.
Ex:
Advantages
Does not require domain knowledge or parameter
setting, and therefore is appropriate for exploratory
knowledge discovery.
Can handle multidimensional data.
Representation of acquired knowledge in tree form is
intuitive and generally easy to understand by humans.
Learning and classification steps of decision tree
induction are simple and fast.
Good accuracy