4 3DM - Classification-Methods
4 3DM - Classification-Methods
Introduction
Data Preparation and Preprocessing
Data Representation
Classification Methods
Evaluation
Clustering Methods
Mid Exam
Association Rules
Knowledge Representation
Special Case study : Document clustering
Discussion of Case studies by students
Out Lines
3
Algorithm
Given a new instance x,
Advantage
Robust to noisy training data
Effective if the training data is large
Disadvantage
Need to determine value of parameter K
Distance based learning is not clear which type of
distance to use and which attribute to use to
produce the best results.
Computation cost is quite high because we need
to compute distance of each query instance to all
training samples.
Next …
8