DS Chapter 5
DS Chapter 5
College of computing
Department of Information Science
Introduction to Data Science
Dibaba A. (MSc)
• Market Segmentation
• Image segmentation
Useful for:
Partitioning Clustering
Density-Based Clustering
Hierarchical Clustering
Fuzzy Clustering
9 Clustering Algorithms
• Noise;
• Fraud;
• Attacks.
15 Types of Anomalies or Outliers
Collective Anomaly
There are two forms of data analysis that can be used to extract
models describing important classes or predict future data trends.
These two forms are as follows:
1. Classification
2. Prediction
We use classification and prediction to extract a model,
representing the data classes to predict future data trends.
Classification predicts the categorical labels of data with the
prediction models.
This analysis provides us with the best understanding of the data at
18 Cont’d…
The set of input data and the corresponding outputs are given to the
algorithm. So, the training data set includes the input data and their
associated class labels.
21 Classification Cont’d…