Model Evaluation and Improvement 2
Supervised
Classification – KNN, Naive Bayes, Decision Tree, etc.
Unsupervised
Clustering – K-Means
Market Basket Analysis
SUPERVISED LEARNING - CLASSIFICATION
[Figure: classification diagram; surviving labels: "Test Data", "Intel"]
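As a minimal sketch of one of the classifiers named in the outline (KNN), the snippet below labels a new point by majority vote among its nearest training points. The function name `knn_predict`, the toy data, and the choice of Euclidean distance are assumptions made for illustration; they do not come from the slides.

```python
from collections import Counter
import math

def knn_predict(train_points, train_labels, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Rank training indices by Euclidean distance to the query point.
    order = sorted(range(len(train_points)),
                   key=lambda i: math.dist(train_points[i], query))
    votes = Counter(train_labels[i] for i in order[:k])
    return votes.most_common(1)[0][0]

# Toy data (made up): two features per point, two classes.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1),
          (4.0, 4.2), (4.1, 3.9), (3.8, 4.0)]
labels = ["A", "A", "A", "B", "B", "B"]

prediction = knn_predict(points, labels, (1.1, 1.0), k=3)
```

An odd `k` avoids ties when there are two classes.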
SUPERVISED LEARNING - REGRESSION
y = α + βx
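A hedged sketch of how α and β in the line above could be estimated by ordinary least squares. The helper name `fit_line` and the sample values are hypothetical; the slide gives only the model equation.

```python
def fit_line(xs, ys):
    """Return (alpha, beta) minimizing squared error for y = alpha + beta * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Slope: co-variation of x and y divided by the variation of x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    beta = sxy / sxx
    # Intercept: the fitted line passes through the point of means.
    alpha = mean_y - beta * mean_x
    return alpha, beta

xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]   # generated from y = 1 + 2x, no noise
alpha, beta = fit_line(xs, ys)
```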
UNSUPERVISED LEARNING
[Figure: unlabelled data grouped into Cluster 1, Cluster 2, Cluster 3 and Cluster 4]
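A minimal sketch of K-Means, the clustering method named in the outline, written as the usual alternation of an assignment step and a centroid update step. The function name `kmeans`, the fixed starting centroids, and the toy points are assumptions. Two clusters are used here for brevity rather than the four in the figure.

```python
import math

def kmeans(points, centroids, iterations=10):
    """Alternate between assigning points and recomputing centroids."""
    centroids = list(centroids)
    assignment = []
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        assignment = [min(range(len(centroids)),
                          key=lambda c: math.dist(p, centroids[c]))
                      for p in points]
        # Update step: move each centroid to the mean of its members.
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignment) if a == c]
            if members:  # an empty cluster keeps its old centroid
                centroids[c] = tuple(sum(coord) / len(members)
                                     for coord in zip(*members))
    return assignment, centroids

# Toy unlabelled data (made up): two well-separated groups.
data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
assignment, centroids = kmeans(data, centroids=[(0, 0), (10, 10)])
```

In practice the starting centroids are usually chosen at random, and the result can depend on that choice.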
UNSUPERVISED LEARNING – MARKET BASKET ANALYSIS
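Market basket analysis is commonly described through support and confidence of item rules. The sketch below computes both for a made-up set of baskets. The function names and the items are hypothetical, and the slides may use a different formulation.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= t for t in transactions) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Estimated P(consequent | antecedent) for the rule antecedent -> consequent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)

# Toy baskets (made up).
baskets = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

s = support(baskets, {"bread", "milk"})         # 2 of 4 baskets
c = confidence(baskets, {"bread"}, {"milk"})    # 2 of the 3 bread baskets
```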
SELECTING A MODEL
[Figure: holdout flow. Input Data is used to build the Trained Model; the Test Data (20% - 30% of the data) is used to measure Model Performance.]
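A minimal sketch of the holdout split suggested by the 20% - 30% test share above. The function name `holdout_split`, the 25% default, and the seed are assumptions chosen for illustration.

```python
import random

def holdout_split(rows, test_fraction=0.25, seed=0):
    """Shuffle the rows and hold out `test_fraction` of them for testing."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)   # shuffle a copy, not the input
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]   # (train, test)

rows = list(range(20))
train, test = holdout_split(rows, test_fraction=0.25)
```

The model is fitted on `train` only; `test` is touched once, to estimate performance on unseen data.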
K-FOLD CROSS-VALIDATION – OVERALL APPROACH
K-FOLD CROSS-VALIDATION – DETAILED APPROACH
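A hedged sketch of the index bookkeeping behind k-fold cross-validation: the data is cut into k folds, and each fold serves once as the test set while the remaining folds form the training set. The generator name `k_fold_indices` is hypothetical, and the rows are assumed to have been shuffled beforehand.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    # Spread any remainder so fold sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size

folds = list(k_fold_indices(n_samples=10, k=5))
```

A model would be trained and scored once per fold, and the k scores averaged to give the overall estimate.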
BOOTSTRAP SAMPLING / BOOTSTRAPPING
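A minimal sketch of bootstrap sampling, assuming the usual definition: draw as many rows as the dataset holds, with replacement, so some rows repeat and others are left out. The left-out rows are often called "out-of-bag" and can serve as a test set. The function name and the seed are assumptions.

```python
import random

def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement; return (sample, out_of_bag)."""
    n = len(rows)
    picked = [rng.randrange(n) for _ in range(n)]
    picked_set = set(picked)
    sample = [rows[i] for i in picked]
    # Rows never drawn are the out-of-bag rows.
    out_of_bag = [rows[i] for i in range(n) if i not in picked_set]
    return sample, out_of_bag

rng = random.Random(42)   # fixed seed so the sketch is repeatable
rows = list(range(10))
sample, out_of_bag = bootstrap_sample(rows, rng)
```

Repeating the draw many times gives many training sets from one dataset, which is useful when the data is small.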
TRAIN A MODEL – UNDER VS. OVER FIT
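One way to see under- and over-fitting is to compare training error with test error for models of different flexibility. The sketch below uses made-up noisy linear data and three hypothetical models: a constant (too simple), a least-squares line, and a memorizer that returns the label of the nearest training point (too flexible). The data generator, seed, and model choices are all assumptions.

```python
import random

def mse(model, xs, ys):
    """Mean squared error of `model` on the given points."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

rng = random.Random(0)

def make_data(n):
    # Assumed ground truth: y = 1 + 2x plus Gaussian noise.
    xs = [rng.uniform(0, 10) for _ in range(n)]
    ys = [1.0 + 2.0 * x + rng.gauss(0, 2.0) for x in xs]
    return xs, ys

train_x, train_y = make_data(30)
test_x, test_y = make_data(30)

# Underfit: ignore x and always predict the training mean.
mean_x = sum(train_x) / len(train_x)
mean_y = sum(train_y) / len(train_y)
def constant_model(x):
    return mean_y

# Reasonable fit: least-squares line.
beta = (sum((x - mean_x) * (y - mean_y) for x, y in zip(train_x, train_y))
        / sum((x - mean_x) ** 2 for x in train_x))
alpha = mean_y - beta * mean_x
def line_model(x):
    return alpha + beta * x

# Overfit: memorize the training set (nearest training point's label).
def memorizer(x):
    nearest = min(range(len(train_x)), key=lambda i: abs(train_x[i] - x))
    return train_y[nearest]

for name, model in [("constant", constant_model),
                    ("line", line_model),
                    ("memorizer", memorizer)]:
    print(name, mse(model, train_x, train_y), mse(model, test_x, test_y))
```

The memorizer reproduces its training labels exactly, so its training error is zero by construction; a gap between that and its test error is the signature of over-fitting. The constant model has a high error on both sets, which is the signature of under-fitting. Exact numbers depend on the seed.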