Model Evaluation and Improvement 2

This document provides an overview of machine learning modelling basics. It discusses the different types of machine learning algorithms, including supervised algorithms like classification and regression, as well as unsupervised algorithms like clustering and market basket analysis. The document also covers topics like selecting the appropriate model for predictive vs descriptive problems, training models using holdout validation and k-fold cross validation, overfitting vs underfitting, and bias-variance tradeoff. The goal is to introduce machine learning practitioners to fundamental modelling concepts.

Uploaded by

Subhradeep Pal

MODELLING BASICS

Presenter: Prof. Amit Kumar Das


Assistant Professor,
Dept. of Computer Science and Engg.,
Institute of Engineering & Management.
WHAT IS MODELLING IN THE CONTEXT OF MACHINE LEARNING?
WHAT ARE THE DIFFERENT ML ALGORITHMS?

• Supervised
  • Classification – kNN, Naïve Bayes, Decision Tree, etc.
  • Regression – Simple Linear Regression, etc. (Logistic Regression, despite its name, is normally used for classification)
• Unsupervised
  • Clustering – k-Means
  • Market Basket Analysis
SUPERVISED LEARNING - CLASSIFICATION

[Diagram: Labelled Training Data → Classifier → Classification Model ← Test Data]
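The flow on this slide (learn from labelled training data, then classify unseen test data) can be sketched with k-Nearest Neighbour, one of the classifiers listed earlier. This is a minimal illustration only: the function name `knn_predict`, the two-feature data points and the "win"/"loss" labels are made up for the example and do not come from the slides.

```python
import math
from collections import Counter


def knn_predict(train_points, train_labels, query, k=3):
    """Classify `query` by majority vote among its k nearest training points."""
    # Pair the Euclidean distance from each training point to the query with its label.
    distances = sorted(
        (math.dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    nearest_labels = [label for _, label in distances[:k]]
    # The most common label among the k nearest neighbours is the prediction.
    return Counter(nearest_labels).most_common(1)[0][0]


# Hypothetical labelled training data: (feature_1, feature_2) with a class label.
train_points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (5.0, 5.0), (5.2, 4.9), (4.8, 5.1)]
train_labels = ["loss", "loss", "loss", "win", "win", "win"]

# An unseen test point is classified from the labels of its neighbours.
prediction = knn_predict(train_points, train_labels, (5.1, 5.0), k=3)
print(prediction)
```

Here the "model" is simply the stored training data; other classifiers such as Naïve Bayes or Decision Tree build an explicit model during training instead.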
SUPERVISED LEARNING - REGRESSION

y = α + βx
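As a rough sketch of how α and β in y = α + βx can be estimated, the snippet below uses the ordinary least squares formulas for simple linear regression. The function name `fit_simple_linear_regression` and the sample data are assumptions made for the illustration; the slides do not say which estimation method is intended.

```python
def fit_simple_linear_regression(xs, ys):
    """Return (alpha, beta) for y = alpha + beta * x using ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # beta = covariance(x, y) / variance(x); the 1/n factors cancel.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    beta = sxy / sxx
    # The fitted line passes through the point (mean_x, mean_y).
    alpha = mean_y - beta * mean_x
    return alpha, beta


# Hypothetical data generated from y = 1 + 2x with no noise.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.0, 5.0, 7.0, 9.0, 11.0]

alpha, beta = fit_simple_linear_regression(xs, ys)
print(f"alpha = {alpha:.2f}, beta = {beta:.2f}")
```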
UNSUPERVISED LEARNING

[Diagram: Unlabelled Data → Unsupervised Learning Model → Grouped data / Clusters]


UNSUPERVISED LEARNING - CLUSTERING

[Figure: data points grouped into four clusters, labelled Cluster 1 to Cluster 4]
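A minimal sketch of how k-Means, the clustering algorithm named in these slides, could form such groups: it alternates between assigning each point to its nearest centroid and moving each centroid to the mean of its assigned points. The function `k_means`, the choice of initial centroids and the sample points are all assumptions for the illustration; real implementations usually pick the initial centroids randomly or with a smarter seeding scheme.

```python
import math


def k_means(points, initial_centroids, max_iterations=100):
    """Cluster `points` around len(initial_centroids) centroids."""
    centroids = list(initial_centroids)
    assignments = []
    for _ in range(max_iterations):
        # Assignment step: each point joins the cluster of its nearest centroid.
        assignments = [
            min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for c in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members))
                )
            else:
                # Keep an empty cluster's centroid where it was.
                new_centroids.append(centroids[c])
        if new_centroids == centroids:
            break  # Converged: no centroid moved.
        centroids = new_centroids
    return centroids, assignments


# Hypothetical unlabelled data with two visually obvious groups.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 8.5), (8.5, 9.0)]
centroids, assignments = k_means(points, initial_centroids=[points[0], points[3]])
print(assignments)
```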
UNSUPERVISED LEARNING – MARKET BASKET ANALYSIS
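The slide gives no detail here, so the following is only a hedged sketch of the two measures most commonly used in market basket analysis: the support of an itemset (the fraction of transactions that contain it) and the confidence of a rule A → B (the support of A and B together divided by the support of A). The function names and the shopping baskets are invented for the example.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= basket for basket in transactions) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent."""
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Hypothetical shopping baskets, one set of items per transaction.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]

bread_milk_support = support(transactions, {"bread", "milk"})
bread_to_milk_confidence = confidence(transactions, {"bread"}, {"milk"})
print(bread_milk_support, round(bread_to_milk_confidence, 3))
```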
SELECTING A MODEL

• Predictive models (supervised)
  • Predict the value of a category or class (classification)
    • Problems that can be solved: prediction of win/loss, fraudulent transactions, etc.
    • Examples: k-Nearest Neighbor (kNN), Naïve Bayes, Decision Tree, etc.
  • Predict numerical values of the target (regression)
    • Problems that can be solved: prediction of revenue growth, rainfall amount, etc.
    • Examples: Linear Regression, etc. (Logistic Regression predicts class probabilities, so in practice it is used for classification)
SELECTING A MODEL
• Descriptive models (unsupervised)
  • Group together similar data instances
  • Problems that can be solved: customer grouping or segmentation based on social, demographic, ethnic, etc. factors
  • Most popular model for clustering is k-Means
TRAIN A MODEL – HOLDOUT METHOD
[Diagram: Input Data is split into Training Data (70% - 80%) and Test Data (20% - 30%); Training Data → Trained Model; Trained Model + Test Data → Model Performance]
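The holdout split can be sketched as follows: shuffle the input data, keep 70% to 80% for training and hold the rest back for testing. The helper name `holdout_split`, its parameters and the integer "records" are assumptions for the illustration, not part of the slides.

```python
import random


def holdout_split(records, train_fraction=0.8, seed=0):
    """Shuffle `records` and split them into (training_data, test_data)."""
    shuffled = list(records)
    # A fixed seed makes the split reproducible.
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical input data: ten records identified by number.
records = list(range(10))
train, test = holdout_split(records, train_fraction=0.7)
print(len(train), "training records,", len(test), "test records")
```

The model is then trained on `train` only, and its performance is measured on `test`, which it has never seen.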
K-FOLD CROSS-VALIDATION – OVERALL APPROACH
K-FOLD CROSS-VALIDATION – DETAILED APPROACH
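The figures for these slides did not survive, so here is a hedged sketch of the usual k-fold procedure: divide the data into k folds of (nearly) equal size, then use each fold once as test data while the remaining k - 1 folds form the training data. The generator name `k_fold_indices` is an assumption; shuffling the data beforehand is common but is left out to keep the example short.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for each of the k folds."""
    indices = list(range(n_samples))
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds, start = [], 0
    for size in fold_sizes:
        folds.append(indices[start:start + size])
        start += size
    for i in range(k):
        test_idx = folds[i]
        train_idx = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train_idx, test_idx


# Hypothetical data set of 10 records, evaluated with 5 folds.
splits = list(k_fold_indices(10, 5))
for fold_number, (train_idx, test_idx) in enumerate(splits, start=1):
    print(f"fold {fold_number}: test on {test_idx}, train on {len(train_idx)} records")
```

The model's overall performance is typically reported as the average of the k per-fold scores.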
BOOTSTRAP SAMPLING / BOOTSTRAPPING
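As a rough illustration, bootstrap sampling draws a training set of the same size as the original data by sampling with replacement, so some records appear more than once and others not at all. The records left out (often called "out-of-bag") can serve as test data. The helper `bootstrap_sample` and the sample records are assumptions for this sketch.

```python
import random


def bootstrap_sample(records, seed=None):
    """Return (sample, out_of_bag) where sample is drawn with replacement."""
    rng = random.Random(seed)
    n = len(records)
    # Draw n positions uniformly at random; repeats are allowed.
    chosen = [rng.randrange(n) for _ in range(n)]
    chosen_set = set(chosen)
    sample = [records[i] for i in chosen]
    # Records that were never drawn can be used for testing.
    out_of_bag = [records[i] for i in range(n) if i not in chosen_set]
    return sample, out_of_bag


# Hypothetical data set of ten records.
records = ["r0", "r1", "r2", "r3", "r4", "r5", "r6", "r7", "r8", "r9"]
sample, out_of_bag = bootstrap_sample(records, seed=42)
print("sample:", sample)
print("out-of-bag:", out_of_bag)
```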
TRAIN A MODEL – UNDER VS. OVER FIT

[Figure panels: Under fit | Balanced fit | Over fit]
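The three cases can be reproduced on a toy problem. The sketch below uses k-nearest-neighbour regression purely as a convenient stand-in (the slides do not say which model the figure shows): averaging over all training points ignores the input and under fits, using only the single nearest point memorises the noise and over fits, and a small k in between gives a more balanced fit. All names and data values are invented for the example; the training targets are y = x with alternating noise of +1 and -1.

```python
def knn_regress(train_xs, train_ys, query_x, k):
    """Predict y at query_x as the mean y of the k nearest training points."""
    nearest = sorted(
        zip(train_xs, train_ys), key=lambda pair: abs(pair[0] - query_x)
    )[:k]
    return sum(y for _, y in nearest) / k


def mean_squared_error(train_xs, train_ys, eval_xs, eval_ys, k):
    """Mean squared error of the k-NN regressor on the evaluation points."""
    errors = [
        (knn_regress(train_xs, train_ys, x, k) - y) ** 2
        for x, y in zip(eval_xs, eval_ys)
    ]
    return sum(errors) / len(errors)


# Hypothetical training data: y = x plus alternating noise of +1 / -1.
train_xs = [0, 1, 2, 3, 4, 5, 6, 7]
train_ys = [1, 0, 3, 2, 5, 4, 7, 6]
# Hypothetical noise-free test data from the same underlying line y = x.
test_xs = [0.4, 2.4, 4.4, 6.4]
test_ys = [0.4, 2.4, 4.4, 6.4]

results = {}
for k in (8, 2, 1):  # under fit, balanced fit, over fit
    train_mse = mean_squared_error(train_xs, train_ys, train_xs, train_ys, k)
    test_mse = mean_squared_error(train_xs, train_ys, test_xs, test_ys, k)
    results[k] = (train_mse, test_mse)
    print(f"k={k}: train MSE={train_mse:.2f}, test MSE={test_mse:.2f}")
```

With k = 1 the training error is zero because every training point is its own nearest neighbour, yet the test error is higher than with k = 2. A low training error alone is therefore not evidence of a good model.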


TRAIN A MODEL – BIAS VS. VARIANCE
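The figure for this slide is missing, so the standard decomposition is given here as a reference point. It assumes the target is generated as y = f(x) + ε with zero-mean noise of variance σ², and that the expectation is taken over training sets and noise. The symbols f, f̂ and σ are introduced for this note and do not appear in the slides.

```latex
\mathbb{E}\big[(y - \hat{f}(x))^2\big]
  = \underbrace{\big(\mathbb{E}[\hat{f}(x)] - f(x)\big)^2}_{\text{bias}^2}
  + \underbrace{\mathbb{E}\big[(\hat{f}(x) - \mathbb{E}[\hat{f}(x)])^2\big]}_{\text{variance}}
  + \underbrace{\sigma^2}_{\text{irreducible noise}}
```

An under fit model tends to have high bias and low variance, an over fit model tends to have low bias and high variance, and a balanced fit aims to keep the sum of the two small.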
THANK YOU & QUESTIONS PLEASE!