05 - Machine Learning
05 - Machine Learning
Classical
learning
Neural nets
and deep
learning
Machine Learning Approaches
Classical
learning
Ensemble
learning
Reinforcement
learning
Genetic
Algorithm Q-Learning …
(GA)
Machine Learning Approaches
Neural nets
(NN) and
deep learning
Back
Feed forward Convolutional
Propagation Recurrent NN ….
NN NN
NN
Supervised vs. Unsupervised Learning
◼ Classification
◼ predicts categorical class labels (discrete or nominal)
◼ Target marketing
◼ Medical diagnosis
◼ Fraud detection
Supervised Learning: Drawbacks
Classification
Algorithms
Training
Data
Classifier
Testing
Data Unseen Data
(Jeff, Professor, 4)
NAME RANK YEARS TENURED
Tom A ssistan t P ro f 2 no Tenured?
M erlisa A sso c iate P ro f 7 no
G eo rg e P ro fesso r 5 yes
J o sep h A ssistan t P ro f 7 yes
Machine learning in data mining
Issues regarding to classification and prediction
Issues: Data Preparation
◼ Data cleaning
◼ Preprocess data in order to reduce noise and handle
missing values
◼ Relevance analysis (feature selection)
◼ Remove the irrelevant or redundant attributes
◼ Data transformation
◼ Generalize and/or normalize data
Issues: Evaluating Classification Methods
◼ Accuracy
◼ classifier accuracy: predicting class label
◼ Speed
◼ time to construct the model (training time)
Accuracy =
Precision =
Recall =
F1-Score =
Issues: Evaluating Regression Methods
Issues: Evaluating Regression Methods
R2 (R-squared):
▪ Underfitting happens when a model is not good enough to understand all the
details in the data
→ Poor performance on both the training and test sets
▪ Overfitting occurs when a model is too complex and memorizes the training
data too well
→ good performance on the training set but poor performance on the test set
Other machine learning models
▪ Ensemble learning:
Other machine learning models
▪ Ensemble learning: