03 Machine Learning Overview
Overview
Course: Artificial Intelligence Fundamentals
Traditional programming: Rules + Data → Answers
Machine learning: Data + Answers → Rules
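The contrast above can be sketched in a few lines. This is only an illustrative toy, not a real spam filter: the function names, the "number of links" feature, and the data are all made up for the example. In the first half a human writes the rule; in the second half the rule (a threshold) is derived from data plus answers.

```python
# Traditional programming: a human writes the rule, the program produces answers.
def is_spam_rule(num_links: int) -> bool:
    return num_links > 3  # hand-picked threshold (hypothetical)


# Machine learning: data + answers go in, the rule (here, a threshold) comes out.
def learn_threshold(values, labels):
    """Return the threshold t for which `value > t` agrees with the most labels."""
    candidates = sorted(set(values))
    best_t, best_correct = candidates[0], -1
    for t in candidates:
        correct = sum((v > t) == y for v, y in zip(values, labels))
        if correct > best_correct:
            best_t, best_correct = t, correct
    return best_t


links = [0, 1, 2, 5, 7, 9]                      # data (made up)
spam = [False, False, False, True, True, True]  # answers (made up)

threshold = learn_threshold(links, spam)


def learned_rule(num_links: int) -> bool:
    """The rule produced by learning, as opposed to written by hand."""
    return num_links > threshold


print(threshold, learned_rule(4))
```

With this toy data the learned threshold separates the two groups perfectly; real data is rarely that clean.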
• Automatic Trading
• Fraud Detection
• Self-driving cars
• Playing chess/poker/go
Supervised
• Discrete Data: Classification (predict a label)
• Continuous Data: Regression (predict a quantity)
Unsupervised
• Discrete Data: Clustering (group similar items)
• Continuous Data: Dimensionality Reduction (reduce the number of variables)
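A minimal sketch of three of the four task types, using deliberately simple stand-in techniques (1-nearest-neighbour, a least-squares line, and a "widest gap" split). These are illustrative choices, not the methods the course necessarily uses, and all names and data are hypothetical. Dimensionality reduction is left out to keep the example short.

```python
def classify_1nn(train, x):
    """Supervised, discrete output: return the label of the closest training point."""
    nearest = min(train, key=lambda pair: abs(pair[0] - x))
    return nearest[1]


def fit_line(xs, ys):
    """Supervised, continuous output: least-squares line y = a*x + b."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    a = cov / var
    b = mean_y - a * mean_x
    return a, b


def two_clusters(points):
    """Unsupervised, discrete output: split sorted 1-D points at the widest gap."""
    pts = sorted(points)
    gaps = [pts[i + 1] - pts[i] for i in range(len(pts) - 1)]
    cut = gaps.index(max(gaps)) + 1
    return pts[:cut], pts[cut:]


# Classification: labels are given, the output is a label.
label = classify_1nn([(150, "child"), (180, "adult")], 172)

# Regression: targets are given, the output is a quantity.
a, b = fit_line([1, 2, 3, 4], [2, 4, 6, 8])
prediction = a * 5 + b

# Clustering: no labels at all, the output is a grouping.
groups = two_clusters([1, 2, 3, 10, 11, 12])

print(label, prediction, groups)
```

The key difference is visible in the inputs: the two supervised functions receive answers (labels or target values) along with the data, while the clustering function receives only the data.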
Machine Learning Process
• Exercise:
— Search “machine learning stages” (or steps, or process) on Google
— Find dozens of “The X stages of Machine Learning” articles
• No standard process?!
Recap: CRISP-DM
Machine Learning Process
Item Representation
• Numerical features
• Text → bag-of-words
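A minimal sketch of the bag-of-words idea: each text becomes a vector of word counts over a shared vocabulary, and word order is discarded. The tokenisation here (lowercasing and splitting on whitespace) is a simplifying assumption, and the function names and example sentences are made up.

```python
from collections import Counter


def bag_of_words(text: str) -> Counter:
    """Count how often each lowercase, whitespace-separated token occurs."""
    return Counter(text.lower().split())


def to_vector(bag: Counter, vocabulary: list) -> list:
    """Turn a bag into a fixed-length count vector, one position per vocabulary word."""
    return [bag[word] for word in vocabulary]


docs = ["the cat sat on the mat", "the dog sat"]
bags = [bag_of_words(d) for d in docs]

# Shared vocabulary: every distinct word seen in any document, in sorted order.
vocabulary = sorted(set().union(*bags))
vectors = [to_vector(b, vocabulary) for b in bags]

print(vocabulary)  # ['cat', 'dog', 'mat', 'on', 'sat', 'the']
print(vectors)     # [[1, 0, 1, 1, 1, 2], [0, 1, 0, 0, 1, 1]]
```

Once texts are vectors of equal length, they can be fed to the same algorithms that work on numerical features.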
One-hot Encoding
Rome = [1, 0, 0, 0, 0, 0, …, 0]
Paris = [0, 1, 0, 0, 0, 0, …, 0]
Italy = [0, 0, 1, 0, 0, 0, …, 0]
France = [0, 0, 0, 1, 0, 0, …, 0]
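The vectors above can be generated mechanically. The sketch below assumes a vocabulary of only the four words shown (the slide's vectors are longer, since the full vocabulary is elided there); `one_hot` is a hypothetical helper name.

```python
def one_hot(item, vocabulary):
    """Return a vector with a 1 at the item's vocabulary index and 0 elsewhere."""
    vec = [0] * len(vocabulary)
    vec[vocabulary.index(item)] = 1
    return vec


# Assumption: a 4-word vocabulary, truncated from the longer one on the slide.
vocabulary = ["Rome", "Paris", "Italy", "France"]

rome = one_hot("Rome", vocabulary)      # [1, 0, 0, 0]
italy = one_hot("Italy", vocabulary)    # [0, 0, 1, 0]
france = one_hot("France", vocabulary)  # [0, 0, 0, 1]

# Dot product between two one-hot vectors of different items is always 0.
rome_italy = sum(a * b for a, b in zip(rome, italy))
rome_france = sum(a * b for a, b in zip(rome, france))

print(rome, italy, rome_italy, rome_france)
```

Note that the encoding carries no notion of similarity: Rome is exactly as "far" from Italy as it is from France.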
Feature Engineering
• More precisely:
— Overfitting
— Underfitting
Overfitting
• Your model learns the details of the training data set “too well”, including its noise, and so performs worse on new, unseen data
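A toy illustration of this effect, under made-up assumptions: the true relation is y = x, the training targets carry noise of ±1, and the test targets are noise-free. A "memorizer" (1-nearest-neighbour lookup) reproduces the training set exactly, while a straight-line fit does not. All names and numbers are hypothetical.

```python
def mse(model, xs, ys):
    """Mean squared error of a model (a function x -> prediction) on a data set."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def fit_memorizer(xs, ys):
    """Model that returns the target of the nearest training point (memorizes the data)."""
    pairs = list(zip(xs, ys))
    return lambda x: min(pairs, key=lambda p: abs(p[0] - x))[1]


def fit_line(xs, ys):
    """Least-squares straight line, a much simpler model."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    a = cov / var
    b = mean_y - a * mean_x
    return lambda x: a * x + b


# Made-up data: true relation y = x, training targets perturbed by +1 / -1.
train_x, train_y = [0, 3, 6, 9, 12], [1, 2, 7, 8, 13]
test_x, test_y = [1, 4, 7, 10], [1, 4, 7, 10]

memorizer = fit_memorizer(train_x, train_y)
line = fit_line(train_x, train_y)

for name, model in [("memorizer", memorizer), ("line", line)]:
    print(name, mse(model, train_x, train_y), mse(model, test_x, test_y))
```

On this data the memorizer has zero training error but a clearly higher test error than the line, which is the signature of overfitting: a perfect fit to the training set that does not carry over to new data.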