2 - Basics of Machine Learning
2 - Basics of Machine Learning
Introduction to
Machine Learning
© ADVANCED RESOURCES AND RISK TECHNOLOGY This document can only be distributed within UFRGS
Machine Learning
o “Give computers the ability to learn without being explicitly programmed” (Arthur Samuel, 1959)
o Algorithms that use data (“learn”) to make predictions
o No “model” ( = function that maps input to output) is explicitly given
o Although no model is given, the users have many algorithms to chose from
o Validation is always required to assess the quality of our predictions
o Goals:
o analyze and describe data: find trends, clusters, anomalies,
o produce predictions that improve in quality and reliability with more and more data,
o optimize decisions based on multiple previous experiments
TRAINING
TESTING
o Used vastly in computer science and anything related to data sets of observations so
large that human calibration is impossible:
o search engines (page ranking, related researches),
o social networks (suggestions, advertisement),
o image pattern recognition (classification of features, face detection, identification)
o natural language processing…
o Scientific uses in all domains of science with enough observations. Sometimes seen as
a statistical “black box” when theory-driven models are considered more elegant. Both
approaches can coexist and be complementary.
o Everything is based on the underlying model that the algorithm forces on the data
o Interpolation = if done well, OK
o Extremes = difficult
o Extrapolation = very dangerous (especially if you cannot explain the underlying statistical model)
o Machine Learning is just a model, as such it always needs validation
lation
extrapo
extra
pola
tion
output
Training domain
input
• Validation metrics
• Accuracy ( = bias) and precision (=variability) : correlations, spearman, bias matrices, etc.
• Sensitivity : true positive rate (or recall, or probability of detection)
• Specificity : true negative rate
• Sensitivity vs. specificity = ROC curves : receiver operating characteristics
The number of machine learning packages is very large, free open-source or proprietary,
language-specific or not, cross-platform or not, cloud-friendly or not, etc.
Fawcett, Tom (2006). "An Introduction to ROC Analysis" (PDF). Pattern Recognition Letters. 27 (8): 861 – 874. doi:10.1016/
j.patrec.2005.10.010.
Powers, David M W (2011). "Evaluation: From Precision, Recall and F-Measure to ROC, Informedness, Markedness &
Correlation" (PDF). Journal of Machine Learning Technologies. 2 (1): 37–63.
Ting, Kai Ming (2011). Encyclopedia of machine learning. Springer. ISBN 978-0-387-30164-8.