Machine Learning
Machine Learning
The algorithm takes these previously labeled samples and uses them
to induce a classifier. This classifier is a function that assigns labels to
samples including the samples that have never been previously seen by the
algorithm.
Data Mining: Tools and Techniques, 3rd Edition. Jiawei Han and Michelline Kamber.
Kevin R Murphy, "Machine Learning - A Probabilistic Perspective", The MIT Press, 2012.
http://www.cse.iitm.ac.in/~vplab/E_machine_learning.html
Computational learning theory studies the time complexity and feasibility
of learning. In computational learning theory, a computation is considered
feasible if it can be done in polynomial time.
Classification problems are those for which the output will be an element
from a discrete set of labels. Classification is very common for machine learning
applications. The input would be represented by a large multidimensional vector
whose elements represent pixels in the picture, say CV applications.
After learning a function based on the training set data, that function is
validated on a test set of data, data that did not appear in the training set.
Computational learning theory
(Wikipedia)
• Probably approximately correct learning (PAC learning) --
Leslie Valiant
• inspired boosting
• VC theory --Vladimir Vapnik
• led to SVMs
• Bayesian inference --Thomas Bayes
• Algorithmic learning theory --E. M. Gold
• Online machine learning --Nick Littlestone
• SRM (Structural risk minimization)
• model estimation
Example: Recognition of Handwritten Digits
l Data: images are single digits 16x16 8-bit l Non-binary classification problem
gray-scale, normalized for size and l Low tolerance to misclassifications
orientation
l Classify: newly written digits
12
Categories of Supervised Learning:
- Bayes
- Regularization
- Logistic Regression, LDA
- Kernel methods & SVM;
- Inductive Learning
- Basis and Dictionary methods;
- Decision Trees
- Model selection
- Deep Learning
- Perceptron, ANN
14
Semi-supervised Learning
• Uses both labeled data (in the form (input, output) pairs)
and unlabelled data for learning
• When labeling of data is a costly affair semi-supervised
techniques could be very useful
• Examples: Generative models, self-training, co-training
15
Example: Semi-supervised Learning
16
Source: Semi-supervised literature survey by X. Zhu, Technical Report
Reinforcement Learning
• Reinforcement learning is the problem faced by an agent that must learn
behavior through trial-and-error interactions with a dynamic environment.
• There is no teacher telling the agent wrong or right
• There is critic that gives a reward / penalty for the agent’s action
• Applications:
– Robotics
– Combinatorial search problems, such as games
– Industrial manufacturing
– Many others!
17
Kernels and SVM
ONLINE Learning
Transfer Learning
Reinforcement Learning
Applications:
- Matrix completion
- Agriculture
Decision trees
• One possible representation for hypotheses
• E.g., here is the “true” tree for deciding whether to wait:
https://www.crondose.com/2016/07/easy-way-understand-decision-trees/
http://www.doc.ic.ac.uk/~sgc/teaching/pre2012/v231/lecture11.html
ONLINE LEARNING (src: Wiki)
In this case, it is necessary for the algorithm to dynamically adapt to new patterns
in the data, or when the data itself is generated as a function of time, e.g. stock price
prediction. Online learning algorithms may be prone to catastrophic interference. This
problem is tackled by incremental learning approaches.
A purely online model would learn based on just the new input , the current best
predictor and some extra stored information (which is usually expected to have storage
requirements independent of training data size).
Limited data
Accuracy
Possible
Models/representations
Complexity
Tradeoff between y=f(x)
accuracy and simplicity
Good models
should enable
Prediction
of new data…
X
Concrete learning paradigm- linear separators
x ↦ (x, x2)
The SVM Idea: an Example
Controlling Computational Complexity
.......
....... K(xixj)
max min wn ⋅ xi
separating h xi
- Online Learning
- Curse of Dimensionality
- Bias-Variance Tradeoff
- Incremental Learning
References and Journals
• Text: The Elements of Statistical Learning by Hastie, Tibshirani, and Friedman (book website: http://www-
stat.stanford.edu/~tibs/ElemStatLearn/)
• Reference books:
• Pattern Classification by Duda, Hart and Stork
• Pattern Recognition and Machine Learning by C.M. Bishop
• Machine Learning by T. Mitchell
• Introduction to Machine Learning by E. Alpaydin
• Some related journals / associations:
• Machine Learning (Kluwer).
• Journal of Machine Learning Research.
• Journal of AI Research (JAIR).
• Data Mining and Knowledge Discovery - An International Journal.
• Journal of Experimental and Theoretical Artificial Intelligence (JETAI).
• Evolutionary Computation.
• Artificial Life.
• Fuzzy Sets and Systems
• IEEE Intelligent Systems (Formerly IEEE Expert)
• IEEE Transactions on Knowledge and Data Engineering
• IEEE Transactions on Pattern Analysis and Machine Intelligence
• IEEE Transactions on Systems, Man and Cybernetics
• Journal of AI Research
• Journal of Intelligent Information Systems
• Journal of the American Statistical Association
• Journal of the Royal Statistical Society 36
References and Journals…
– Pattern Recognition
– Pattern Recognition Letters
– Pattern Analysis and Applications.
– Computational Intelligence .
– Journal of Intelligent Systems .
– Annals of Mathematics and Artificial Intelligence.
– IDEAL, the online scientific journal library by Academic Press.
–
– ACM (Association for Computing Machinery).
– Association for Uncertainty in Artificial Intelligence.
– ACM SIGAR
– ACM SIGMOD
– American Statistical Association.
– Artificial Intelligence
– Artificial Intelligence in Engineering
– Artificial Intelligence in Medicine
– Artificial Intelligence Review
– Bioinformatics
– Data and Knowledge Engineering
– Evolutionary Computation
37
Some Conferences & Workshops
• Congress on Evolutionary Computation
• European Conference on Machine Learning and Principles and Practice of Knowledge Discovery
• The ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
Software Assignment 1:
Announcement: 25-01-2020
Deadline: 25-02-2020
Software Assignment 2:
Announcement: 21-02-2020
Deadline: 05-04-2020