Dr. Suvarna Pawar
Magic?
No, more like gardening
*Seeds = Algorithms
*Nutrients = Data
*Gardener = You
*Plants = Programs
Machine learning is the task of writing software that learns
from its past experience.
It is the semi-automated extraction of knowledge from data.
A program learns if its performance at a task T, as measured by P,
improves with experience E.
Experience: E
Task: T
Performance measure: P
Ex. Tic-Tac-Toe
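The E, T, P framing can be made concrete with a toy sketch. It does not implement Tic-Tac-Toe; it assumes a much simpler, hypothetical task T (deciding whether a number lies above a hidden threshold), where experience E is a growing list of labelled examples and P is accuracy on a fixed grid of points. All names (`learn_threshold`, `performance`, `stream`) are illustrative.

```python
# Hypothetical toy task (not Tic-Tac-Toe): learn an unknown threshold.
TRUE_THRESHOLD = 0.5  # assumption: the hidden rule the learner must discover


def true_label(x):
    return 1 if x >= TRUE_THRESHOLD else 0


def learn_threshold(experience):
    """Estimate the threshold as the midpoint between the largest
    example labelled 0 and the smallest example labelled 1."""
    largest_zero = max(x for x, y in experience if y == 0)
    smallest_one = min(x for x, y in experience if y == 1)
    return (largest_zero + smallest_one) / 2


def performance(threshold):
    """P: fraction of a fixed grid of 100 points classified correctly."""
    grid = [i / 100 for i in range(100)]
    correct = sum((1 if x >= threshold else 0) == true_label(x) for x in grid)
    return correct / len(grid)


# E: labelled examples arriving over time (made-up values).
stream = [0.2, 0.95, 0.3, 0.6, 0.45, 0.52]
experience = [(x, true_label(x)) for x in stream]

# P is measured after 2, 4, and 6 examples of experience.
scores = [performance(learn_threshold(experience[:n])) for n in (2, 4, 6)]
print(scores)
```

On this particular stream the score rises as more examples are seen, which is what "learning" means in the E, T, P sense.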
Machine learning is an application of artificial
intelligence (AI) that provides systems the ability to
automatically learn and improve from experience without
being explicitly programmed.
Machine learning focuses on the development of
computer programs that can access data and use it to learn
for themselves.
• A branch of artificial intelligence, concerned with the design and
development of algorithms that allow computers to evolve behaviors
based on empirical data.
• As intelligence requires knowledge, it is necessary for the computers
to acquire knowledge.
* Training:
It is used to build the model.
Labels are given to the learner.
* Testing:
It is used to validate the model that was built.
Labels are known but not given to the model.
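The training/testing distinction can be sketched as follows. This is a minimal illustration using only the standard library; the nearest-centroid learner, the function names, and the data are assumptions chosen for brevity, not a method prescribed by these slides.

```python
import random


def train_test_split(examples, test_fraction=0.25, seed=0):
    """Shuffle the labelled examples and hold out a fraction for testing."""
    rng = random.Random(seed)
    shuffled = examples[:]
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def train(training_set):
    """Training: labels are given. Learn one centroid (mean) per label."""
    sums, counts = {}, {}
    for x, label in training_set:
        sums[label] = sums.get(label, 0.0) + x
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict(model, x):
    """Predict the label whose centroid is closest to x."""
    return min(model, key=lambda label: abs(x - model[label]))


def accuracy(model, testing_set):
    """Testing: labels are known to us but hidden from the model,
    and are used only to score its predictions."""
    correct = sum(predict(model, x) == label for x, label in testing_set)
    return correct / len(testing_set)


# Hypothetical data: two well-separated groups of values.
data = [(float(i), "small") for i in range(0, 20)] + \
       [(float(i), "large") for i in range(80, 100)]

training_set, testing_set = train_test_split(data)
model = train(training_set)
test_accuracy = accuracy(model, testing_set)
print(len(training_set), len(testing_set), test_accuracy)
```

The model never sees the test labels while learning; they are compared against its predictions only afterwards.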
[Figure: data acquisition yields the training set (observed) from the universal set (unobserved); practical usage corresponds to the testing set (unobserved).]
*Training is the process of making the system able to
learn.
*No free lunch rule: no single learner is best for every problem, so
* The training set and testing set must come from the same distribution
* We need to make some assumptions (a bias) about the problem
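Why the same-distribution assumption matters can be shown with a small sketch. The data, labels, and nearest-centroid learner below are hypothetical and chosen only to make the effect visible: the same model is scored once on data like its training set and once on data shifted by 60 units.

```python
def fit_centroids(training_set):
    """Learn the mean value of each label."""
    groups = {}
    for x, label in training_set:
        groups.setdefault(label, []).append(x)
    return {label: sum(xs) / len(xs) for label, xs in groups.items()}


def predict(centroids, x):
    return min(centroids, key=lambda label: abs(x - centroids[label]))


def accuracy(centroids, examples):
    hits = sum(predict(centroids, x) == label for x, label in examples)
    return hits / len(examples)


# Hypothetical readings: "low" values lie in 0-19, "high" values in 80-99.
training_set = [(float(x), "low") for x in range(0, 20, 2)] + \
               [(float(x), "high") for x in range(80, 100, 2)]
same_distribution = [(float(x), "low") for x in range(1, 20, 2)] + \
                    [(float(x), "high") for x in range(81, 100, 2)]
# Shifted test set: every reading is 60 units larger than before.
shifted = [(x + 60.0, label) for x, label in same_distribution]

centroids = fit_centroids(training_set)
acc_same = accuracy(centroids, same_distribution)
acc_shifted = accuracy(centroids, shifted)
print(acc_same, acc_shifted)
```

After the shift every "low" reading falls closer to the "high" centroid, so the model gets only the "high" half of the test set right.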
[Figure: input samples, learning method, and system, with training and testing phases.]
* Supervised learning: "I know" (the correct answers are given)
* Unsupervised learning: "I don't know" (no answers are given)
* Reinforcement learning: "maybe" (only feedback on actions is given)
Ex. Ham/spam email
Customer behaviour in a shop
(Predictions)
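The ham/spam example is a supervised problem: every training message comes with its label. A minimal sketch, assuming a word-count Naive Bayes classifier with add-one smoothing and four made-up messages (neither the classifier choice nor the data come from these slides):

```python
import math
from collections import Counter


def train_naive_bayes(messages):
    """messages: list of (text, label) pairs with label 'spam' or 'ham'."""
    word_counts = {"spam": Counter(), "ham": Counter()}
    label_counts = Counter()
    for text, label in messages:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocabulary = set(word_counts["spam"]) | set(word_counts["ham"])
    return word_counts, label_counts, vocabulary


def classify(text, model):
    """Return the label with the highest log-probability for the text."""
    word_counts, label_counts, vocabulary = model
    total_messages = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in label_counts:
        score = math.log(label_counts[label] / total_messages)  # prior
        n_words = sum(word_counts[label].values())
        for word in text.lower().split():
            # Add-one (Laplace) smoothing avoids zero probabilities.
            p = (word_counts[label][word] + 1) / (n_words + len(vocabulary))
            score += math.log(p)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical labelled training messages.
training = [
    ("win a free prize now", "spam"),
    ("free money claim your prize", "spam"),
    ("meeting at noon tomorrow", "ham"),
    ("lunch with the project team", "ham"),
]
model = train_naive_bayes(training)

print(classify("claim your free prize", model))
print(classify("project meeting tomorrow", model))
```

A real filter would need far more data and better text handling; the point here is only that the labels drive what is learned.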
* Supervised learning
* Unsupervised learning
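For contrast with the supervised case, here is a sketch of unsupervised learning: no labels are given and the algorithm groups the data on its own. It assumes one-dimensional K-means with two clusters and made-up values (for instance, amounts spent by shop customers); the function name and starting centroids are illustrative.

```python
def k_means_1d(points, centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid
    and moving each centroid to the mean of its assigned points."""
    clusters = []
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda i: abs(p - centroids[i]))
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster happens to be empty.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    return centroids, clusters


# Hypothetical unlabelled data, e.g. amounts spent by customers.
points = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
centroids, clusters = k_means_1d(points, centroids=[1.0, 2.0])
print(centroids)
print(clusters)
```

The two groups are discovered from the values alone; nobody told the algorithm which customer belongs where.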
* Geometric models
SVM, perceptron, linear regression, K-means, KNN
* Probabilistic models
Naïve Bayes
* Logical models
Decision tree
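As one concrete instance of a geometric model, the sketch below implements K-nearest neighbours (KNN) with Euclidean distance and majority voting. The two-dimensional points and the labels "A" and "B" are hypothetical.

```python
import math
from collections import Counter


def knn_predict(training_set, query, k=3):
    """Label the query by majority vote among its k nearest neighbours."""
    neighbours = sorted(training_set,
                        key=lambda example: math.dist(example[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Hypothetical labelled points forming two groups in the plane.
training_set = [
    ((1, 1), "A"), ((1, 2), "A"), ((2, 1), "A"),
    ((8, 8), "B"), ((8, 9), "B"), ((9, 8), "B"),
]

print(knn_predict(training_set, (2, 2)))
print(knn_predict(training_set, (7, 9)))
```

The model is "geometric" because the prediction depends only on distances between points in feature space.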
* Face detection
* Object detection and recognition
* Image segmentation
* Multimedia event detection
* Economic and commercial applications
ML in Practice
* Understanding domain, prior knowledge, and goals
* Data integration, selection, cleaning,
pre-processing, etc.
* Learning models
* Interpreting results
* Consolidating and deploying discovered knowledge
* Loop: repeat the steps above as needed
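The data cleaning and pre-processing step in the list above can be sketched as follows. The record layout (age, monthly spend) and the two helper functions are assumptions made for illustration; real projects usually need domain-specific cleaning rules.

```python
def clean(records):
    """Drop records with missing values and remove exact duplicates."""
    seen, cleaned = set(), []
    for record in records:
        if None in record or record in seen:
            continue
        seen.add(record)
        cleaned.append(record)
    return cleaned


def min_max_scale(values):
    """Rescale values to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical raw records: (age, monthly spend).
raw = [(20, 1500.0), (35, None), (20, 1500.0), (50, 4500.0), (40, 3000.0)]

records = clean(raw)
ages = min_max_scale([age for age, _ in records])
spend = min_max_scale([amount for _, amount in records])
print(records)
print(ages, spend)
```

Scaling puts features measured in different units on a comparable footing before a model is learned from them.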
*It is very hard to write programs that solve problems like
recognizing a face.
*We don’t know what program to write because we don’t
know how our brain does it.
*Even if we had a good idea about how to do it, the
program might be horrendously complicated.
*Instead of writing a program by hand, we collect lots of
examples that specify the correct output for a given input.
*A machine learning algorithm then takes these examples
and produces a program that does the job.
*The program produced by the learning algorithm may
look very different from a typical hand-written program.
It may contain millions of numbers.
*If we do it right, the program works for new cases as well
as the ones we trained it on.
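The idea that a learning algorithm turns examples into a program made of numbers can be illustrated with a perceptron. This sketch assumes a deliberately tiny problem, the logical AND of two inputs, rather than anything like face recognition; the learned "program" is just two weights and a bias.

```python
def train_perceptron(examples, epochs=20, learning_rate=1.0):
    """Classic perceptron rule: nudge the weights after every mistake."""
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for (x1, x2), target in examples:
            output = 1 if weights[0] * x1 + weights[1] * x2 + bias > 0 else 0
            error = target - output
            weights[0] += learning_rate * error * x1
            weights[1] += learning_rate * error * x2
            bias += learning_rate * error
    return weights, bias


def run_program(weights, bias, x1, x2):
    """The 'program' produced by learning: a weighted sum and a threshold."""
    return 1 if weights[0] * x1 + weights[1] * x2 + bias > 0 else 0


# Examples specifying the correct output (logical AND) for each input.
examples = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

weights, bias = train_perceptron(examples)
predictions = [run_program(weights, bias, x1, x2) for (x1, x2), _ in examples]
print(weights, bias, predictions)
```

Nobody wrote the rule for AND by hand; the numbers were found from the examples. Larger models work the same way with millions of such numbers.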
*Recognizing patterns:
*Facial identities or facial expressions
*Handwritten or spoken words
*Medical images
*Generating patterns:
*Generating images or motion sequences (demo)
*Recognizing anomalies:
*Unusual sequences of credit card transactions
*Unusual patterns of sensor readings in a nuclear
power plant or unusual sound in your car engine.
*Prediction:
*Future stock prices or currency exchange rates
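The anomaly-recognition item above (unusual sensor readings) can be sketched with a simple z-score rule: flag any reading that lies more than a chosen number of standard deviations from the mean. The readings and the threshold of 2.0 are assumptions; practical systems often use more robust statistics.

```python
import statistics


def find_anomalies(readings, threshold=2.0):
    """Return (index, value) pairs whose z-score exceeds the threshold."""
    mean = statistics.fmean(readings)
    spread = statistics.pstdev(readings)
    if spread == 0:
        return []  # all readings identical, nothing stands out
    return [(i, value) for i, value in enumerate(readings)
            if abs(value - mean) / spread > threshold]


# Hypothetical temperature readings with one unusual value.
readings = [20.1, 19.8, 20.3, 20.0, 19.9, 20.2, 35.0, 20.1]
anomalies = find_anomalies(readings)
print(anomalies)
```

Only the reading of 35.0 is flagged here; the rest stay well within the threshold.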
*The web contains a lot of data. Tasks with very big
datasets often use machine learning
* especially if the data is noisy or non-stationary.
*Spam filtering, fraud detection:
* The enemy adapts so we must adapt too.
*Recommendation systems:
*Lots of noisy data. Million dollar prize!
*Information retrieval:
*Find documents or images with similar content.
*Data Visualization:
*Display a huge database in a revealing way (demo)
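The information-retrieval item above (finding documents with similar content) can be sketched with bag-of-words vectors and cosine similarity. The documents and query are made up, and splitting on whitespace is a simplifying assumption.

```python
import math
from collections import Counter


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between the word-count vectors of two texts."""
    a = Counter(text_a.lower().split())
    b = Counter(text_b.lower().split())
    dot = sum(a[word] * b[word] for word in a)  # missing words count as 0
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, documents):
    """Return the document whose word counts best match the query."""
    return max(documents, key=lambda doc: cosine_similarity(query, doc))


# Hypothetical document collection.
documents = [
    "machine learning with python",
    "gardening tips for spring",
    "stock prices and exchange rates",
]
best = most_similar("learning python", documents)
print(best)
```

Documents that share no words with the query score zero, so the first document is returned for this query.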
*Python
*Java
*Netbeans
*Weka for simulation purposes
*Any open source software
*Deep learning with ANN
*Active and passive learning
*Bagging and boosting
and so on.