Chapter 2
Machine Learning (ML)
Part I
Introduction: What is Machine Learning?
❖ There is no universally accepted definition for machine learning.
❖ The term was coined in 1959 by Arthur Lee Samuel, a pioneer in the AI field.
❖ It is the study of algorithms that:
• improve their performance
• at some task
• with experience
❖ It enables computers to learn without being explicitly programmed
(Arthur Samuel, 1959, at IBM).
Introduction: What is Machine Learning?
1. Supervised Learning
• Classification
• Regression/Prediction
2. Unsupervised Learning
• Clustering
• Dimensionality Reduction
3. Reinforcement Learning
Supervised learning
❖ It is the process of an algorithm learning from a training dataset.
❖ It can be thought of as a teacher supervising the learning process.
❖ The algorithm iteratively makes predictions on the training data and is
corrected by the teacher, because the correct outputs are known.
❖ The machine learns from known datasets (sets of training examples) and
then predicts the output for new inputs.
❖ It is learning from labeled training data (i.e., inputs that come with
the desired outputs or targets; basically, "examples" of what we want
to predict).
❖ Learning stops when the algorithm achieves an acceptable level of
performance.
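The predict-and-correct loop described above can be sketched with a perceptron, one of the simplest supervised learners. This is only an illustration under assumptions that are not in the slides: the function names `train_perceptron` and `predict`, the toy logical-AND dataset, and the stopping rule (no mistakes in a full pass) are all hypothetical choices.

```python
def train_perceptron(examples, epochs=20, learning_rate=1.0):
    """Train on (features, label) pairs, where each label is 0 or 1."""
    n_features = len(examples[0][0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        mistakes = 0
        for features, label in examples:
            # 1. The algorithm makes a prediction on a training example.
            activation = sum(w * x for w, x in zip(weights, features)) + bias
            prediction = 1 if activation > 0 else 0
            # 2. The "teacher" (the known label) corrects it when it is wrong.
            error = label - prediction
            if error != 0:
                mistakes += 1
                weights = [w + learning_rate * error * x
                           for w, x in zip(weights, features)]
                bias += learning_rate * error
        # 3. Stop once performance is acceptable (here: a pass with no mistakes).
        if mistakes == 0:
            break
    return weights, bias


def predict(weights, bias, features):
    """Return the predicted class (0 or 1) for one input."""
    activation = sum(w * x for w, x in zip(weights, features)) + bias
    return 1 if activation > 0 else 0


# Hypothetical labeled training data: inputs paired with desired outputs (logical AND).
training_data = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
weights, bias = train_perceptron(training_data)
predictions = [predict(weights, bias, features) for features, _ in training_data]
```

After training, `predictions` can be compared with the labels in `training_data` to check that the learner reproduces the desired outputs.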
Supervised learning: Classification
❖ Classification deals with assigning instances to one of several
predefined categories/classes.
❖ It specifies the class to which each data element belongs.
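As a minimal sketch of classification, the example below assigns a new instance to one of two predefined classes by copying the label of its closest training example (a 1-nearest-neighbor rule). The rule itself, the function name `nearest_neighbor_classify`, and the toy data are assumptions chosen for illustration; the slides do not prescribe a particular classifier.

```python
import math


def nearest_neighbor_classify(training_examples, query):
    """Return the label of the training example closest to `query`."""
    best_label = None
    best_distance = math.inf
    for features, label in training_examples:
        distance = math.dist(features, query)  # Euclidean distance (Python 3.8+)
        if distance < best_distance:
            best_label, best_distance = label, distance
    return best_label


# Hypothetical labeled instances with two predefined classes.
labeled_examples = [
    ((1.0, 1.0), "small"),
    ((1.5, 2.0), "small"),
    ((8.0, 8.0), "large"),
    ((9.0, 7.5), "large"),
]

label_a = nearest_neighbor_classify(labeled_examples, (2.0, 1.0))
label_b = nearest_neighbor_classify(labeled_examples, (8.5, 8.0))
```

Here the output is always one of the predefined class names, which is what distinguishes classification from predicting a continuous value.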
Unsupervised Learning: Clustering
❖ Clustering automatically categorizes data into groups according to
similarity criteria.
❖ It evaluates similarity based on a metric such as Euclidean distance,
cosine similarity, or Manhattan distance.
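The three similarity metrics named for clustering can be written out directly. The sketch below uses plain Python; the function names and sample points are illustrative assumptions rather than part of any specific clustering library.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan_distance(a, b):
    """Sum of absolute coordinate differences (city-block distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def cosine_similarity(a, b):
    """Cosine of the angle between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


p, q = (0.0, 0.0), (3.0, 4.0)
d_euclid = euclidean_distance(p, q)        # 5.0
d_manhattan = manhattan_distance(p, q)     # 7.0

# Vectors pointing the same way have cosine similarity close to 1,
# regardless of their lengths; perpendicular vectors give 0.
sim_parallel = cosine_similarity((1.0, 2.0), (2.0, 4.0))
sim_orthogonal = cosine_similarity((1.0, 0.0), (0.0, 1.0))
```

Note that the two distances are small for similar points, while cosine similarity is large for similar directions, so a clustering procedure has to treat them accordingly.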
Unsupervised Learning: Dimensionality reduction
Curse of dimensionality
It refers to the challenges that arise when working with high-dimensional data:
❖ High-dimensional data is challenging to handle.
❖ More features increase model complexity and the risk of
overfitting.
❖ Overfitting, in turn, leads to poor performance on new data.
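One way to see why high-dimensional data is hard to handle is to look at what happens to distances as features are added. The sketch below illustrates a symptom the slides do not state explicitly: for random points, the nearest and farthest neighbors of a query become almost equally far away as the number of dimensions grows, which weakens distance-based methods. The function name `relative_contrast`, the uniform random data, and the fixed seed are assumptions made for the illustration.

```python
import math
import random


def relative_contrast(n_points, n_dims, seed=0):
    """Return (farthest - nearest) / nearest for one random query point.

    Points are drawn uniformly from the unit hypercube; a small value means
    all points look roughly equally distant from the query.
    """
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    points = [[rng.random() for _ in range(n_dims)] for _ in range(n_points)]
    query = [rng.random() for _ in range(n_dims)]
    distances = [math.dist(query, point) for point in points]
    nearest, farthest = min(distances), max(distances)
    return (farthest - nearest) / nearest


contrast_low_dim = relative_contrast(n_points=200, n_dims=2)
contrast_high_dim = relative_contrast(n_points=200, n_dims=1000)
```

Comparing `contrast_low_dim` with `contrast_high_dim` shows the contrast shrinking sharply in 1000 dimensions, which is one motivation for reducing the number of features before clustering or modeling.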
2. Is the following problem classification or regression? You are given historical data on the
price of a house at several points in time. You want to predict the price of this house next
month.
(A) Classification
(B) Regression
(C) This is not supervised learning