Machine Learning

1
Machine Learning
 Machine Learning:
 Machine Learning (ML) is a branch of artificial
intelligence (AI) that focuses on developing algorithms
and statistical models that enable computers to learn
from and make decisions based on data without being
explicitly programmed.

2
Types of Machine Learning
 Supervised Machine Learning:
 Supervised learning is a type of machine learning where
the model is trained on labeled data. This means that for
each input, there is a corresponding output.
 Unsupervised Machine Learning:
 Unsupervised learning is a type of machine learning
where the model is trained on unlabeled data. The goal is
to uncover hidden patterns or structures within the data
without predefined labels.
3
Supervised Learning Process: Two steps

 Learning (Training): Learn a model using the training data

 Testing: Test the model using unseen test data to assess the model accuracy

4
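As an illustration only (the slides do not name a library or dataset), here is a minimal scikit-learn sketch of these two steps, using the iris dataset and an arbitrarily chosen classifier:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0
)

# Learning (training): learn a model using the training data.
model = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

# Testing: apply the model to unseen test data to assess its accuracy.
print(accuracy_score(y_test, model.predict(X_test)))
```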
Supervised Learning
Supervised learning problems can be further grouped into
regression and classification problems:
 Classification: Classification is a type of supervised learning task
in machine learning where the goal is to assign predefined
labels or categories to input data based on its features.

 Regression: Like classification, regression is a supervised learning task. However, the goal in regression is to predict a continuous numeric value rather than discrete classes.
5
Supervised Learning
List of common supervised machine learning
algorithms:

 Decision Tree
 K Nearest Neighbors
 Logistic Regression
 Linear Regression

6
7
Decision Tree
 A Decision Tree (DT) defines a hierarchy of rules to make a prediction

[Figure: an example tree. The root node tests body temperature (warm vs. cold); cold leads to a leaf node predicting Non-mammal. An internal node then tests "gives birth" (yes vs. no); yes leads to a leaf predicting Mammal, no to a leaf predicting Non-mammal.]

 Root and internal nodes test rules. Leaf nodes make predictions
8
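To make this concrete, here is a minimal sketch (not part of the slides) that fits a small tree with scikit-learn on an assumed 0/1 encoding of the mammal example; the feature names and toy data are illustrative assumptions:

```python
from sklearn.tree import DecisionTreeClassifier, export_text

X = [
    [1, 1],  # warm-blooded, gives birth        -> mammal
    [1, 0],  # warm-blooded, does not give birth -> non-mammal (e.g., a bird)
    [0, 0],  # cold-blooded, does not give birth -> non-mammal
    [0, 1],  # cold-blooded, gives birth         -> non-mammal (e.g., some snakes)
]
y = [1, 0, 0, 0]  # 1 = mammal, 0 = non-mammal

tree = DecisionTreeClassifier(max_depth=2, random_state=0)
tree.fit(X, y)

# Print the learned hierarchy of rules: root/internal nodes test features,
# leaf nodes make the prediction.
print(export_text(tree, feature_names=["body_temp_warm", "gives_birth"]))
print(tree.predict([[1, 1]]))  # -> [1], i.e. mammal
```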
Learning Decision Tree with Supervision
 The basic idea is very simple
 Recursively partition the training data into homogeneous regions
 What do you mean by "homogeneous" regions? A homogeneous region will have all (or a majority of) the training inputs with the same/similar outputs
 Even though the rule within each group is simple, we are able to learn a fairly sophisticated model overall (note that in this example, each rule is a simple horizontal/vertical classifier, but the overall decision boundary is rather sophisticated)
 Within each group, fit a simple supervised learner (e.g., predict the majority output within that region)
9
Decision Trees for Classification

[Figure: a decision tree over two features. The root node tests x1 > 3.5?; its NO branch then tests x2 > 2? and its YES branch tests x2 > 3?; the four leaf nodes predict Red, Green, Green, and Red. The accompanying plot shows the training points and a test input partitioned by these axis-aligned splits (Feature 1 on the horizontal axis).]

 Remember: The root node contains all training inputs; each leaf node receives a subset of the training inputs

 DT is very efficient at test time: to predict the label of a test point, nearest neighbors would require computing distances from all 48 training inputs, whereas the DT predicts the label with just 2 feature-value comparisons. Way faster!
10
K Nearest Neighbors (KNN)

 The KNN classifier is a non-parametric and instance-based learning algorithm.
 Non-parametric means it makes no assumptions about the distribution of the data and thus avoids the risk of misspecifying the underlying distribution.
 Instance-based learning means that the algorithm does not explicitly learn any parameters.
 For classification, the algorithm takes a majority vote among the K most similar instances to a given "unseen" observation. K is a count.
 KNN is not suitable if the data is noisy and the target classes do not have a clear demarcation in terms of attribute values.
 The closest class is identified using distance measures such as Euclidean distance.
K Nearest Neighbors (KNN)
Distance measures
● Euclidean distance between any two points p and q: d(p, q) = sqrt(Σi (pi − qi)²)
● Manhattan distance: d(p, q) = Σi |pi − qi|
1
KNN Methodology
● Let's say we have a new instance called x.
● The algorithm calculates the distance between x and all the instances in the training set.
● Arrange these distances in increasing order.
● Find the k nearest neighbors. If k = 3, it selects the three nearest instances based on the similarity measure.
● Use the k neighbors to determine the class of x by majority voting among these closest instances.

1
KNN Methodology
Nearest Neighbor Classifiers
• Basic idea:
• If it walks like a duck, quacks like a duck, then it’s probably a duck

[Figure: compute the distance from the test record to all training records, then choose the k "nearest" records]
KNN Methodology
Value of K
• Choosing the value of k:
• If k is too small, sensitive to noise points
• If k is too large, neighborhood may include points from other classes

Rule of thumb: K = sqrt(N), where N is the number of training points
KNN Methodology
Nearest-Neighbor Classifiers: Issues
 The value of k, the number of nearest neighbors to retrieve
 Choice of Distance Metric to compute the distance between records
 Computational complexity
 Size of training set
 Dimension of data
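The KNN methodology described above maps almost step for step onto code. Below is a minimal NumPy sketch (an illustration, not the slides' implementation) of the distance, sort, top-k, and majority-vote steps using Euclidean distance:

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    """Classify a single point x by majority vote among its k nearest neighbors."""
    # 1. Compute the (Euclidean) distance between x and every training instance.
    dists = np.sqrt(((X_train - x) ** 2).sum(axis=1))
    # 2./3. Sort the distances in increasing order and keep the k nearest.
    nearest = np.argsort(dists)[:k]
    # 4. Majority vote among the labels of the k neighbors.
    return Counter(y_train[nearest]).most_common(1)[0][0]

# Toy usage with made-up points and labels
X_train = np.array([[1.0, 2.0], [2.0, 3.0], [6.0, 5.0], [7.0, 8.0]])
y_train = np.array(["red", "red", "green", "green"])
print(knn_predict(X_train, y_train, np.array([1.5, 2.5]), k=3))  # -> "red"
```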
Linear Regression Model
 This is the base model for all statistical machine learning
 x is a one-feature data variable
 y is the value we are trying to predict
 The regression model is y = w0 + w1·x + ε
 Two parameters to estimate – the slope of the line w1 and the y-intercept w0
Solving the regression problem
 We basically want to find
{w0, w1} that minimize
deviations from the predictor
line

 How do we do it?
 Iterate over all possible w
values along the two
dimensions?
 Same, but smarter? [next
class]
 No, we can do this in
closed form with just plain
calculus
Parameter estimation via calculus
 We just need to set the partial derivatives of the squared error with respect to w0 and w1 to zero (full derivation omitted)
 Simplifying gives the closed-form solution:
   w1 = Σi (xi − x̄)(yi − ȳ) / Σi (xi − x̄)²
   w0 = ȳ − w1·x̄
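As a quick sanity check of this closed-form solution, here is a short NumPy sketch on synthetic one-feature data (the data and the "true" parameter values are assumptions made purely for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, size=50)
y = 2.0 + 0.5 * x + rng.normal(scale=0.3, size=50)  # true w0 = 2.0, w1 = 0.5

# Closed-form least-squares estimates from the partial-derivative conditions.
x_bar, y_bar = x.mean(), y.mean()
w1 = ((x - x_bar) * (y - y_bar)).sum() / ((x - x_bar) ** 2).sum()
w0 = y_bar - w1 * x_bar

print(f"w1 = {w1:.3f}, w0 = {w0:.3f}")  # should be close to 0.5 and 2.0
```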
Logistic Regression
• Logistic Regression is a statistical technique that predicts the probability of a target variable based on the independent features.
• It predicts the probability of occurrence of a class label; based on these probabilities, the data points are labelled.
• The probability of an outcome (y) is calculated using the sigmoid function S(x) = 1/(1 + e^(-f(x))), which is then used to decide the class based on a threshold value.
• A threshold (or cut-off; commonly a threshold of 0.5 is used) is fixed, then:
   Class = 1 if probability > threshold
   Class = 0 if probability < threshold
Logistic Regression
● Logistic regression is similar to linear regression in that the explanatory variables (X) are combined with weight values to predict a target variable of binary class (y).
● f(x) = a + bx; here, f(x) can take values from -∞ to ∞
● log(p/(1-p)) = f(x)
   ○ Here, p is the probability that the event y occurs (Y=1) [range 0 to 1]
   ○ p/(1-p) is the odds ratio [range 0 to infinity]
   ○ log(p/(1-p)) is the log of the odds ratio (logit) [range -∞ to ∞]
● log(p/(1-p)) = a + bx : the log of p/(1-p) is linearly related to the features and can take values between -∞ and ∞
Logistic Regression
 Exponentiate the logit and you have the odds for the two groups in question.
 p/(1-p) = e^f(x) : odds (range from 0 to infinity, with values greater than 1 associated with an event being more likely to occur than not, and values less than 1 associated with an event that is less likely to occur)
 P(y) = 1/(1 + e^(-f(x))) : the sigmoid function calculates the probability
 p(y) = 1/(1 + e^(-(a+bx))) : if f(x) = 0 then p = 0.5; as f(x) increases, p approaches 1, and as f(x) decreases, p approaches 0.
 Note - Logarithm or logit transformation is used to model the non-
linear relationship between Y and X by transforming Y.
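A small NumPy sketch tying the logit, the odds, the sigmoid, and the threshold together (the coefficients a and b below are arbitrary illustrative values, not fitted from any data):

```python
import numpy as np

def sigmoid(z):
    """P(y=1) = 1 / (1 + e^(-f(x)))."""
    return 1.0 / (1.0 + np.exp(-z))

# Assumed coefficients for f(x) = a + b*x (illustrative values only).
a, b = -2.0, 1.5
x = np.array([0.0, 1.0, 2.0, 3.0])

f_x = a + b * x                  # logit: log(p / (1 - p)), ranges over (-inf, inf)
odds = np.exp(f_x)               # p / (1 - p), ranges over (0, inf)
p = sigmoid(f_x)                 # probability in (0, 1); p = 0.5 when f(x) = 0
labels = (p > 0.5).astype(int)   # class 1 if the probability exceeds the 0.5 threshold

print(p, labels)
```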
Advantages of Supervised Learning
 It allows you to be very specific about the definition of the labels.
 You can determine the number of classes you want to have.
 The input data is very well known and is labeled.
 The results produced by supervised methods tend to be more accurate.
22
Unsupervised Learning
 Unsupervised learning is where you only have input data (X) and no corresponding output labels. The goal of unsupervised learning is to model the underlying structure or distribution in the data in order to learn more about the data.

 These methods are called unsupervised because, unlike supervised learning, there are no correct answers and there is no teacher. Algorithms are left to their own devices to discover and present the interesting structure in the data.

23
Unsupervised Learning
Unsupervised learning problems can be further grouped into
clustering and association problems.
 Clustering: Clustering is a technique in machine learning
and data analysis that involves grouping similar data points
based on certain criteria.
 Association: The primary goal is to identify associations or
dependencies between variables without the need for
predefined labels or a target outcome. Association learning
is commonly used in data mining, market basket analysis,
and discovering patterns in transactional datasets.

24
Advantages of Unsupervised Learning
 Less complexity in comparison with supervised learning.

 It is often easier to get unlabeled data.

 Learning can take place in real time, with the input data analyzed and labeled in the presence of learners.

25
Unsupervised Learning
List of common unsupervised machine learning algorithms:

 K-means clustering
 Dimensionality Reduction

26
K-means clustering
 K-means clustering is an algorithm that classifies or groups objects, based on their features, into K groups.

 K is a positive integer number.

 The grouping is done by minimizing the sum of squares of


distances between data and the corresponding cluster
centroid.
27
K-means clustering Method

Given k, the k-means algorithm is implemented in four steps:


 Partition objects into k nonempty subsets
 Compute seed points as the centroids of the clusters of the
current partition (the centroid is the center, i.e., mean point, of
the cluster)
 Assign each object to the cluster with the nearest seed point
 Go back to Step 2; stop when there are no new assignments
28
K-means clustering Method
The K-Means Clustering Method
• Example (K = 2):

[Figure: arbitrarily choose K objects as the initial cluster centers; assign each object to the most similar center; update the cluster means; then reassign objects and update the means again, repeating until no assignments change]
K-means clustering Method
The K-Means Clustering Method
Given: {2,4,10,12,3,20,30,11,25}, k=2
 Randomly assign means: m1=3,m2=4
 K1={2,3}, K2={4,10,12,20,30,11,25},
m1=2.5,m2=16
 K1={2,3,4},K2={10,12,20,30,11,25},
m1=3,m2=18
 K1={2,3,4,10},K2={12,20,30,11,25},
m1=4.75,m2=19.6
 K1={2,3,4,10,11,12},K2={20,30,25},
m1=7,m2=25
 Stop as the clusters with these means are the same.
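The same behaviour can be reproduced with a short NumPy sketch (my own illustration; the stopping test is the "no new assignments" condition from the four-step method above):

```python
import numpy as np

def kmeans_1d(points, means, max_iter=100):
    """Plain k-means on 1-D data: assign to the nearest mean, update means, repeat."""
    points = np.asarray(points, dtype=float)
    means = np.asarray(means, dtype=float)
    assign = None
    for _ in range(max_iter):
        # Assign each point to the cluster with the nearest mean.
        new_assign = np.argmin(np.abs(points[:, None] - means[None, :]), axis=1)
        if assign is not None and np.array_equal(new_assign, assign):
            break  # no new assignments -> stop
        assign = new_assign
        # Update each mean as the centroid of its current cluster.
        means = np.array([points[assign == k].mean() for k in range(len(means))])
    return assign, means

data = [2, 4, 10, 12, 3, 20, 30, 11, 25]
assign, means = kmeans_1d(data, means=[3, 4])
print(assign, means)  # converges to means of 7 and 25, as in the worked example
```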
Dimensionality Reduction
 Dimensionality reduction is a technique used in machine learning
and data analysis to reduce the number of features or variables in
a dataset while preserving its essential information.

 The high dimensionality of a dataset (large number of features)


can lead to challenges such as increased computational
complexity, the curse of dimensionality, and difficulties in
visualizing or interpreting the data.

31
Types of Dimensionality Reduction
1. Feature Selection:
 Feature selection involves choosing a subset of the most relevant
features from the original set. This is done by evaluating the
importance of each feature based on certain criteria, such as
statistical tests, information gain, or correlation analysis.

 Common techniques for feature selection include filter methods


(e.g., based on statistical tests), wrapper methods (e.g., using the
performance of a specific model), and embedded methods (e.g.,
feature importance from tree-based models).
32
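As one concrete possibility (an illustration using scikit-learn's filter-style selector on the iris dataset; the slides do not prescribe a specific tool):

```python
from sklearn.datasets import load_iris
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_iris(return_X_y=True)

# Filter method: score each feature with a statistical test (ANOVA F-test)
# and keep only the k best-scoring features.
selector = SelectKBest(score_func=f_classif, k=2)
X_selected = selector.fit_transform(X, y)

print(selector.scores_)   # importance score per original feature
print(X_selected.shape)   # (150, 2): only the 2 most relevant features are kept
```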
Types of Dimensionality Reduction
2. Feature Extraction:
 Feature extraction transforms the original features into a new set
of features, typically of lower dimensionality. This is achieved by
creating new features that capture the most important
information in the original data.
 Principal Component Analysis (PCA) is a popular linear technique
for feature extraction. It identifies orthogonal directions (principal
components) along which the data varies the most and projects
the data onto these components.
33
Steps of PCA
 Let μ be the mean vector (taking the mean of all rows)
 Adjust the original data by the mean: X' = X − μ
 Compute the covariance matrix C of the adjusted data X'
 Find the eigenvectors and eigenvalues of C
 For the matrix C, an eigenvector e (a column vector) has the same direction as Ce: Ce = λe, where λ is called an eigenvalue of C
 Ce = λe ⇒ (C − λI)e = 0
34
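The steps above translate directly into NumPy (a sketch assuming that rows of X are samples and columns are features; the toy data is made up):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))          # toy data: 100 samples, 3 features

# 1. Mean vector (mean of all rows) and mean-adjusted data X' = X - mu.
mu = X.mean(axis=0)
X_adj = X - mu

# 2. Covariance matrix C of the adjusted data.
C = np.cov(X_adj, rowvar=False)

# 3. Eigenvectors and eigenvalues of C (C e = lambda e).
eigvals, eigvecs = np.linalg.eigh(C)   # eigh because C is symmetric

# 4. Sort by decreasing eigenvalue and project onto the top-2 principal components.
order = np.argsort(eigvals)[::-1]
W = eigvecs[:, order[:2]]
X_reduced = X_adj @ W

print(eigvals[order], X_reduced.shape)  # eigenvalues in decreasing order, (100, 2)
```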
THANK YOU

35
