Lecture 03
ICT-4261
By-
Dr. Jesmin Akhter
Professor
Institute of Information Technology
Jahangirnagar University
Contents
The course will mainly cover the following topics:
A Gentle Introduction to Machine Learning
Linear Regression
Logistic Regression
Naive Bayes
Support Vector Machines
Decision Trees and Ensemble Learning
Clustering Fundamentals
Hierarchical Clustering
Neural Networks and Deep Learning
Unsupervised Learning
Outline
Clustering is a popular type of unsupervised learning approach. You can even break it down further into different
types of clustering; for example:
– Exclusive clustering: or "hard" clustering
• The kind of grouping in which one piece of data can belong to only one cluster.
– Overlapping clustering: or "soft" clustering
• The kind of grouping in which a single data point may belong to multiple clusters with varying degrees of
membership.
Types of Unsupervised Learning
Clustering Algorithms
– Hierarchical Clustering:
• Hierarchical clustering develops a hierarchy of clusters
by merging or splitting them depending on their
similarity: two clusters that are close to each other
end up in the same cluster.
• If you start with all data items attached to the same
cluster and then perform splits until each data item is
a separate cluster, the approach is called top-down
or divisive hierarchical clustering.
• If instead each data item starts as its own cluster, the
two clusters that are closest to one another are
merged into a single cluster. The merging goes on
iteratively till there is only one cluster left at the top. Such
an approach is known as bottom-up or agglomerative.
• The example shows how seven different clusters (data
points) are merged step by step based on distance until
they all form one large cluster, as in the sketch below.
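As a concrete illustration, here is a minimal sketch of the bottom-up (agglomerative) approach using SciPy; the seven 2-D points are made up to mirror the seven-point example, and single-link distance is just one possible merge criterion.

```python
# A minimal sketch of agglomerative (bottom-up) hierarchical clustering.
# The seven 2-D points below are assumptions made up for illustration.
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

points = np.array([[1, 1], [1.5, 1], [5, 5], [5.5, 5.2],
                   [9, 1], [9.5, 1.2], [5, 9]])

# Each point starts as its own cluster; the two closest clusters are
# merged iteratively until only one cluster remains at the top.
Z = linkage(points, method='single')  # single-link (nearest-point) distance

# Cut the resulting hierarchy into, e.g., three flat clusters.
labels = fcluster(Z, t=3, criterion='maxclust')
print(labels)
```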
Types of Unsupervised Learning
Clustering Algorithms
– K-Means Clustering
• In K-means clustering, data is grouped in terms of
characteristics and similarities.
• K is a letter that represents the number of clusters. For
example, if K=5, then the number of desired clusters is 5. If
K=10, then the number of desired clusters is 10.
• It puts the data points into the predefined number of
clusters K.
• Each data item then gets assigned to the nearest cluster
center, called a centroid (the black dots in the picture),
which acts as the point around which the cluster's data accumulates.
• The assignment and centroid-update procedure is repeated several times
until the clusters are well-defined. A short sketch follows below.
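A minimal K-means sketch with scikit-learn, assuming a made-up toy data set; n_clusters plays the role of K.

```python
# A minimal sketch of K-means clustering; the 2-D data is an assumption.
import numpy as np
from sklearn.cluster import KMeans

X = np.array([[1, 2], [1, 4], [1, 0],
              [10, 2], [10, 4], [10, 0]])

# K = 2: group the points into two predefined clusters.
km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
print(km.labels_)           # cluster assignment of each data point
print(km.cluster_centers_)  # the centroids (the "black dots")
```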
Types of Unsupervised Learning
Clustering Algorithms
– Fuzzy K-means
• It is an extension of the K-means algorithm used to perform overlapping clustering. Unlike the K-means
algorithm, fuzzy K-means implies that data points can belong to more than one cluster with a certain level of
closeness towards each.
• The closeness is measured by the distance from a data point to the centroid of the cluster. So, sometimes
there may be an overlap between different clusters. A minimal sketch follows below.
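Since common libraries differ here, this is a hand-rolled NumPy sketch of fuzzy K-means (fuzzy c-means); the data points, the fuzzifier value fuzz = 2, and the iteration count are all assumptions for illustration.

```python
# A minimal NumPy sketch of fuzzy K-means: each point gets a membership
# degree in every cluster rather than a single hard assignment.
import numpy as np

def fuzzy_kmeans(X, k, fuzz=2.0, n_iter=100, seed=0):
    """Returns centroids and a membership matrix U (rows sum to 1)."""
    rng = np.random.default_rng(seed)
    U = rng.random((len(X), k))          # random initial memberships
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        W = U ** fuzz
        # Centroids are membership-weighted means of the data.
        centers = (W.T @ X) / W.sum(axis=0)[:, None]
        # Distance from every point to every centroid.
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2) + 1e-12
        # Closer centroids get higher membership; renormalize rows.
        U = d ** (-2.0 / (fuzz - 1.0))
        U /= U.sum(axis=1, keepdims=True)
    return centers, U

X = np.array([[1.0, 1.0], [1.2, 0.8], [5.0, 5.0], [5.2, 5.1], [3.0, 3.0]])
centers, U = fuzzy_kmeans(X, k=2)
print(U.round(2))  # the middle point [3, 3] belongs partly to both clusters
```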
Types of Unsupervised Learning
Clustering Algorithms
– Gaussian Mixture Models (GMMs) are used in
probabilistic clustering.
• The models assume that there is a certain number of Gaussian
distributions, each representing a separate cluster.
• The algorithm is used to decide which cluster a
particular data point most likely belongs to, as in the sketch below.
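A minimal sketch of probabilistic clustering with scikit-learn's GaussianMixture; the toy data and the choice of two components are assumptions.

```python
# A minimal sketch of clustering with a Gaussian Mixture Model.
import numpy as np
from sklearn.mixture import GaussianMixture

X = np.array([[1.0, 2.0], [1.2, 1.8], [0.8, 2.1],
              [8.0, 8.0], [8.3, 7.9], [7.9, 8.2]])

# Assume the data comes from two Gaussian distributions (two clusters).
gmm = GaussianMixture(n_components=2, random_state=0).fit(X)
print(gmm.predict(X))        # most likely cluster for each point
print(gmm.predict_proba(X))  # probability of each point under each Gaussian
```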
– Principal Component Analysis (PCA): PCA transforms the original features into a
lower-dimensional space while preserving the maximum amount of information.
– The PCA method is particularly useful when the variables within the data set are highly
correlated. Correlation indicates that there is redundancy in the data. Due to this
redundancy, PCA can be used to reduce the original variables to a smaller number of new
variables (principal components) explaining most of the variance in the original variables,
as in the sketch below.
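A minimal PCA sketch with scikit-learn; the data is made up so that the two columns are nearly redundant (highly correlated), which is exactly the case PCA handles well.

```python
# A minimal sketch of PCA on two highly correlated features.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
x1 = rng.normal(size=100)
# Second feature is almost a multiple of the first: redundancy in the data.
X = np.column_stack([x1, 2 * x1 + rng.normal(scale=0.1, size=100)])

pca = PCA(n_components=1)             # keep a single principal component
Z = pca.fit_transform(X)
print(pca.explained_variance_ratio_)  # close to 1: one component suffices
```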
What is the difference between supervised and
unsupervised learning?
Supervised learning requires labeled data with input features and corresponding output labels, while
unsupervised learning aims to discover patterns or structures in unlabeled data without predefined
output labels.
Reinforcement machine learning
Reinforcement Learning (RL) enables an agent to learn in an interactive environment by trial and error,
using feedback from its own actions and experiences.
Here, agents are self-trained through reward and punishment mechanisms.
The agent can take actions and interact with the environment.
A reinforcement learning algorithm isn't trained using labeled sample data.
A sequence of successful outcomes is reinforced to develop the best recommendation or policy
for a given problem. A minimal sketch of this idea follows below.
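As one illustration of the reward-and-feedback idea, here is a tabular Q-learning sketch on a hypothetical 5-state chain where reaching the rightmost state earns reward +1; all names and numbers are made up, and Q-learning is just one of many RL algorithms.

```python
# A minimal sketch of trial-and-error learning: tabular Q-learning on a
# hypothetical chain of 5 states (actions: 0 = left, 1 = right).
import numpy as np

n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, eps = 0.1, 0.9, 0.1   # learning rate, discount, exploration
rng = np.random.default_rng(0)

for _ in range(2000):
    s = 0
    while s < n_states - 1:
        # Mostly exploit the best-known action, sometimes explore.
        a = rng.integers(n_actions) if rng.random() < eps else int(Q[s].argmax())
        s_next = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
        r = 1.0 if s_next == n_states - 1 else 0.0   # reward at the goal
        # Feedback from the action reinforces the value estimate.
        Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
        s = s_next

print(Q.argmax(axis=1))  # learned policy: move right in every state
```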
Linear Regression
– Linear models
– A bidimensional example
– Hypothesis function for Linear Regression
– Cost function
Regression
Linear regression is a type of supervised machine learning algorithm that computes the linear
relationship between a dependent variable and one or more independent features.
When there is a single independent feature, it is known as univariate linear
regression; in the case of more than one feature, it is known as multivariate linear
regression.
It predicts a continuous output variable based on the independent input variables, such as the
prediction of house prices based on different parameters like house age, distance from the main
road, location, area, and size.
Linear Regression
The goal of the algorithm is to find the best linear equation that can predict the value of the
dependent variable based on the independent variables.
The equation provides a straight line that represents the relationship between the dependent and
independent variables.
The slope of the line indicates how much the dependent variable changes for a unit change in the
independent variable(s).
Linear regression is used in many different fields, including finance, economics, and psychology, to
understand and predict the behavior of a particular variable.
– For example, in finance, linear regression might be used to understand the relationship between
a company’s stock price and its earnings or to predict the future value of a currency based
on its past performance.
Linear Regression
Y is a dependent or target variable and X is an independent variable also known as the predictor
of Y.
There are many types of functions or models that can be used for regression.
– A linear function is the simplest type of function.
X may be a single feature or multiple features representing the problem.
Linear regression performs the task of predicting a dependent variable value (y) based on a given
independent variable (x). Hence the name Linear Regression.
In the figure, X (input) is the work experience and Y (output) is the salary of a person.
Hypothesis function for Linear Regression
Notation:
– m = number of training examples
– x = input variables / features
– y = output ("target") variable
• (x, y) - a single training example
• (x^(i), y^(i)) - a specific example (the i-th training example)
We have assumed that our independent feature is the experience x, and the respective salary y is
the dependent variable.
x (work experience in years)    y (salary)
2                               3000
3                               4000
4                               5000
5                               6000
10                              11000
12                              13000
Hypothesis function for Linear Regression
Let's assume there is a linear relationship between X and Y. Then the salary can be predicted using:
hθ(x) = θ0 + θ1x
The model gets the best regression fit line by finding the best θ0 and θ1 values.
θ0: intercept
θ1: coefficient of x, or gradient
These parameters are chosen so that hθ(x) is close to y for our training examples.
Once we find the best θ0 and θ1 values, we get the best-fit line. So when we finally use our
model for prediction, it will predict the value of hθ(x) for an input value of x.
– Different values give you different functions
• If θ0 is 1.5 and θ1 is 0, then we get a horizontal line at y = 1.5, parallel to the x-axis
• If θ1 is > 0, then we get a positive slope
– Think of hθ(x) as a "y imitator" - it tries to convert the x into y, and since we already
have y we can evaluate how well hθ(x) does this. A fit to the table above is sketched below.
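A minimal sketch that fits hθ(x) = θ0 + θ1x to the experience/salary table above; solving with a least-squares call is an assumption here (the lecture's cost-function route to the same θ values is discussed next).

```python
# A minimal sketch fitting hθ(x) = θ0 + θ1·x to the table above.
import numpy as np

x = np.array([2, 3, 4, 5, 10, 12], dtype=float)                    # experience
y = np.array([3000, 4000, 5000, 6000, 11000, 13000], dtype=float)  # salary

X = np.column_stack([np.ones_like(x), x])         # design matrix [1, x]
theta, *_ = np.linalg.lstsq(X, y, rcond=None)     # best θ0, θ1
theta0, theta1 = theta
print(theta0, theta1)        # ≈ 1000 and 1000: this table fits exactly
print(theta0 + theta1 * 7)   # predicted salary for 7 years: 8000
```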
Linear models
We can learn with a larger number of features
– So we may have other parameters which contribute towards the price of a house
– Size
– Age
– Number bedrooms
– Number floors
– x1, x2, x3, x4
– With multiple features becomes hard to plot
• Can't really plot in more than 3 dimensions
• Notation becomes more complicated too
– The best way to get around this is the notation of linear algebra
– It gives a notation and a set of things you can do with matrices and vectors
Now we have multiple features. A linear model is based on the assumption that it is possible to
approximate the output values through a regression process based on a linear rule. The hypothesis can be written as:
• hθ(x) = θ0 + θ1x1 + θ2x2 + θ3x3 + θ4x4 = θ0 + Σᵢ₌₁ⁿ θᵢxᵢ, where n is the number of features. A vectorized sketch follows below.
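A minimal sketch of the multi-feature hypothesis as a dot product; the θ values and house-feature values are made up for illustration.

```python
# A minimal sketch: the multi-feature hypothesis as a vector dot product.
import numpy as np

theta = np.array([50.0, 0.2, -1.5, 10.0, 5.0])  # θ0..θ4 (made-up values)
x = np.array([1.0, 1200, 15, 3, 2])             # x0 = 1, then size, age,
                                                # bedrooms, floors

h = theta @ x   # hθ(x) = θ0 + θ1·x1 + θ2·x2 + θ3·x3 + θ4·x4
print(h)
```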
Implementation: cost function (Cont.)
Hypothesis - is like your prediction machine: throw in an x value,
get a putative y value.
The cost function or the loss function is nothing but the error, or
difference, between the predicted value hθ(x) and the true value y.
It is the Mean Squared Error (MSE) between the predicted
value and the true value.
Cost function - is a way to use your training data to determine
values for your θ parameters that make the hypothesis as accurate as
possible.
• This cost function is a reasonable choice for most
regression problems
• It is probably the most commonly used cost function. A small sketch follows below.
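A minimal NumPy sketch of this cost function, J(θ0, θ1) = (1/2m) Σ (hθ(x^(i)) − y^(i))², using the salary data from the earlier table; the 1/(2m) scaling (rather than plain 1/m MSE) is the common convention and an assumption here.

```python
# A minimal sketch of the squared-error cost function J(θ0, θ1).
import numpy as np

def cost(theta0, theta1, x, y):
    m = len(x)
    h = theta0 + theta1 * x              # predictions hθ(x)
    return ((h - y) ** 2).sum() / (2 * m)

x = np.array([2, 3, 4, 5, 10, 12], dtype=float)
y = np.array([3000, 4000, 5000, 6000, 11000, 13000], dtype=float)
print(cost(1000, 1000, x, y))  # 0.0: these parameters fit the table exactly
print(cost(0, 1000, x, y))     # 500000.0: a worse fit costs more
```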
Cost function - a deeper look
Let's consider some intuition about the cost function and why we
want to use it
– The cost function determines the parameters
– The values chosen for the parameters determine how your
hypothesis behaves
– To achieve the best-fit regression line, the model aims to predict
the target value. So the θ0 and
θ1 values are updated to reach the values that minimize the error
between the predicted hθ(x) value and the true y value.
Simplified hypothesis
– Assumes θ0 = 0
The cost function and goal here are very similar to when we have θ0,
but with a simpler parameter
– The simplified hypothesis makes visualizing the cost function J(θ1) a bit
easier
So the hypothesis hθ(x) = θ1x passes through (0, 0)
Cost function - a deeper look (Cont.)
• Two key functions we want to understand:
– hθ(x)
• The hypothesis is a function of x - a function
of what the size of the house is
– J(θ1)
• A function of the parameter θ1
• Plot θ1 vs J(θ1) for the data. For example:
– 1) θ1 = 1, J(θ1) = 0
– 2) θ1 = 0.5, J(θ1) ≈ 0.58
– 3) θ1 = 0, J(θ1) ≈ 2.3
Cost function - a deeper look (Cont.)
If we compute J(θ1) for a range of θ1 values and plot
– J(θ1) vs θ1, we get a polynomial (it looks like a quadratic). The sketch below reproduces the values listed above.
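A minimal sketch that reproduces the J(θ1) values quoted above; the toy data set x = [1, 2, 3], y = [1, 2, 3] is an assumption, chosen because it matches the quoted numbers exactly.

```python
# A minimal sketch of J(θ1) for the simplified hypothesis hθ(x) = θ1·x.
import numpy as np

x = np.array([1.0, 2.0, 3.0])   # assumed toy data matching the quoted J values
y = np.array([1.0, 2.0, 3.0])
m = len(x)

for theta1 in (1.0, 0.5, 0.0):
    J = ((theta1 * x - y) ** 2).sum() / (2 * m)
    print(theta1, round(J, 2))   # 1.0 -> 0.0, 0.5 -> 0.58, 0.0 -> 2.33

# Sweeping θ1 over a range traces out the bowl-shaped (quadratic) curve.
thetas = np.linspace(-0.5, 2.5, 61)
Js = [((t * x - y) ** 2).sum() / (2 * m) for t in thetas]
print(min(Js))  # minimum J is 0, reached at θ1 = 1
```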