
Advanced Machine Learning Code: 18AI72

Module I - Advanced
Machine Learning

Dr. Varalatchoumy M
Prof. & Head – Dept. of AIML
Head – CHOSS,
Cambridge Institute of Technology, Bangalore
6.1 | OVERVIEW
Machine learning algorithms are a subset of artificial intelligence (AI) techniques that imitate the human learning process.
Humans learn how to perform a task through multiple experiences.
Similarly, machine learning algorithms develop multiple models (usually using multiple datasets), and each model is analogous to an experience.
Mitchell (2006) defined machine learning as follows:
A machine learns with respect to a particular task T, performance metric P, and experience E, if the system reliably improves its performance P at task T following experience E.
• Let the task T be a classification problem.
• Performance P can be measured through several metrics such as overall accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC).
• Experience E is analogous to the different classifiers generated by machine learning algorithms.
The major difference between statistical learning and machine learning is that statistical learning depends heavily on validation of model assumptions and hypothesis testing, whereas the objective of machine learning is to improve prediction accuracy.

For example, while developing a regression model, we check for assumptions such as normality of residuals, significance of regression parameters, and so on. However, in the case of a random forest built from classification trees, the most important objective is the accuracy/performance of the model.

Two broad classes of ML algorithms:
1. Supervised Learning: In supervised learning, the datasets have the values of input
variables (feature values) and the corresponding outcome variable. The algorithms learn
from the training dataset and predict the outcome variable for a new record with values
of input variables. Linear regression and logistic regression are examples of supervised
learning algorithms.
2. Unsupervised Learning: In this case, the datasets will have only input variable
values, but not the output. The algorithm learns the structure in the inputs. Clustering
and factor analysis are examples of unsupervised learning and will be discussed in
Chapter 7.
6.1.1 | How Do Machines Learn?
In supervised learning, the algorithm learns using a loss function (also called a cost function or error function), which is a function of the predicted output and the desired output. If h(X_i) is the predicted output and y_i is the desired output, then the loss function is

L = \sum_{i=1}^{n} \left( y_i - h(X_i) \right)^2

where n is the total number of records for which the predictions are made.
The function defined above is a sum of squared error (SSE).
SSE is the loss function for a regression model.
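As a quick illustration (a minimal sketch; the numbers here are made up), the SSE for a set of predictions can be computed with numpy:

import numpy as np

# Hypothetical desired outputs y_i and predicted outputs h(X_i)
y = np.array([3.0, 5.0, 7.0])
y_hat = np.array([2.5, 5.5, 6.0])

# SSE: sum over all n records of (y_i - h(X_i))^2
sse = np.sum((y - y_hat) ** 2)
print(sse)   # 0.25 + 0.25 + 1.0 = 1.5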
The objective is to learn the values of the parameters (aka feature weights) that minimize the cost function.
Machine learning uses optimization algorithms for minimizing the loss function.
The most widely used optimization technique is gradient descent.
In the next section, we will discuss a regression problem and understand how the gradient descent algorithm minimizes the loss function and learns the model parameters.

6.2 | GRADIENT DESCENT ALGORITHM


In this section, we will discuss how the gradient descent (GD) algorithm can be used to estimate the values of the regression parameters, given a dataset with inputs and outputs.
If the predicted output for record i is \hat{y}_i = b + \sum_j w_j x_{ij}, where b is the bias and w_j are the feature weights, then the error is given by

e_i = y_i - \hat{y}_i
6.2.1 | Developing a Gradient Descent Algorithm for Linear Regression Model

• For a better understanding of the GD algorithm, we will implement it using the dataset Advertising.csv.
• The dataset contains examples of advertisement spends across multiple channels, such as Radio, TV, and Newspaper, and the corresponding sales revenue generated at different time periods.

The dataset has the following elements:


1. TV – Spend on TV advertisements
2. Radio – Spend on radio advertisements
3. Newspaper – Spend on newspaper advertisements
4. Sales – Sales revenue generated

For predicting future sales using spends on different advertisement channels, we can build a regression
model.
6.2.1.1 Loading the Dataset
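A minimal sketch of this step (assuming Advertising.csv is in the working directory; the DataFrame name ad_df is our choice):

import pandas as pd

# Load the advertising dataset into a DataFrame
ad_df = pd.read_csv("Advertising.csv")
# Inspect the first few records
print(ad_df.head())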
6.2.1.2 Set X and Y Variables
For building a regression model, the inputs TV, Radio, and Newspaper are taken as the X features, and Sales is taken as the outcome variable Y.
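For instance (a sketch assuming the DataFrame ad_df loaded above and the column names listed earlier):

# X features: spends on the three advertisement channels
X = ad_df[["TV", "Radio", "Newspaper"]]
# Y: the outcome variable
Y = ad_df["Sales"]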
6.2.1.3 Standardize X and Y
It is important to bring all variables to one scale. This can be done by subtracting the mean from each value of a variable and dividing by the standard deviation of that variable.
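This z-score standardization can be written as follows (a sketch operating on the X and Y defined above):

# Standardize each variable: subtract the mean, divide by the standard deviation
X_scaled = (X - X.mean()) / X.std()
Y_scaled = (Y - Y.mean()) / Y.std()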
import numpy as np
import random

def initialize(dim):
    # dim - the number of weights to be initialized besides the bias
    np.random.seed(seed=42)
    random.seed(42)
    # Initialize the bias.
    b = random.random()
    # Initialize the weights.
    w = np.random.rand(dim)
    return b, w
To initialize the bias and three weights (as we have three input variables: TV, Radio, and Newspaper), we can invoke the initialize() method as follows:
b, w = initialize(3)
print("Bias:", b, "Weights:", w)
Method 2: Predict Y Values from the Bias and Weights
Calculate the Y values for all the inputs, given the bias and weights. We will use matrix multiplication of the weights with the input variable values. The matmul() method in the numpy library can be used for matrix multiplication. Each row of X is multiplied with the weights column to produce the predicted outcome variable.
def predict_Y(b, w, X):
    # Inputs:
    # b - bias
    # w - weights
    # X - the input matrix
    return b + np.matmul(X, w)
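Given the standardized inputs, the predictions can be obtained, for example, as follows (assuming X_scaled from the standardization step):

Y_hat = predict_Y(b, w, X_scaled.values)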
6.2.1.5 Finding the Optimal Bias and Weights
The updates to the bias and weights need to be done iteratively until the cost reaches its minimum. This can take many iterations and is time-consuming. There are two approaches to stopping the iterations:
1. Run a fixed number of iterations and use the bias and weights at the end of these iterations as the optimal values.
2. Run iterations until the change in cost is small, that is, less than a predefined value (e.g., 0.001).

We will define a method run_gradient_descent(), which takes alpha and num_iterations as parameters and invokes the methods initialize(), predict_Y(), get_cost(), and update_beta().

Also, inside the method:

1. the variable gd_iterations_df keeps track of the cost every 10 iterations;
2. a default value of 0.01 for the learning parameter and 100 for the number of iterations will be used.
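Below is one possible reconstruction of these methods (a sketch, not necessarily the textbook's exact code: the cost is taken as the halved, averaged SSE, and the gradient expressions follow from it; X and Y are assumed to be the standardized values as numpy arrays, and initialize() and predict_Y() are as defined above):

import numpy as np
import pandas as pd

def get_cost(Y, Y_hat):
    # Cost: sum of squared errors averaged over 2n records
    return np.sum((Y - Y_hat) ** 2) / (2 * len(Y))

def update_beta(X, Y, Y_hat, b_0, w_0, learning_rate):
    # Gradients of the cost with respect to the bias and the weights
    n = len(Y)
    db = -np.sum(Y - Y_hat) / n
    dw = -np.matmul(Y - Y_hat, X) / n
    # Move against the gradient by a step of size learning_rate
    b_1 = b_0 - learning_rate * db
    w_1 = w_0 - learning_rate * dw
    return b_1, w_1

def run_gradient_descent(X, Y, alpha=0.01, num_iterations=100):
    b, w = initialize(X.shape[1])
    iter_nums, costs = [], []
    for each_iter in range(num_iterations):
        Y_hat = predict_Y(b, w, X)
        cost = get_cost(Y, Y_hat)
        b, w = update_beta(X, Y, Y_hat, b, w, alpha)
        # Track the cost every 10 iterations
        if each_iter % 10 == 0:
            iter_nums.append(each_iter)
            costs.append(cost)
    gd_iterations_df = pd.DataFrame({"iteration": iter_nums, "cost": costs})
    return gd_iterations_df, b, w

It can be invoked, for example, as run_gradient_descent(X_scaled.values, Y_scaled.values, alpha=0.01, num_iterations=100).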
6.3.1 | Steps for Building Machine Learning Models

The steps to be followed for building and validating a machine learning model and measuring its accuracy are as follows (the train/test split in step 2 is sketched in code after the list):
1. Identify the features and the outcome variable in the dataset.
2. Split the dataset into training and test sets.
3. Build the model using the training set.
4. Predict the outcome variable for the test set.
5. Compare the predicted and actual values of the outcome variable in the test set and measure accuracy using measures such as mean absolute percentage error (MAPE) or root mean square error (RMSE).
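A sketch of the split in step 2, reusing the standardized X and Y from Section 6.2 (the 80/20 split ratio and the random seed are assumptions):

from sklearn.model_selection import train_test_split

# Hold out 20% of the records as the test set
X_train, X_test, y_train, y_test = train_test_split(
    X_scaled, Y_scaled, train_size=0.8, random_state=42)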
6.3.1.2 Building Linear Regression Model with Train Dataset
Linear models are included in the sklearn.linear_model module. We will use the LinearRegression method to build the model and compare it with the results we obtained through our own implementation of the gradient descent algorithm.
https://scikit-learn.org/stable/modules/linear_model.html
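A minimal sketch of fitting the model (assuming the train/test split from the previous step):

from sklearn.linear_model import LinearRegression

# Fit a linear regression model on the training set
linreg = LinearRegression()
linreg.fit(X_train, y_train)
# The learned parameters, comparable to the bias and weights from GD
print("Intercept:", linreg.intercept_)
print("Coefficients:", linreg.coef_)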
6.3.1.4 Measuring Accuracy
Root Mean Square Error (RMSE) and R-squared are two key accuracy measures for linear regression models.
The sklearn.metrics package provides methods to measure various metrics.
For regression models, mean_squared_error and r2_score can be used to calculate the MSE and R-squared values, respectively.

## Importing metrics from sklearn
from sklearn import metrics
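For example (a sketch assuming the fitted model linreg and the test set from the steps above):

import numpy as np

# Predict on the test set and compute the accuracy measures
y_pred = linreg.predict(X_test)
mse = metrics.mean_squared_error(y_test, y_pred)
rmse = np.sqrt(mse)
r2 = metrics.r2_score(y_test, y_pred)
print("RMSE:", rmse, "R-squared:", r2)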
6.3.2 | Bias-Variance Trade-off

Model errors can be decomposed into two components: bias and variance.
Understanding these two components is key to diagnosing model accuracy and avoiding overfitting or underfitting.
High bias can lead to an underfitting model, whereas high variance can lead to an overfitting model.

The term "variance" refers to the degree of change to be expected in the estimation of the target function as a result of using different sets of training data. The term "bias" refers to the disparity between the values predicted by the model and the values actually observed.
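One hypothetical way to observe the trade-off (an illustration of ours, not from the text; it reuses the train/test split above) is to compare training and test errors as model complexity grows, for example with polynomial features of increasing degree:

from sklearn import metrics
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

for degree in [1, 3, 10]:
    # Higher degree = more complex, more flexible model
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_train, y_train)
    train_mse = metrics.mean_squared_error(y_train, model.predict(X_train))
    test_mse = metrics.mean_squared_error(y_test, model.predict(X_test))
    # Underfitting: both errors high; overfitting: train error low, test error high
    print(degree, train_mse, test_mse)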
