0% found this document useful (0 votes)

25 views12 pages

Advance ML - Unit 1

Uploaded by

bsmn027

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views12 pages

Advance ML - Unit 1

Uploaded by

bsmn027

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Self-Learning Material

[email protected]
svim0023

Program: MCA
Specialization: AI/ML
Semester: 3
Course Name: Advanced Machine Learning
Course Code: 21VMT6S305
Unit Name: MACHINE LEARNING – RECAP

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
MACHINE LEARNING – RECAP 3

LINEAR ALGEBRA 3
SOME TERMINOLOGIES AND THEIR DEFINITIONS 3
WHAT IS MACHINE LEARNING AND ITS IMPORTANCE? 4
MACHINE LEARNING LIFECYCLE 5
TYPES OF MACHINE LEARNING ALGORITHMS 6
SUPERVISED LEARNING 6
UNSUPERVISED LEARNING 6S
REINFORCEMENT LEARNING 6
LINEAR REGRESSION 6
ASSUMPTIONS OF LINEAR REGRESSION 8
POLYNOMIAL REGRESSION 8
RIDGE REGRESSION 9
LASSO REGRESSION 9
LOGISTIC REGRESSION 10
GENERALISED LINEAR MODELS (GLM) 10
ASSUMPTIONS OF GLM 11
COMPONENTS OF GLM 11

[email protected]
svim0023

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
MACHINE LEARNING – RECAP
Linear Algebra

Machines understand only numbers and they have to be represented in a way which allows
machines to learn from data in an efficient manner. In Machine Learning, the learning involves
finding parameters of a function that best fit the data.Linear Algebra is the mathematical
foundation that helps solve problem of representing data as well as computations in ML
models. Linear Algebra involves processing of vectors, matrices and tensors.

Using Linear Algebra allows us to represent data as vectors and allow us to perform matrix
operations like dot product

If we want to find similarity between two text documents – each document is represented as
vector and then cosine similarity is calculated which involves computing the dot product of the
two vectors as well as the magnitude of the two vectors.

In algorithms like PCA, Linear Algebra allows us to calculate the Eigen Vectors and use them
to reduce the dimensions of the data by getting I portant components, that are linear
combinations of the original features.

Today, in deep learning space which involves solving problems like Image Classification or
[email protected]
svim0023 Building Natural Language Models like Q&A Answering system or model that detects Toxic
Comments – we use tensors which allow for vectorized operations to learn patterns. Tensors
are nothing but arrays and using Linear Algebra we perform the mathematical operations.

In Recommendation Systems, representing the users and items as embeddings which are dense
vectors allow us to capture the information about the user and item and allow us to recommend
personalized items to users.

Some Terminologies and their Definitions

Artificial Intelligence: AI gives the machine the ability to imitate human behaviours. Work
on AI started way back in 1956, but it was the advent of availability of GPU’s that speeded the
AI boom

Machine Learning: Machine Learning is subset of AI that uses algorithms to process, learn
and make sense or predict the pattern of available data.

Deep Learning: Deep Learning is the subset of Machine Learning, which employs Neural
Networks for training data to achieve decision making. These methods try to mimic the human
brain.This is employed in problems like Image Classification, Natural Language
Understanding, Machine Translation etc.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
What is
[email protected] Machine Learning and Its Importance?
svim0023
Machine Learning is a subset of Artificial Intelligence that helps computer systems learn and
improve from experience by using large amount data to provide actionable insights which
includes predictions or detecting patterns in data. In today’s digital world, every single day
there is 2.5 quintillion bytes of data every single day and this number is set to increase rapidly
with the use of IoT (Internet of Things). Google in a day processes 20 petabytes of data.
With the rapid increase in data both in terms of volume and variety and affordable
computational power and high speed internet, Machine Learning has become important so that
we can make sense of data. These factors, make it possible for building models that can analyse
large and complex data. In Business, machine learning can help reduce costs, mitigate risks
and improve user experiences. In traditional institutions like Banking, Machine Learning plays
a key role in fraud detection, in identifying customers who can default on a loan, In automatic
cheque processing etc. With increase in access to data and computation power – applications
of machine learning can be found across domains and in every facet of human life.

Some applications of Machine Learning are : Recommendation Systems, Google Search,

Facebook Recommending Posts or Followers, Fraud Detection .

The performance of any Machine Learning Model is dependent on two major aspects:
1. Quality of Input Data : The most common saying you will hear in the Machine
Learning world “Garbage In, Garbage Out” – this simply means if your data is messy,
then even the most sophisticated machine learning algorithms will fail.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
2. The choice of Model: Not every problem needs to be solved with a Neural Network
or a ensembled model. It is important to always remember that many a times a simpler
model will yield better results. In some cases like in Banking, it is also important that
the model is interpretable – in such cases the focus is more on interpretability than on
improving the score by 2% or 3% - hence using a complex model may not be
acceptable. While choosing the model, it is important to keep the business goals in
mind.

Machine Learning Lifecycle

There are 5 basic steps involved in any machine learning task.

• Collecting the data: This step involves loading and reading the data from the data
sources. The data sources can include excel files, csv files, data from tables, raw text
files etc.

• Preparing and Understanding the Data: The quality of any machine learning model
is dependent on the quality of data. Ideally, in any Machine Learning project this is the
step where most of the time is spent. This involves identifying and handling outliers,
handling missing data, creating features from data. This allows involves performing
Exploratory Analysis which involves understanding the data and the relationship
[email protected] between variables.
svim0023
• Training a Model: This step involves dividing the data into training and validation
set. The validation set is not used for training but to check how well the model will
perform on unseen data. In training, the train set is used to develop a model. Cross
Validation techniques are used to understand performance of a model while training

• Evaluating a Model: This step uses the validation data to see how well the model
performs on unseen data. Various metrics like accuracy, precision,recall or f-score in
case of classification, RMSE,MSE etc in case of regression are measured and the model
is validated. The metrics that are achieved during training is compared against the
metrics achieved during evaluation on the validation set to check if the model in
underfitting or overfitting.

• Improving the Model: This step may involve either choosing a different model or
creating new features in the data that can help improve the quality of model or in some
cases – can also lead to collect more data.

Machine Learning Lifecyle is not straightforward but instead a cycle iterating between
improving the data, model and evaluation and is never really completed.

Following the cyclical approach is important because it focuses on using the model and its
results to refine your data.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Types of Machine Learning Algorithms

Supervised Learning

This is also known as predictive modelling and is used to predict future outcome based on
historical data. This is a task-driven approach. When the predicted outcome is a continuous
variable it is called Regression and when it is categorical it is called Classification. Some
examples of supervised learning are : Fraud Detection, Predicting which customers are most
likely to churn or Image Classification, Forecasting Sales .

Unsupervised
[email protected] Learning
svim0023
This is a data-driven learning approach. In this method there are no predefined outcome
variable and is used to identify patterns in data. One of the most commonly used method of
unsupervised learning is clustering. Examples of Unsupervised Learning are: customer
segmentation, topic modelling, identifying which products customers are most likely to buy
together
Reinforcement Learning

In this type of learning machines are trained to take specific decisions that maximise the
efficiency. The main idea behind this kind of learning is the machine learns from its
environment continuously and applies the knowledge to the business. In Reinforcement
Learning, the machine learns by interacting with the environment. Self-Driving cars is an
example of Reinforcement Learning

Linear Regression

Linear regression is a statistical model that allows to explain a dependent variable y based on
variation in one or multiple independent variables.

Linear Regression is type of model, where it is assumed that the relationship between the
dependent and independent variable is linear in nature. The equation for Linear Regression is
given as:

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
𝒀 =𝒄+𝒎∗𝑿+𝒆
where, Y is the dependent variable, X is the independent variable, ‘c’ is the intercept, ‘m’ is
the slope and ‘e’ is the Error Term

Linear Regression generated a regression line which is also called the line of Best Fit. To model
linear regression, there can be only one dependent and one independent variable or multiple
independent and one dependent variable.

Plotting the line of best fit between a dependent and independent variable can help understand
the relationship between them – for example a positive slope indicates a positive correlation
and negative slope indicates a negative correlation between the variables.
Also, to understand how Y changes when X changes, we can use the equation:
[email protected]
svim0023 Y=mX+c+e
If X increases by n units, then
X1=X+n
=> Y1=m(X+n)+c+e
=> Y1=mX+mn+c+e
=> Y1=Y+mn

If we have multiple independent variables then the equation can be given as:
𝒀 = 𝜷𝟎 + 𝜷𝟏 𝑿𝟏 + 𝜷𝟐 𝑿𝟐 + ⋯ + 𝜷𝒏 𝑿𝒏 + 𝒆

Where , 𝛽0 is the intercept, 𝛽1, 𝛽2 ,… 𝛽𝑛 are the slope for each independent variable,
𝑋1 , 𝑋2 … , 𝑋𝑛 are the independent variables and ‘e’ is the error term
In case of Multiple Linear Regression, the if 𝜷𝟏 is positive, it indicates that there is a positive
correlation between 𝑿𝟏 and Y.

If we increase X1 by n units then how much does Y change by?

Y1=𝛽0 + 𝛽1 (𝑋1 + 𝑛) + 𝛽2 𝑋2 + ⋯ + 𝛽𝑛 𝑋𝑛 + 𝑒
 Y1=Y+𝜷𝟏 𝒏

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
In a similar way, we can understand how various factors impact the outcome or the independent
variable using Linear Regression. Also, note one of the key assumptions of Multiple Linear
Regression is that there is no correlation between independent variables.
The cost function used for Linear Regression is OLS (Ordinary Least Squares). The cost
function that we use to determine the coefficients is given by

𝑛 𝑝

𝐶𝑜𝑠𝑡 𝐹𝑢𝑛𝑐𝑡𝑖𝑜𝑛 = ∑(𝑦𝑖 − ∑ 𝑥𝑖𝑗 𝛽𝑗 )2

𝑖=1 𝑗=1

In the above equation i represents the ith data observation where n is the total number of samples
in the data. 𝑦𝑖 is the dependent variable for the ith observation. j is the feature number where p
is the total number of features or independent variables. 𝑥𝑖𝑗 is the jth independent variable for
the ith row and 𝛽𝑗 is the coefficient associated with the jth feature

Assumptions of Linear Regression

1. Linearity: The relationship between the dependent and independent variables is linear
in nature
2. Homoscedasticity: The variance of the residual or the error term is constant. That is
the error term does not vary much as the value of the dependent variable changes.
3. Independence: Observations are independent of each other
[email protected]
svim0023 4. Normality: The residuals of Linear Regressions should follow a normal distribution.
5. Multicollinearity: There should be no or very little correlation between the
independent variables. Multicollinearity can be tested using Variance Inflation Factor
(VIF)

Polynomial Regression
When there is a non-linear relationship between dependent and independent variable, we use
polynomial regression analysis. It is like multiple linear regression the difference being instead
of a straight line a curve is fit.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Polynomial Regression

Ridge Regression
Multiple Linear Regression does not work well when there is correlation between independent
variables (there is multicollinearity) – to handle such scenarios we use L2 regularization. For
Ridge Regression, the cost function becomes:

𝑛 𝑝 𝑝
[email protected]
svim0023 𝐶𝑜𝑠𝑡 𝐹𝑢𝑛𝑐𝑡𝑖𝑜𝑛 = ∑(𝑦𝑖 − ∑ 𝑥𝑖𝑗 𝛽𝑗 ) + 𝜆 ∑ 𝛽𝑗2 2

𝑖=1 𝑗=1 𝑗=1

In the cost function when 𝜆 = 0, it becomes the same as OLS cost function. If 𝜆 is large it will
cause underfitting. Ridge Regression is one of the techniques to avoid overfitting.

Lasso Regression
Lasso Regression is also a regularization technique the helps in reducing overfitting. In Lasso
we use L1 Regularization. The cost function for Lasso Regression is:

𝑛 𝑝 𝑝

𝐶𝑜𝑠𝑡 𝐹𝑢𝑛𝑐𝑡𝑖𝑜𝑛 = ∑(𝑦𝑖 − ∑ 𝑥𝑖𝑗 𝛽𝑗 )2 + 𝜆 ∑ |𝛽𝑗 |

𝑖=1 𝑗=1 𝑗=1

The key difference between Lasso and Ridge regression is that Lasso shrinks the less important
feature coefficients to zero (hence helps with feature selection).

Both Ridge and Lasso Regression helps in reducing the complexity of the model especially
when you have large number of features, and hence reduce over-fitting

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
Logistic Regression

This is used when the target variable is categorical in nature. This is used for example to predict
whether the email is spam or ham or whether the tumor is malignant or benign.While Linear
Regression is unbounded, logistic regression predicts value between 0 and 1 which is nothing
but a probability score and is bounded. Generally, logistic regression is used for Binary
Classification, when there is multiple classes, we use Multinomial or Ordered Logistic
Regression.

The cost function used for Logistic Regression is sigmoid. Sigmoid function maps any
real-valued number to between 0 and 1.

[email protected]
svim0023

Cost Function of Logistic Regression is given as :

𝟏
𝑪𝒐𝒔𝒕 𝑭𝒖𝒏𝒄𝒕𝒊𝒐𝒏 =
𝟏+ 𝒆−(𝜷𝟎 +𝜷𝟏 𝑿)

As we can see, the Cost Function of Logistic Regression is the same as sigmoid function
applied on the Linear Regression Cost Function. The Logistic Regression returns a probability
score between 0 and 1. To determine which class the data belongs to we can set a probability
threshold.

For example: If there are two classes : spam and ham (spam=1 and ham is 0) and the
probability threshold is 0.5 and the predicted probability is 0.48, then the data point belongs to
ham . If the predicted probability is greater than the probability threshold then the data point
belongs to spam.

Generalised Linear Models (GLM)

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
In Linear Regression, one of the assumptions is that there is linear relationship between the
dependent variable (y) and independent variable (X) and the error distribution is normal. But,
in real-world there can be other relationships like exponential between y and X. GLM’s allow
us to build a linear relationship between response and predictors even though their underlying
relationship is not linear. This is achieved by using a “link” function. In GLM models, the
residuals are not assumed to follow a normal distribution.

If there is an exponential relationship between X and y as shown below, then we cannot use
Linear regression as Linearity Assumption is invalidated.

Assumptions of GLM

1. Independence: Observations are independent of each other

[email protected]
2. The distribution of the residuals need not be normal, but belong to any of the
svim0023
exponential distributions like binomial, Poisson, multinomial or normal
3. The dependent variable need not have an linear relationship with the independent
variables.
Note: Logistic Regression also belongs to the family of GLM’s.

Components of GLM

1. Linear Predictor
2. Link Function
3. Probability Distribution

If the distribution is a Poisson distribution then GLM equation becomes:

𝐥𝐧 𝝀𝒊 = 𝜷𝟎 + 𝜷𝟏 𝑿
𝒚𝒊 = 𝑷𝒐𝒊𝒔𝒔𝒐𝒏(𝝀𝒊 ) – this is the probability distribution function
where ln is the Link Function, 𝛽0 + 𝛽1 𝑋 𝑖𝑠 𝑡ℎ𝑒 𝐿𝑖𝑛𝑒𝑎𝑟 𝑃𝑟𝑒𝑑𝑖𝑐𝑡𝑜𝑟
 𝝀𝒊 = 𝒆𝒙𝒑(𝜷𝟎 + 𝜷𝟏 𝑿)
For Poisson Regression the link function is a log function.

For Linear Regressions, the probability distribution function is normal. The link function is
“identity function”

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.
 𝝁𝒊 = 𝜷𝟎 + 𝜷𝟏 𝑿 & 𝒚𝒊 = 𝚴(𝝁𝒊 , ∈)

For Logistic Regression, the probability distribution is Binomial and the link function is the
logit function (sigmoid function).

𝟏
 𝒒𝒊 = 𝟏+𝒆−(𝜷𝟎 +𝜷𝟏 𝑿) & 𝒚𝒊 = 𝐁𝐞𝐫𝐧(𝒒𝒊 )

[email protected]
svim0023

This file is meant for personal use by [email protected] only.
Sharing or publishing the contents in part or full is liable for legal action.

MACHINE LEARNING R23 Material
100% (11)
MACHINE LEARNING R23 Material
32 pages
Machine Learning?
100% (2)
Machine Learning?
114 pages
Introduction To Data Science Module 3
No ratings yet
Introduction To Data Science Module 3
24 pages
Introduction To Machine Learning - 2023
No ratings yet
Introduction To Machine Learning - 2023
44 pages
DSF - UNIT III Notes
No ratings yet
DSF - UNIT III Notes
17 pages
Data Management and Data Transformation, Introduction To Machine Learning
No ratings yet
Data Management and Data Transformation, Introduction To Machine Learning
54 pages
Load Out
100% (2)
Load Out
239 pages
Zinc Flake Coating Ex Geomet
No ratings yet
Zinc Flake Coating Ex Geomet
7 pages
Introduction To Machine Learning: Dr.S.Sankar Ganesh Vellore Institute of Technology
No ratings yet
Introduction To Machine Learning: Dr.S.Sankar Ganesh Vellore Institute of Technology
132 pages
Machine Learning.
No ratings yet
Machine Learning.
50 pages
Machine Learning Practical File
No ratings yet
Machine Learning Practical File
41 pages
Unit1 ML
No ratings yet
Unit1 ML
23 pages
Machine Learning
No ratings yet
Machine Learning
31 pages
S.No. Name of The Agency Contact Details: M/s M.P. Printers
100% (1)
S.No. Name of The Agency Contact Details: M/s M.P. Printers
3 pages
Machine Learning Notes
No ratings yet
Machine Learning Notes
19 pages
2019 Genes Ejercicio
No ratings yet
2019 Genes Ejercicio
543 pages
ENP Energy Efficient Free Cooling For Data Centers
No ratings yet
ENP Energy Efficient Free Cooling For Data Centers
16 pages
ML Unit-1
No ratings yet
ML Unit-1
139 pages
Machine Learning
No ratings yet
Machine Learning
3 pages
1.2.1 ML Intro
No ratings yet
1.2.1 ML Intro
18 pages
An Enlightenment To Machine Learning
100% (1)
An Enlightenment To Machine Learning
16 pages
Loop Breaker Manual
No ratings yet
Loop Breaker Manual
62 pages
PHD Thesis On Physics Education
100% (3)
PHD Thesis On Physics Education
5 pages
UNIT-1 Machine Learning
No ratings yet
UNIT-1 Machine Learning
43 pages
ML Notes-1
No ratings yet
ML Notes-1
59 pages
MLUnit - 1 Share
No ratings yet
MLUnit - 1 Share
162 pages
Understanding The Law of Resonance
No ratings yet
Understanding The Law of Resonance
13 pages
ML Unit-I Notes
No ratings yet
ML Unit-I Notes
86 pages
Mlunit 1
No ratings yet
Mlunit 1
139 pages
Ilpobservation Submission 1163492602
No ratings yet
Ilpobservation Submission 1163492602
11 pages
ML Unit-1
No ratings yet
ML Unit-1
15 pages
ML Notes
No ratings yet
ML Notes
101 pages
Thcs An Lac - Thi HK I. k9. 2020-2021
No ratings yet
Thcs An Lac - Thi HK I. k9. 2020-2021
8 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Machine: Learning ATO Z - I
No ratings yet
Machine: Learning ATO Z - I
131 pages
Gartner - SWOT SAS Institute
100% (1)
Gartner - SWOT SAS Institute
26 pages
Unit 1 - Machine Learning - NOTES1 - ML
No ratings yet
Unit 1 - Machine Learning - NOTES1 - ML
52 pages
Machine Learning (R20a0518)
No ratings yet
Machine Learning (R20a0518)
87 pages
Regional Plan - 2021 For National Capital Region: Addressing The Planned Growth of Delhi by Adopting Regional Approach.
No ratings yet
Regional Plan - 2021 For National Capital Region: Addressing The Planned Growth of Delhi by Adopting Regional Approach.
14 pages
ML 1
No ratings yet
ML 1
79 pages
Numerical Analysis: MATLAB Practical (Autumn 2020) B.E. III Semester Thapar Institute of Engineering & Technology Patiala
No ratings yet
Numerical Analysis: MATLAB Practical (Autumn 2020) B.E. III Semester Thapar Institute of Engineering & Technology Patiala
6 pages
Module - 1 Lecture-1
No ratings yet
Module - 1 Lecture-1
40 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
Unit 1
No ratings yet
Unit 1
38 pages
Unit Iii
No ratings yet
Unit Iii
41 pages
Lecture 2
No ratings yet
Lecture 2
36 pages
Intro To ML - 1
No ratings yet
Intro To ML - 1
29 pages
ML Cahp 1
No ratings yet
ML Cahp 1
35 pages
ML Lecture Notes Unit-1
No ratings yet
ML Lecture Notes Unit-1
45 pages
ML Mdu 2024 10939237
No ratings yet
ML Mdu 2024 10939237
20 pages
Data Science Vs Machine Learning Vs Deep Learning: The Difference
No ratings yet
Data Science Vs Machine Learning Vs Deep Learning: The Difference
19 pages
RDZ Search Options
No ratings yet
RDZ Search Options
74 pages
Module1 Introduction
No ratings yet
Module1 Introduction
35 pages
AI Unit 1
No ratings yet
AI Unit 1
30 pages
Minutes of Meeting Attendance: Present
No ratings yet
Minutes of Meeting Attendance: Present
3 pages
Machine Learning Lecture-01
No ratings yet
Machine Learning Lecture-01
37 pages
Using The TI-73:: A Guide For Teachers
No ratings yet
Using The TI-73:: A Guide For Teachers
86 pages
Tirth PDF
No ratings yet
Tirth PDF
19 pages
ML Module 4
No ratings yet
ML Module 4
25 pages
Chapter 01 Machine Learning
No ratings yet
Chapter 01 Machine Learning
22 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
ML Report
No ratings yet
ML Report
19 pages
MMT Bus E-Ticket Nu 25147911932077 Hyderabad-Pune
No ratings yet
MMT Bus E-Ticket Nu 25147911932077 Hyderabad-Pune
2 pages
Module 1
No ratings yet
Module 1
22 pages
New Microsoft Word Document (3) BBBB
No ratings yet
New Microsoft Word Document (3) BBBB
85 pages
ML Unit 1
No ratings yet
ML Unit 1
20 pages
CFF Regular
No ratings yet
CFF Regular
2 pages
MLT Unit-1
No ratings yet
MLT Unit-1
19 pages
Lec 001
No ratings yet
Lec 001
17 pages
Exercise 37. Read and Find The Appropriate Translation For The Words Below in The Text
No ratings yet
Exercise 37. Read and Find The Appropriate Translation For The Words Below in The Text
3 pages
IGCSE Chemistry AO3 G10-2 Sungbeen Hong
No ratings yet
IGCSE Chemistry AO3 G10-2 Sungbeen Hong
14 pages
Disruptive Technologies AI Lecture 2
No ratings yet
Disruptive Technologies AI Lecture 2
12 pages
Karthik
No ratings yet
Karthik
10 pages
DLL Spa 1
No ratings yet
DLL Spa 1
2 pages
ML - Part - A
No ratings yet
ML - Part - A
10 pages
Adv PT1
No ratings yet
Adv PT1
23 pages
Machine Learning
No ratings yet
Machine Learning
8 pages
Sun and Eames in ST of Energy 1995
No ratings yet
Sun and Eames in ST of Energy 1995
16 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
9 pages
ML 2
No ratings yet
ML 2
4 pages
Cbsyllabus Bda 1
No ratings yet
Cbsyllabus Bda 1
4 pages
Presenttion 33
No ratings yet
Presenttion 33
2 pages
BUSS 1020 - Quantitative Business Analysis Individual ASSIGNMENT Semester 2, 2015
No ratings yet
BUSS 1020 - Quantitative Business Analysis Individual ASSIGNMENT Semester 2, 2015
3 pages
Quotation for Air cond - 240108 - eng version (giá gốc)
No ratings yet
Quotation for Air cond - 240108 - eng version (giá gốc)
3 pages
Studies Soil Improvement of An Expansive Soil Using Addiction of Lime (Caco3)
No ratings yet
Studies Soil Improvement of An Expansive Soil Using Addiction of Lime (Caco3)
4 pages
SYNTHESIS
No ratings yet
SYNTHESIS
2 pages
Characteristics (Typical Figures) Agip Arum HT 220
No ratings yet
Characteristics (Typical Figures) Agip Arum HT 220
1 page
Fundamentals of Machine Learning: a Simplified Approach
From Everand
Fundamentals of Machine Learning: a Simplified Approach
Er. Sudhir Goswami
No ratings yet
Mastering Classification Algorithms for Machine Learning: Learn how to apply Classification algorithms for effective Machine Learning solutions (English Edition)
From Everand
Mastering Classification Algorithms for Machine Learning: Learn how to apply Classification algorithms for effective Machine Learning solutions (English Edition)
PARTHA MAJUMDAR
No ratings yet

Advance ML - Unit 1

Uploaded by

Advance ML - Unit 1

Uploaded by

Self-Learning Material

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

Some Terminologies and their Definitions

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

Some applications of Machine Learning are : Recommendation Systems, Google Search,

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

Machine Learning Lifecycle

There are 5 basic steps involved in any machine learning task.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

If we increase X1 by n units then how much does Y change by?

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

𝐶𝑜𝑠𝑡 𝐹𝑢𝑛𝑐𝑡𝑖𝑜𝑛 = ∑(𝑦𝑖 − ∑ 𝑥𝑖𝑗 𝛽𝑗 )2

Assumptions of Linear Regression

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

𝑖=1 𝑗=1 𝑗=1

𝐶𝑜𝑠𝑡 𝐹𝑢𝑛𝑐𝑡𝑖𝑜𝑛 = ∑(𝑦𝑖 − ∑ 𝑥𝑖𝑗 𝛽𝑗 )2 + 𝜆 ∑ |𝛽𝑗 |

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

Cost Function of Logistic Regression is given as :

Generalised Linear Models (GLM)

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

1. Independence: Observations are independent of each other

If the distribution is a Poisson distribution then GLM equation becomes:

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

Proprietary content. All rights reserved. Unauthorized use or distribution prohibited.

You might also like