Machine Learning
Machine learning is one of the most talked-about technologies today, and it is growing rapidly. We use machine learning in daily life, often without knowing it; Google Maps, Google Assistant, and Alexa are among the most widely used real-world applications of machine learning.
Training data: Training data is the data you feed to a machine learning model so that it can analyse it and discover patterns and dependencies. A training set has three main characteristics:
Size. The training set normally contains more data than the testing set. The more data you feed to the machine, the better the resulting model. Once a machine learning algorithm is provided with data from your records, it learns patterns from it and builds a model for decision-making.
Label. A label is the value we try to predict (the response variable). For example, if we want to forecast whether a patient will be diagnosed with cancer based on their symptoms, the response variable is Yes/No for the cancer diagnosis. Training data can be labelled or unlabelled, and both types are used in machine learning for different cases.
Case details. Algorithms make decisions based on the information you give them. You need to make sure the data is relevant and covers a variety of cases with different outcomes. For instance, if you need a model that can score potential borrowers, you need to include in the training set the information you normally collect about a potential client during the application process (a small illustrative sketch follows this list):
Name and contact details, location;
Demographics, social and behavioural characteristics;
Source of origin (Meta Ads, website landing page, third party, etc.)
Factors connected to the behaviour/activity on websites, conversions, time spent on
a website, number of clicks, and more.
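Below is a rough, purely hypothetical sketch of what such a labelled training set could look like in Python using pandas; none of the column names or values come from a real application.

import pandas as pd

# Hypothetical labelled training set for the borrower-scoring example.
training_data = pd.DataFrame({
    "location":       ["Pune", "Mumbai", "Delhi", "Nagpur"],
    "source":         ["Meta Ads", "landing page", "third party", "Meta Ads"],
    "time_on_site_s": [120, 35, 410, 60],           # behavioural feature
    "clicks":         [14, 3, 27, 5],               # behavioural feature
    "repaid_loan":    ["Yes", "No", "Yes", "No"],   # the label (response variable)
})

features = training_data.drop(columns=["repaid_loan"])  # what the model learns from
labels = training_data["repaid_loan"]                   # what the model tries to predict
print(features)
print(labels)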
Testing Data:
After the machine learning model is built, you need to check its work. The AI
platform uses testing data to evaluate the performance of your model and adjust
or optimize it for better forecasts. The testing set should have the following
characteristics:
Unseen. You cannot reuse the same information that was in the training set.
Large. The dataset should be large enough for the machine to make meaningful predictions.
Representative. The data should be representative of the actual data the model will encounter.
Luckily, you don't need to collect new data and compare predictions with actual data manually. The AI can split the existing data into two parts, put the testing set aside while training, and then run tests comparing predictions with actual results all by itself. Data science offers different options for splitting data, but the most common proportions are 70/30, 80/20, and 90/10. With a large dataset at hand, we can then check whether the resulting model makes good predictions or not.
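As a concrete sketch of such a split, the snippet below uses scikit-learn's train_test_split to hold out 30% of a synthetic dataset as the unseen testing set (a 70/30 split); the data and model are illustrative only.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(42)
X = rng.normal(size=(1000, 5))              # 1000 synthetic examples, 5 features
y = (X[:, 0] + X[:, 1] > 0).astype(int)     # a simple synthetic label

# Put 30% of the data aside as the unseen testing set (70/30 split).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, random_state=42)

model = LogisticRegression().fit(X_train, y_train)
print("accuracy on unseen test data:", model.score(X_test, y_test))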
The comparison of predictions with actual results is usually summarised by four outcomes (the confusion matrix). To make them simple to understand, consider the following definitions (a small counting example follows the list):
Wolf is the positive class
No wolf is the negative class
True Positive (TP): is the result that we get if we correctly predict the
positive class
False Positive (FP): is the outcome that we get if we predict a negative class
as a positive class
True Negative (TN): is the result that we get if we correctly predict the
negative class
False Negative (FN): is the outcome that we get if we predict a positive class
as a negative class
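The short sketch below counts these four outcomes for a handful of made-up wolf/no-wolf predictions using scikit-learn's confusion_matrix; the actual and predicted values are invented for illustration.

from sklearn.metrics import confusion_matrix

# 1 = wolf (positive class), 0 = no wolf (negative class)
actual    = [1, 0, 1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 1, 1, 0, 1, 0]

tn, fp, fn, tp = confusion_matrix(actual, predicted).ravel()
print(f"TP={tp}, FP={fp}, TN={tn}, FN={fn}")
# TP: predicted wolf and there was a wolf
# FP: predicted wolf but there was no wolf
# TN: predicted no wolf and there was no wolf
# FN: predicted no wolf but there was a wolf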
4. Cross-validation:
Cross-validation is a technique for evaluating ML models by training several ML
models on subsets of the available input data and evaluating them on the
complementary subset of the data. Use cross-validation to detect overfitting, i.e., failing
to generalize a pattern.
In Amazon ML, you can use the k-fold cross-validation method to perform cross-
validation. In k-fold cross-validation, you split the input data into k subsets of data
(also known as folds). You train an ML model on all but one (k-1) of the subsets, and
then evaluate the model on the subset that was not used for training. This process
is repeated k times, with a different subset reserved for evaluation (and excluded
from training) each time.
The following diagram shows an example of the training subsets and complementary
evaluation subsets generated for each of the four models that are created and trained
during a 4-fold cross-validation. Model one uses the first 25 percent of data for
evaluation, and the remaining 75 percent for training. Model two uses the second
subset of 25 percent (25 percent to 50 percent) for evaluation, and the remaining
three subsets of the data for training, and so on.
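A minimal sketch of that 4-fold split, using scikit-learn's KFold on a toy array of 20 rows (standing in for any dataset), is shown below; the printout mirrors the description above.

import numpy as np
from sklearn.model_selection import KFold

X = np.arange(20).reshape(20, 1)         # 20 toy examples
kf = KFold(n_splits=4, shuffle=False)    # 4 folds, taken in order

for i, (train_idx, eval_idx) in enumerate(kf.split(X), start=1):
    print(f"model {i}: evaluate on rows {eval_idx.min()}-{eval_idx.max()}, "
          f"train on the remaining {len(train_idx)} rows")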
Cross-validation is a technique in which we train our model using a subset of the dataset and then evaluate it using the complementary subset of the dataset. The three steps involved in cross-validation are as follows:
Reserve some portion of the sample dataset.
Train the model using the rest of the dataset.
Test the model using the reserved portion of the dataset.
Validation: In this method, we perform training on 50% of the given dataset and use the remaining 50% for testing. The major drawback is that, because we train on only 50% of the dataset, the other 50% may contain important information that the model never sees during training, i.e., higher bias.
LOOCV (Leave One Out Cross Validation): In this method, we train on the whole dataset except for a single data point, test on that one point, and iterate over every data point. It has advantages as well as disadvantages. An advantage is that we make use of all data points, so the bias is low. The major drawback is higher variance in the test results, since each test is made against a single data point; if that point is an outlier, the variance can be large. Another drawback is the execution time, as the procedure iterates as many times as there are data points.
K-Fold Cross Validation: In this method, we split the dataset into k subsets (known as folds), train on k-1 of the subsets, and leave one subset out for evaluating the trained model. We iterate k times, with a different subset reserved for testing each time. A comparative sketch of these three methods follows.
Dimensionality Reduction: An intuitive example is e-mail classification, where we need to decide whether an incoming e-mail is spam or not. This can involve a large number of features, such as whether or not the e-mail has a generic title, the content of the e-mail, whether the e-mail uses a template, etc. However, some of these features may overlap. Similarly, a classification problem that relies on both humidity and rainfall can be collapsed into just one underlying feature, since the two are correlated to a high degree. Hence, we can reduce the number of features in such problems. A 3-D classification problem can be hard to visualize, whereas a 2-D one can be mapped to a simple 2-dimensional space and a 1-D problem to a simple line. The concept can be illustrated by splitting a 3-D feature space into two 2-D feature spaces and, if the features turn out to be correlated, reducing the number of features even further.
Feature extraction: This reduces data in a high-dimensional space to a lower-dimensional space, i.e., a space with a smaller number of dimensions.
Methods of Dimensionality Reduction: The various methods used for dimensionality reduction include the following (a short PCA sketch follows the list):
Principal Component Analysis (PCA)
Linear Discriminant Analysis (LDA)
Generalized Discriminant Analysis (GDA)
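As a small example of feature extraction, the sketch below uses PCA from scikit-learn to project a synthetic 3-dimensional feature space onto 2 principal components; humidity and rainfall are made strongly correlated as in the example above, and temperature is an assumed third feature added only for illustration.

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
humidity = rng.normal(60, 10, size=200)                 # feature 1
rainfall = 0.8 * humidity + rng.normal(0, 2, size=200)  # strongly correlated with humidity
temperature = rng.normal(25, 5, size=200)               # assumed third feature
X = np.column_stack([humidity, rainfall, temperature])  # shape (200, 3)

pca = PCA(n_components=2)                               # keep 2 principal components
X_reduced = pca.fit_transform(X)                        # shape (200, 2)
print("explained variance ratio:", pca.explained_variance_ratio_)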