0% found this document useful (0 votes)

13 views65 pages

Lec-1 Introduction

The document introduces Machine Learning (ML) and its various types, including supervised, unsupervised, semi-supervised, and reinforcement learning, along with practical examples. It discusses the significance of decision trees in ML, covering concepts like overfitting, pruning, and different types of data and features. The document also highlights the historical development of ML and its applications across various industries.

Uploaded by

tarun.nemaai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views65 pages

Lec-1 Introduction

Uploaded by

tarun.nemaai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 65

Machine Learning Introduction

Lecture-1
Topics Covered
• Introduction to Machine Learning; Decision Trees:

• Overview of Supervised (regression and classification), unsupervised (clustering and dimensionality

reduction), semi-supervised, and reinforcement learning with practical examples - Machine learning
nomenclature: raw data, types of features and outputs, feature vector.

• Decision tree model of learning - Classification and regression using decision trees - Splitting criteria:
entropy, information gain, Gini impurity – Overfitting & Pruning in decision trees.
Introduction
• Machine Learning (ML) is considered as the most dynamic and
progressive form of human-like Artificial Intelligence.

• Today ML is being used extensively in various industries like

automobiles, genetics, medicine, finance etc. to automate
procedures, in reducing the processing time and to remove the
possibility of human errors.

• ML helps in analyzing at a large scale, thus helping in making quicker

and better decisions.

8/13/2024 Arockiaraj S 3
Artificial Intelligence (AI) & Machine Learning (ML)?
Artificial intelligence:
• Artificial intelligence is the name
given to the process in which the
computer makes decisions,
mimicking a human.
AI ML DL
Machine learning:
• Computer makes decisions
based on experience. Computing
Growing popularity

• The word “Big Data” you keep hearing about is mainly made possible
through ML.

8/13/2024 Arockiaraj S 5
Growing popularity
Machine Learning Market Size, Share & Trends Analysis Report

8/13/2024 Arockiaraj S 6
(banking, financial services and insurance)

https://www.grandviewresearch.com/industry-analysis/machine-learning-market

8/13/2024 Arockiaraj S 7
8/13/2024 Arockiaraj S 8
Machine Learning

We are going from programming computers to training computers.

8/13/2024 Arockiaraj S 9
Programming and Machine Learning
Programming Solution

Machine Learning solution

Input Program Data Output

Computer Algorithm - Machine

Output Learn through experience

8/13/2024 Arockiaraj S 10
Machine Learning - Definition
• 1997, Tom Mitchell gave a definition:
Machine Learning Scientist,
Carnegie Mellon University

A computer program is said to learn from experience E with respect

to some class of tasks T and performance measure P, if its
performance at tasks in T, as measured by P, improves with
experience E.

8/13/2024 Arockiaraj S 11
What Machine Learning does???

Training Decision Function

Examples / Hypothesis

New Query?

8/13/2024 Arockiaraj S 12
Introduction – History
• 1950s
• Arthur Samuel (IBM)
• Program – playing Checkers game

• 1960s
• Rosenblatt
• Perceptron - Neural Network Model
• Pattern Recognition
• Later delta learning rule
• Rule for perceptron learning
• Good classifier
Introduction – History
• 1950s • 1969
• Minsky and Papert
• Arthur Samuel (IBM) • Limitation of perceptron model
• Program – playing Checkers game • Problem could not be represented
• Inseparable data distribution

• 1960s • 1970s
• Rosenblatt • Symbolic concept (AI)
• Perceptron - Neural Network Model
• Pattern Recognition • 1986
• Later delta learning rule • Quinlan – Decision Tree
• Rule for perceptron learning • ID3 Algorithm
• Good classifier • Improved: Regression
• Still popular in ML
Introduction – History
• 1990s: Machine learning involved
• 1997 – Deep Blue beats Garry Kasparov
statistics to a large extent

• 1994 – Self driving car road test • 2009 – Google builds self driving car

• In 1995 –Support Vector Machines • 2011 – Watson wins Jeopardy

(SVMs) • 2015 - machine translation systems

• In 1997, ensembles or boosting - driven by NN - better than statistical

algorithm for classification machine translation systems

Current status
• Today,
• Algorithms – developed for learning
tasks

• Theoretical understanding - emerged

• Practical computer programs -

developed

• Commercial applications - appear.

Data and Features
What is data? What are features?
• Data is simply a table with information • Features are simply the columns of the table.
• Each row is a data point • Features may be size, name, type, weight, etc.
• Each row represented by certain • Some features are special, and we call them labels.
features

NO. SIZE COLOR SHAPE FRUIT NAME

1 Big Red Rounded shape with a depression at the top Apple

2 Small Red Heart-shaped to nearly globular Cherry

3 Big Green Long curving cylinder Banana

4 Small Green Round to oval, Bunch shape Cylindrical Grape

What is the difference between labelled and unlabelled data?

Labels?
• If we are trying to predict a feature based on the others, that feature is the label.
• Labeled data: Data that comes with a label. • The set of algorithms
• Unlabeled data: Data that comes without a label in which we use a
labeled dataset is
called supervised
learning.

• The set of algorithms

in which we use an
unlabeled dataset, is
called unsupervised
learning.
Types of Machine Learning
• Supervised Learning
• Unsupervised Learning
• Reinforcement Learning
• Deep Learning
Supervised Learning

• A branch of machine learning that works with

labeled data.
• Some of the most common applications:
• image recognition
• various forms of text processing
• recommendation systems.

• Goal of a supervised learning model: predict the

labels.
Types of labeled datasets
• Numbers and states are the two types of data
used in supervised learning models.

• In this dataset, the labels are numbers.

• We call this type - numerical data

• numerical data - is any type of data that uses

numbers such as 4, 2.35, or –199. In this example, each data
• Example: prices, sizes, or weights. point in the dataset is labeled
with the weight of the animal.
Types of labeled datasets
• In this dataset, the labels are states.

• We call this type - categorical data.

• categorical data - is any type of data that uses

categories, or states, such as male/female or
cat/dog/bird.

• For this type of data, we have a finite set of

categories to associate to each of the data
points.
each data point in the dataset is
labeled with the type of animal (dog
or cat)
Types of supervised learning models
• regression models are the types of models that predict numerical
data.
• The output of a regression model is a number, such as the weight of
the animal.

• classification models are the types of models that predict categorical

data.
• The output of a classification model is a category, or a state, such as
the type of animal (cat or dog).
Example of
regression
model
• Model 1: housing prices model.
• Each data point is a house.
• The label of each house is its
price.
• Goal: when a new house (data
point) comes on the market, we
would like to predict its label
(price).
Example of
classification
model
• Model 2: email spam–detection model.
• Each data point is an email.
• The label of each email is either spam
or ham.
• Goal: when a new email (data point)
comes into our inbox, we would like to
predict its label (whether it is spam or
ham).
Examples of supervised learning models
• Difference between models 1 and 2.

• Housing prices model, can return a number from many possibilities,

such as $100, $250,000, or $3,125,672.33.
• Thus, it is a regression model.

• The spam detection model, can return only two things: spam or ham.
Thus, it is a classification model.
Regression models
• Predict numbers based on the features
• In the housing example, the features can be anything that describes a
house, such as the size, the number of rooms, the distance to the
closest school, or the crime rate in the neighborhood.

House size No. of rooms Distance to school Crime rate in the neighborhood Price

IDV IDV IDV IDV DV

Other applications of regression model:
• Stock market: predicting the price of a certain stock based on other stock
prices and other market signals

• Medicine: predicting the expected life span of a patient or the expected

recovery time, based on symptoms and the medical history of the patient

• Sales: predicting the expected amount of money a customer will spend,

based on the client’s demographics and past purchase behavior

• Video recommendations: predicting the expected amount of time a user

will watch a video, based on the user’s demographics and other videos
they have watched
Linear
regression
• The most common method
used for regression is linear
regression, which uses linear
functions (lines or similar
objects) to make our
predictions based on the
features.
Supervised Learning
Learning the physical characters of
fruits through training.

Apple:
• Size: Big
• Color: Red
• Shape: Rounded shape with a
depression at the top

<Apple> <Big, Red, Rounded shape with a depression at the top>

Supervised Learning
Learning the physical characters of fruits
through training.

Cherry:
• Size: Small
• Color: Red
• Shape: Heart-shaped to nearly globular

<Cherry> <Small, Red, Heart-shaped to nearly globular>

Supervised Learning
Learning the physical characters of fruits through training.

Banana:
• Size: Big
• Color: Green
• Shape: Long curving cylinder
Supervised Learning
Learning the physical characters of fruits
through training.

Grape:
• Size: Small
• Color: Green
• Shape: Round to oval, Bunch shape
Cylindrical
Supervised Learning
Machine already learned about the fruits through training.
Input: <Big, Red, Rounded shape>
Response: <Apple>
Input: <Small, Red, Heart-shaped>
Response: <Cherry>
Apply that
Input: <Big, Green, Long curving cylinder> knowledge to
Response: <Banana> the test data
Input: <Small, Green, Round to oval shape>
Response: <Grape>
Supervised Learning
• Already learned about the physical characters of fruits through
training.

NO. SIZE COLOR SHAPE FRUIT NAME

Rounded shape with a depression at
1 Big Red Apple
the top
2 Small Red Heart-shaped to nearly globular Cherry
3 Big Green Long curving cylinder Banana
Round to oval, Bunch shape
4 Small Green Grape
Cylindrical
Supervised Learning
Input attributes + output

Apple

Decision Function
/ Hypothesis

Orange

Supervised Classification

For a training input, output is known

Unsupervised Learning

Decision Function
/ Hypothesis

Unsupervised Classification
Unsupervised Learning
• Consider physical character of that particular fruit.
• Suppose you have considered color.
• Arrange them on considering base condition as color
• Then the groups will be some thing like this.

RED COLOR GREEN

GROUP: COLOR
apples & GROUP:
cherry fruits bananas &
grapes
Unsupervised Learning
GREEN
RED COLOR COLOR
GROUP: GROUP:
apples & bananas &
cherry fruits grapes

SMALL
SMALL BIG SIZE
SIZE SIZE
BIG SIZE grapes
bananas
apples cherry
Unsupervised learning

• Machine learning algorithms that

works with unlabeled data.

• MLA must extract as much

information as possible from a
dataset (has no labels, or targets)
to predict.
• Determine - two pictures are
similar or different
Unsupervised learning
• Even if the labels are there, we can still use unsupervised learning
techniques on our data to preprocess it and apply supervised learning
methods more effectively.

• clustering algorithms The algorithms that group data into clusters based
on similarity.

• dimensionality reduction algorithms The algorithms that simplify our data

and describe it with fewer features

• generative algorithms The algorithms that can generate new data points
that resemble the existing data
Clustering
• Consider the two datasets used in “Supervised learning”—the housing
dataset and the email dataset.

• Imagine that they have no labels

• House price prediction – price is not available
• Email classification – spam or ham is not available.

• housing dataset - What can we do with this dataset?

• Here is an idea: we could somehow group the houses by similarity.
• For example, we could group them by location, size, or a combination of these
factors.
• This process is called clustering.

Clustering is an unsupervised machine learning - group the elements in our dataset into
clusters where all the data points are similar.
Clustering
• Example – email dataset
• The dataset is unlabeled, we don’t know whether each email is spam or ham.

• We can apply some clustering to this dataset.

• Group the emails – based on the number of words in the message, the
sender, the number and size of the attachments, or the types of links inside
the email.
• After clustering the dataset, a human (or a combination of a human and a
supervised learning algorithm) could label these clusters by categories such as
“Personal,” “Social,” and “Promotions.”
cluster the emails into three categories based on size and
number of recipients

Social

Promotions
Personal
Other applications of clustering
• Market segmentation: dividing customers into groups based on
demographics and previous purchasing behavior to create different
marketing strategies for the groups

• Genetics: clustering species into groups based on gene similarity

• Medical imaging: splitting an image into different parts to study different

types of tissue

• Video recommendations: dividing users into groups based on

demographics and previous videos watched and using this to recommend
to a user the videos that other users in their group have watched
Popular clustering algorithms
• K-means clustering: this algorithm groups points by picking some random centers of
mass and moving them closer and closer to the points until they are at the right spots.

• Hierarchical clustering: this algorithm starts by grouping the closest points together and
continuing in this fashion, until we have some well-defined groups.

• Density-based spatial clustering (DBSCAN): this algorithm starts grouping points

together in places with high density, while labeling the isolated points as noise.

• Gaussian mixture models: this algorithm does not assign a point to one cluster but
instead assigns fractions of the point to each of the existing clusters.
• For example, if there are three clusters, A, B, and C, then the algorithm could determine
that 60% of a particular point belongs to group A, 25% to group B, and 15% to group C.
Dimensionality reduction
• Simplifies data without losing too much information

• Example: housing dataset

Imagine the features are the following:

• C1: Size
• C2: Number of bedrooms
• C3: Number of bathrooms
• C4: Crime rate in the neighborhood
• C5: Distance to the closest school

This dataset has five columns of data.

What if we wanted to turn the dataset into a simpler one with fewer columns, without
losing a lot of information?
Dimensionality
reduction
• first three features are
similar, because they are all
related to the size of the
house.
• fourth and fifth features are
similar to each other,
because they are related to
the quality of the
neighborhood.
• If we have a table full of data, each row corresponds to a data
point, and each column corresponds to a feature.
Clustering &
dimensionality reduction • we can use clustering to reduce the number of rows in our dataset
and dimensionality reduction to reduce the number of columns
Other ways of simplifying our data: Matrix factorization and singular value
decomposition

• How can we reduce both the rows and the columns at the same time?

• matrix factorization and singular value decomposition (SVD).

• These two algorithms express a big matrix of data into a product of smaller matrices

• Netflix use matrix factorization extensively to generate recommendations.

• a large table where each row corresponds to a user, each column to a movie, and each entry in
the matrix is the rating that the user gave the movie.

• With matrix factorization, one can extract certain features, such as type of movie, actors
appearing in the movie, and others, and be able to predict the rating that a user gives a movie,
based on these features.
• Two common types of unsupervised learning algorithms are
clustering and dimensionality reduction.

• Clustering is used to group data into similar clusters to extract information or

make it easier to handle.

• Dimensionality reduction is a way to simplify our data, by joining certain

similar features and losing as little information as possible.

• Matrix factorization and singular value decomposition are other algorithms

that can simplify our data by reducing both the number of rows and columns.
• Generative machine learning is an innovative type of unsupervised
learning, consisting of generating data that is similar to our dataset.
• Generative models can paint realistic faces, compose music, and write poetry.
Goal:
To solve problems that cannot be
solved by numerical means alone

ML / DL

8/13/2024 Arockiaraj S 54
Machine Learning - Examples
• In general, to have a well-defined learning problem, we must identity these
three features:
• The class of tasks (T)
• The measure of performance (P)to be improved and
• The source of experience (E)

A chess learning problem:

• Task T: playing chess
• Performance measure P: % of games won against opponents (70%)
• Training experience E: playing practice games against itself
Machine Learning - Examples
• A handwriting recognition learning
problem:

• Task T: recognizing and classifying

handwritten words within images

• Performance measure P: percent of

words correctly classified

• Training experience E: database of

handwritten words
Machine Learning - Examples
• A robot driving learning problem:

• Task T: driving on public four-lane highways

• Performance measure P: average distance traveled before an error

• Training experience E: sequence of images and steering commands recorded

while observing a human driver
Applications of Machine Learning
• Speech and Hand Writing Recognition
• Robotics (Robot locomotion)
• Search Engines (Information Retrieval)
• Learning to Classify new astronomical structures
• Medical Diagnosis
• Learning to drive an autonomous vehicle
• Computational Biology/Bioinformatics
• Computer Vision (Object Detection algorithms)
•
•
Detecting credit card fraud
Stock Market analysis
ML solves problems
• Game playing that cannot be
solved by numerical
• ………………..
• ………………….
means alone
Reinforcement Learning
Reward based learning

Wake-
up had
bath
• Reinforcement Learning 10

• Rewards (+ve or –ve)

• Wake-up, brushed 1
teeth, had bath, had
breakfast, reach college brush
had
teeth Attend 15 breakfast
offline
8 class
Types of Machine Learning
• Supervised Learning
• Unsupervised Learning
• Reinforcement Learning
• Deep Learning
Deep Learning

In the past few years, Deep Learning has generated much

excitement in Machine Learning

Many breakthrough results in speech recognition, computer vision

and text processing.

8/13/2024 Arockiaraj S 61
Deep Learning

“very large neural networks and huge amounts of data that we have access to”
8/13/2024 Arockiaraj S 62
Why Deep Learning?
- Performance…

8/13/2024 Arockiaraj S 63
Low level representation

8/13/2024 Arockiaraj S 64
Thank you

ML Merged
No ratings yet
ML Merged
433 pages
21CSC305P ML - Unit 1-E
No ratings yet
21CSC305P ML - Unit 1-E
137 pages
MachineLearning Jan2nd
100% (2)
MachineLearning Jan2nd
171 pages
1 Introduction To ML
No ratings yet
1 Introduction To ML
43 pages
Dorks For Dorks
No ratings yet
Dorks For Dorks
2 pages
Process Analysis and Simulation in Chemical Engineering
No ratings yet
Process Analysis and Simulation in Chemical Engineering
5 pages
Fundamentals of Machine Learning II
No ratings yet
Fundamentals of Machine Learning II
13 pages
6.EBS1-PTFA27-SAQA-PLQA-1002-D00 - Project Quality Plan
No ratings yet
6.EBS1-PTFA27-SAQA-PLQA-1002-D00 - Project Quality Plan
28 pages
UNit 1 Introduction To ML
No ratings yet
UNit 1 Introduction To ML
225 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
19 pages
ML - 1 - Sovan - Introduction To ML
No ratings yet
ML - 1 - Sovan - Introduction To ML
83 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
Chapter 01 Notes
No ratings yet
Chapter 01 Notes
11 pages
Lecture 02
No ratings yet
Lecture 02
34 pages
Module 1-Basics of ML
No ratings yet
Module 1-Basics of ML
142 pages
Aiml Mca
100% (1)
Aiml Mca
38 pages
Lecture 02
No ratings yet
Lecture 02
34 pages
Machine Learning
No ratings yet
Machine Learning
28 pages
Lecture 6 - AI and ML
No ratings yet
Lecture 6 - AI and ML
32 pages
W1 - Introduction To ML
No ratings yet
W1 - Introduction To ML
57 pages
DLP PHY G10 W8 D1 Rev
No ratings yet
DLP PHY G10 W8 D1 Rev
7 pages
Unit 1 ML
No ratings yet
Unit 1 ML
96 pages
Lecture 1.1. Introduction
No ratings yet
Lecture 1.1. Introduction
48 pages
Unit-3 Machine Learning
No ratings yet
Unit-3 Machine Learning
81 pages
Introduction To AI
No ratings yet
Introduction To AI
51 pages
WEEK 01 Merged
No ratings yet
WEEK 01 Merged
606 pages
Chapter 01 Introduction To ML
No ratings yet
Chapter 01 Introduction To ML
178 pages
Machine Learning Notes
100% (3)
Machine Learning Notes
134 pages
Technical Brief Stats Concepts 19c
No ratings yet
Technical Brief Stats Concepts 19c
27 pages
4.1 Machine Learning Basics
No ratings yet
4.1 Machine Learning Basics
26 pages
ML Unit-1 Notes
No ratings yet
ML Unit-1 Notes
23 pages
01 - ML - Introduction
No ratings yet
01 - ML - Introduction
65 pages
AI
No ratings yet
AI
52 pages
Machine Learning
No ratings yet
Machine Learning
122 pages
Chapter 5 Machine Learning
No ratings yet
Chapter 5 Machine Learning
96 pages
ML Unit1
No ratings yet
ML Unit1
15 pages
ML - Module 1
No ratings yet
ML - Module 1
30 pages
Basics of Machine Learning and Deep Learning
No ratings yet
Basics of Machine Learning and Deep Learning
49 pages
Intro To ML
No ratings yet
Intro To ML
34 pages
Chapter 4 - Machine Learning
No ratings yet
Chapter 4 - Machine Learning
81 pages
Lect3 Machine Learning
No ratings yet
Lect3 Machine Learning
27 pages
Machine Learning - UNIT I Notes
No ratings yet
Machine Learning - UNIT I Notes
31 pages
Ch3-Machine Learning
No ratings yet
Ch3-Machine Learning
124 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
68 pages
VCF 50 Design
No ratings yet
VCF 50 Design
253 pages
Chapter 2
No ratings yet
Chapter 2
35 pages
Carried Interest Directory
100% (1)
Carried Interest Directory
108 pages
CHP 1
No ratings yet
CHP 1
47 pages
Presentation On ML
No ratings yet
Presentation On ML
469 pages
1 Introduction
No ratings yet
1 Introduction
24 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
36 pages
CE880 Lecture5 Slides
No ratings yet
CE880 Lecture5 Slides
32 pages
ML Unit 1
No ratings yet
ML Unit 1
19 pages
1 - Introduction
No ratings yet
1 - Introduction
82 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
Ai Unit-4-1
No ratings yet
Ai Unit-4-1
9 pages
Machine Learning Types
No ratings yet
Machine Learning Types
30 pages
Unit 1
No ratings yet
Unit 1
24 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Drag Force: The Basics of Transport Phenomena
No ratings yet
Drag Force: The Basics of Transport Phenomena
12 pages
Machine Learning IAI
No ratings yet
Machine Learning IAI
94 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Outscraper-2024050715234182c9 Kedai Plastik
No ratings yet
Outscraper-2024050715234182c9 Kedai Plastik
33 pages
C 0.8 Reference Manual: Ardpeek
No ratings yet
C 0.8 Reference Manual: Ardpeek
60 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Ch7 Introduction To Machine Learning
No ratings yet
Ch7 Introduction To Machine Learning
29 pages
Plan 2023-PhD-Training - Martin - Marañón
No ratings yet
Plan 2023-PhD-Training - Martin - Marañón
3 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
4 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
4 pages
Technology Management
No ratings yet
Technology Management
15 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Operations Management Term Paper Topics
100% (1)
Operations Management Term Paper Topics
4 pages
Portfolio Submission
No ratings yet
Portfolio Submission
47 pages
Módulos Canadian 440Wp
No ratings yet
Módulos Canadian 440Wp
2 pages
Tech 301 To 400
No ratings yet
Tech 301 To 400
4 pages
Decision Trees
No ratings yet
Decision Trees
13 pages
Computer Assembly and Repair Lab Manual
No ratings yet
Computer Assembly and Repair Lab Manual
52 pages
Quotation For Diesel Type Generator
No ratings yet
Quotation For Diesel Type Generator
2 pages
M2 Cert Pro Dev Exam v2 - 3 Study GD PDF
No ratings yet
M2 Cert Pro Dev Exam v2 - 3 Study GD PDF
21 pages
Migration From MegaAVR To AVR DxMCU Fam DS00003731A
No ratings yet
Migration From MegaAVR To AVR DxMCU Fam DS00003731A
43 pages
Date Sheet For The BS in Computer Science 4 Year Programe 1st 2nd 3rd 69039
No ratings yet
Date Sheet For The BS in Computer Science 4 Year Programe 1st 2nd 3rd 69039
3 pages
CCMS LLD V1.1
No ratings yet
CCMS LLD V1.1
44 pages
Канада
No ratings yet
Канада
9 pages
Aukey Ep t25
No ratings yet
Aukey Ep t25
1 page
Thirteen Questions A Business Process Map Should Answer
No ratings yet
Thirteen Questions A Business Process Map Should Answer
4 pages
Conf Ospf
No ratings yet
Conf Ospf
3 pages
Roubleshooting Percona XtraDB Cluster
No ratings yet
Roubleshooting Percona XtraDB Cluster
4 pages
Upgrade Guide Build 8010 10020
No ratings yet
Upgrade Guide Build 8010 10020
3 pages
SANS Cheatsheet Google-Workspace
No ratings yet
SANS Cheatsheet Google-Workspace
1 page
Fundamentals of Machine Learning: a Simplified Approach
From Everand
Fundamentals of Machine Learning: a Simplified Approach
Er. Sudhir Goswami
No ratings yet

Lec-1 Introduction

Uploaded by

Lec-1 Introduction

Uploaded by

Machine Learning Introduction

• Overview of Supervised (regression and classification), unsupervised (clustering and dimensionality

• Today ML is being used extensively in various industries like

• ML helps in analyzing at a large scale, thus helping in making quicker

We are going from programming computers to training computers.

Machine Learning solution

Input Program Data Output

Computer Algorithm - Machine

Output Learn through experience

A computer program is said to learn from experience E with respect

Training Decision Function

• In 1995 –Support Vector Machines • 2011 – Watson wins Jeopardy

(SVMs) • 2015 - machine translation systems

• In 1997, ensembles or boosting - driven by NN - better than statistical

algorithm for classification machine translation systems

• Theoretical understanding - emerged

• Practical computer programs -

• Commercial applications - appear.

NO. SIZE COLOR SHAPE FRUIT NAME

1 Big Red Rounded shape with a depression at the top Apple

2 Small Red Heart-shaped to nearly globular Cherry

3 Big Green Long curving cylinder Banana

4 Small Green Round to oval, Bunch shape Cylindrical Grape

• The set of algorithms

• A branch of machine learning that works with

• Goal of a supervised learning model: predict the

• In this dataset, the labels are numbers.

• numerical data - is any type of data that uses

• We call this type - categorical data.

• categorical data - is any type of data that uses

• For this type of data, we have a finite set of

• classification models are the types of models that predict categorical

• Housing prices model, can return a number from many possibilities,

IDV IDV IDV IDV DV

• Medicine: predicting the expected life span of a patient or the expected

• Sales: predicting the expected amount of money a customer will spend,

• Video recommendations: predicting the expected amount of time a user

<Apple> <Big, Red, Rounded shape with a depression at the top>

<Cherry> <Small, Red, Heart-shaped to nearly globular>

NO. SIZE COLOR SHAPE FRUIT NAME

For a training input, output is known

RED COLOR GREEN

• Machine learning algorithms that

• MLA must extract as much

• dimensionality reduction algorithms The algorithms that simplify our data

• Imagine that they have no labels

• housing dataset - What can we do with this dataset?

• We can apply some clustering to this dataset.

• Genetics: clustering species into groups based on gene similarity

• Medical imaging: splitting an image into different parts to study different

• Video recommendations: dividing users into groups based on

• Density-based spatial clustering (DBSCAN): this algorithm starts grouping points

• Example: housing dataset

Imagine the features are the following:

This dataset has five columns of data.

• matrix factorization and singular value decomposition (SVD).

• Netflix use matrix factorization extensively to generate recommendations.

• Clustering is used to group data into similar clusters to extract information or

• Dimensionality reduction is a way to simplify our data, by joining certain

• Matrix factorization and singular value decomposition are other algorithms

A chess learning problem:

• Task T: recognizing and classifying

• Performance measure P: percent of

• Training experience E: database of

• Task T: driving on public four-lane highways

• Performance measure P: average distance traveled before an error

• Training experience E: sequence of images and steering commands recorded

• Rewards (+ve or –ve)

In the past few years, Deep Learning has generated much

Many breakthrough results in speech recognition, computer vision

You might also like