0% found this document useful (0 votes)

32 views24 pages

2.0 Machine Learning Introduction

The document provides an overview of Machine Learning, defining it as a process where systems improve their performance based on past experiences without explicit programming. It discusses the significance of Machine Learning in handling big data, the challenges associated with it, and different types of learning such as supervised, unsupervised, and reinforcement learning. Additionally, it covers concepts like regression and classification, highlighting their applications and importance in data analysis.

Uploaded by

vanshlakhotya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views24 pages

2.0 Machine Learning Introduction

Uploaded by

vanshlakhotya

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 24

Machine Learning

Introduction
DATA SCIENCE
Machine Learning
• Learning is any process by which a system
improves performance from past experiences.
Herbert Simon
~ Herbert Simon

• Machine Learning corresponds to computer

programs or algorithms which improve their
performance through experience without
being explicitly programmed.
Why?
• Develop systems which can automatically adapt and customize
themselves.
• Discover new knowledge from large datasets (Data mining)
• Ability to mimic human and replace monotonous tasks. Eg. OCR,
handwriting recognition.
• To develop systems that are too difficult and expensive to construct
manually.
• Speed up the innovation and analytics
• Make more sense of chaotic world around us
Why now?

• Surge of Big data in our lives : We

create new data in amounts of
petabytes everyday through our calls,
sms, chats, selfies, videos, emails.
• Increasing Computational power
(faster CPUs and GPU cores)
• Growing collaboration among
researchers in academia and industry.
• Profit margin for corporates
Challenges with Big Data
• Volume
The sheer size of data we are handling is increasing exponentially everyday
• Velocity
The rate at which new data is being gathered by our systems and sensors
• Variety
Diverse formats and sensor type result in different data point to represent
• Veracity
The data quality is really bad in most cases with little or no structure in them
Concept of learning in ML
We have a task T, performance measure P, experience E.

We say a system is learning when while performing task T, the performance

measure P improving as it goes through more experiences E.

Case in point: Task = To make better route decisions

Performance = Less travel time
Experience = Travelling through different routes and time

If the system is learning it should start making decisions about selecting routes
as it experiences more routes and time associated.

Depending on various features learning can go in either direction.

Take an example of object detection

The problem statement is to detect object present in a scene.

Now consider any natural image as one presented here. It can have
multiple features, brightness level and color difference through all
pixels it contains.

The way computer sees the data of just 4 x 4 pixel at base level:
x0011FD x0011FF x0011FD x0012FF
x0011FD x0011FF x0011ED x0011FF
x0011BD x0011FF x0011FE x0011FE
x0011ED x0011FF x0011FA x0011AB

The idea is to output the coordinates of pixels which form closest

rectangle around the object, label of the class and percentage.

If the system is learning ideally it should return correct classes with

closest coordinates around the object and with higher confidence
value.
What are the Challenges?
• To create systems which perform with accuracy and precision of
human intelligence and can leverage machine’s innate architecture to
accelerate and keep up with big data (Scales of terabytes or petabytes
of data)
• To allow for faster and better decision making through machines
wherever possible
• To convert large amounts of raw data into useful analytics
• To help forecast the future trends and correct our estimates based on
the analytic output
Terminology
Population : The population is any specific collection of objects of interest.

Sample : Sample is any subset of the population.

Measurement : A measurement is a number or attribute computed for each member of a

population or of a sample.

Parameter: Parameter or feature is a value that summarizes some aspects of the whole
population.

Inference: Any key knowledge about the data sample or population as a whole from its
attributes and properties
Features?

Color: Red
Type: Fruit
Weight: 100 gm
Price: 140/kg
Availability: Yes
Sweet: Yes
Organic: No
Types of Machine Learning
• Supervised Learning (We provide output labels/tags with input data)
• Regression/Prediction
• Classification

• Unsupervised Learning (No output labels are given)

• Reinforcement Learning (Works on reward policy feedback system)

Supervised
learning
(Labelled
Data)
We provide human annotated
data to the model as ground
truth
Unsupervis
ed learning
To infer a function that describes
the structure of "unlabeled" data
(i.e. data that has not been
classified or categorized)
Reinforcem
ent
learning
A bot/software ought to take
actions in an environment so as
to maximize some notion of
cumulative reward
Supervised vs Unsupervised
• The procedures in supervised learning are well comprehensible due
to their structure. It is possible to contrast different methods, to
parameterize and thereby find a solution that is optimal for the
application . The interpretation of the data is easier due to the given
traceability than with unsupervised learning methods.
• The disadvantage, however, is often a very high manual effort in the
preparation of the data.
• It require lot of man hours to prepare fully formatted data to work
with supervised learning.
Supervised vs Unsupervised
• The advantages of unsupervised learning are the partially fully
automated creation of models. These can produce a very good
prognosis about new data or even create new content. The model
learns with each new record and at the same time refines its
calculations and classifications. Manual intervention is no longer
necessary.
• The biggest disadvantages are there is no control over what model
learns. It can start to cluster wrong type as one group
• It can give bad results amiss the output labels and can lead to lot of
misclassifications.
Regression
• Regression refers to correlation between dependent and independent
variables.
Consider eqn of line: Y = mx + c
Constant/parameter

Independent variable

Dependent Variable Coefficient/parameter

In statistics regression is defined as a measure of the relation between the

mean value of one variable (e.g. output) and corresponding values of other
variables (e.g. time and cost).
• Regression is used to fit a
eqn through the given data
points.
• We try to approximate the
best function to fit through
the data
• The core idea it to predict
the points in future. We
assume the data
distribution will hold up and
new points will be near to
the function.
• It reduces uncertainty with
the data and helps in
correct estimation over a
period of time.
• Few Cases for Regression could be:
• Sales Prediction
• Product Pricing
• Expense overrun variables
• House pricing prediction
• Population estimates
• Pollution particulate estimates

Honestly regression finds its usage anywhere we want to find correlations:

Temperature vs. Number of cones sold at ice cream store

Inches of rain vs. new cars sold
Daily Snowfall vs. number of skier visits
Classification

• In machine learning and

statistics, classification is the
problem of identifying to
which of a set of categories
(sub-populations) a new
observation belongs, on the
basis of a training set of
data containing
observations (or instances)
whose category
membership is known.
• Use cases of Classification
• Spam Filtering
• Cancer diagnosis
• Text classification
• Sentiment analysis
• Object classification
• Face detection

Similar to Regression this technique can be generalized into any problem where
we know there are discrete classes.

Classification of people based on their eating habits:

Vegetarian, Vegan, Non-Vegetarian

ML COMPLETE (Pure Sem Ka)
No ratings yet
ML COMPLETE (Pure Sem Ka)
347 pages
21CSC305P ML - Unit 1-E
No ratings yet
21CSC305P ML - Unit 1-E
137 pages
Regression 0
No ratings yet
Regression 0
108 pages
Unit-1 - Machine Learning
No ratings yet
Unit-1 - Machine Learning
85 pages
Lecture 1 - Introduction To Machine Learning
No ratings yet
Lecture 1 - Introduction To Machine Learning
35 pages
ML 1 PPT Unit 1
No ratings yet
ML 1 PPT Unit 1
93 pages
AAI Lecture 9 SP 25
No ratings yet
AAI Lecture 9 SP 25
26 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
ML - 1 - Sovan - Introduction To ML
No ratings yet
ML - 1 - Sovan - Introduction To ML
83 pages
23ECE205 FoDS 13 Introduction To ML
No ratings yet
23ECE205 FoDS 13 Introduction To ML
41 pages
Presentation On ML
No ratings yet
Presentation On ML
469 pages
86 37 196 Mod 5
No ratings yet
86 37 196 Mod 5
52 pages
Introduction 1175
No ratings yet
Introduction 1175
58 pages
Ch3-Machine Learning
No ratings yet
Ch3-Machine Learning
124 pages
Chapter 01 Introduction To ML
No ratings yet
Chapter 01 Introduction To ML
178 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
68 pages
ML Introduction
No ratings yet
ML Introduction
76 pages
AI Unit4 Learning Dd83e0ee 7d19 48c7 Bc5d B39decf3b0fc
No ratings yet
AI Unit4 Learning Dd83e0ee 7d19 48c7 Bc5d B39decf3b0fc
19 pages
Introduction To ML - MCA - 2023
No ratings yet
Introduction To ML - MCA - 2023
30 pages
Chapter 01 Introduction To Machine Learning
No ratings yet
Chapter 01 Introduction To Machine Learning
59 pages
Ai Chapter 5
No ratings yet
Ai Chapter 5
45 pages
Unit5 ML Introduction
No ratings yet
Unit5 ML Introduction
32 pages
Substitution Rules and Exits in SAP
100% (2)
Substitution Rules and Exits in SAP
12 pages
MachineLearning Jan2nd
100% (2)
MachineLearning Jan2nd
171 pages
Module 1
No ratings yet
Module 1
50 pages
Chapter 2
No ratings yet
Chapter 2
35 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
15 pages
Cs 550: Machine Learning: Çiğdem Gündüz Demir
No ratings yet
Cs 550: Machine Learning: Çiğdem Gündüz Demir
14 pages
Unit 1
No ratings yet
Unit 1
24 pages
Chapter 1 Introduction To Machine Learning
No ratings yet
Chapter 1 Introduction To Machine Learning
29 pages
Classification
No ratings yet
Classification
53 pages
Week 12 Intro To DS and ML
No ratings yet
Week 12 Intro To DS and ML
67 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
5 Le
No ratings yet
5 Le
36 pages
Week 4 - Intro To ML
No ratings yet
Week 4 - Intro To ML
37 pages
CHP 1
No ratings yet
CHP 1
47 pages
4 Ai ML - 2
No ratings yet
4 Ai ML - 2
31 pages
Module 1 PPT
No ratings yet
Module 1 PPT
122 pages
Selection of Pipe Repair Methods DOT Project 359
100% (2)
Selection of Pipe Repair Methods DOT Project 359
174 pages
DS - NLP
No ratings yet
DS - NLP
39 pages
Unit IV - Learning
No ratings yet
Unit IV - Learning
18 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
Lecture 1
No ratings yet
Lecture 1
47 pages
Intro To Machine Learning
No ratings yet
Intro To Machine Learning
25 pages
Lect3 Machine Learning
No ratings yet
Lect3 Machine Learning
27 pages
Intro - Types of Machine Learning
No ratings yet
Intro - Types of Machine Learning
24 pages
ML 1 2 3
No ratings yet
ML 1 2 3
54 pages
Curso Vista Cuarta Parte
No ratings yet
Curso Vista Cuarta Parte
48 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
Lecture Notes 1 2 Intro Python
No ratings yet
Lecture Notes 1 2 Intro Python
13 pages
Chapter Introduction
No ratings yet
Chapter Introduction
7 pages
AI Lab6
No ratings yet
AI Lab6
7 pages
Machine Learning Notes
100% (3)
Machine Learning Notes
134 pages
DataScience Unit1 (+notes)
No ratings yet
DataScience Unit1 (+notes)
56 pages
MIT - Machine Learning Notes From Chapter 1 - 14 PDF
No ratings yet
MIT - Machine Learning Notes From Chapter 1 - 14 PDF
101 pages
Concept Learning
No ratings yet
Concept Learning
85 pages
Machine Learning
No ratings yet
Machine Learning
51 pages
Machine Learning
No ratings yet
Machine Learning
9 pages
Mindray LabXpert Software Operator's Manual V50 E 250526 162922
No ratings yet
Mindray LabXpert Software Operator's Manual V50 E 250526 162922
228 pages
Lecture 2
No ratings yet
Lecture 2
22 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
BR100 Warehouse Management System Application Setup V1.6
No ratings yet
BR100 Warehouse Management System Application Setup V1.6
94 pages
E560 IEC61850 Sub
No ratings yet
E560 IEC61850 Sub
42 pages
Engineering Data Users Guide
No ratings yet
Engineering Data Users Guide
72 pages
This Story Paraphrased From A Post On 9/4/12
No ratings yet
This Story Paraphrased From A Post On 9/4/12
7 pages
Chi Square Notes
No ratings yet
Chi Square Notes
5 pages
AGB Unit
No ratings yet
AGB Unit
63 pages
Manual de Operacion de Lactoscan SL
No ratings yet
Manual de Operacion de Lactoscan SL
47 pages
Statistics Beginners Guide
No ratings yet
Statistics Beginners Guide
42 pages
Sem With Amos I PDF
100% (1)
Sem With Amos I PDF
68 pages
Binomial Poisson Normal Distribution
No ratings yet
Binomial Poisson Normal Distribution
9 pages
Barton-TBM Tunnelling in Sheared and Fractured Rock Masses. Cartagena, Colombia
No ratings yet
Barton-TBM Tunnelling in Sheared and Fractured Rock Masses. Cartagena, Colombia
36 pages
Chapter 2 - Estimation PDF
No ratings yet
Chapter 2 - Estimation PDF
25 pages
Spatial Autocorrelation
No ratings yet
Spatial Autocorrelation
10 pages
Full Download Accelerated Life Models Modeling and Statistical Analysis 1st Edition Vilijandas Bagdonavicius PDF
100% (7)
Full Download Accelerated Life Models Modeling and Statistical Analysis 1st Edition Vilijandas Bagdonavicius PDF
72 pages
Adehabitat HR
No ratings yet
Adehabitat HR
60 pages
Imp - Maximum Likelihood Estimation - STAT 414 - 415
No ratings yet
Imp - Maximum Likelihood Estimation - STAT 414 - 415
8 pages
Topic 2 Estimation
No ratings yet
Topic 2 Estimation
56 pages
Analysis and Optimization For The Process of Glass
No ratings yet
Analysis and Optimization For The Process of Glass
8 pages
Huawei HO Parameters
No ratings yet
Huawei HO Parameters
9 pages
The Fragmentation Energy-Fan Model in Quarry Blast
No ratings yet
The Fragmentation Energy-Fan Model in Quarry Blast
16 pages
Planning, Motivation, and Evaluation in Orientation To The Future: Latent Structure Analysis
No ratings yet
Planning, Motivation, and Evaluation in Orientation To The Future: Latent Structure Analysis
8 pages
How To Interpret Backtest Results
No ratings yet
How To Interpret Backtest Results
7 pages
Module 4 Statistical Inference Estimation of Parameters Lesson 1 Estimation of Parameters Part 1.p
No ratings yet
Module 4 Statistical Inference Estimation of Parameters Lesson 1 Estimation of Parameters Part 1.p
5 pages
Clarifying The Underlying and Fundamental Meaning of The Approximate Linear Inversion of Seismic Data
No ratings yet
Clarifying The Underlying and Fundamental Meaning of The Approximate Linear Inversion of Seismic Data
13 pages
A Program For Simulating Air-Launched Missiles
No ratings yet
A Program For Simulating Air-Launched Missiles
10 pages
Journal of Petroleum Science and Engineering: Mihály Dobróka, Norbert Péter Szabó
No ratings yet
Journal of Petroleum Science and Engineering: Mihály Dobróka, Norbert Péter Szabó
9 pages
X n θ by less than any arbitrary constant c > 0. Also using Chebyshev's theorem, we see > 0
No ratings yet
X n θ by less than any arbitrary constant c > 0. Also using Chebyshev's theorem, we see > 0
2 pages
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
From Everand
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
Elaine Tate
No ratings yet
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet

2.0 Machine Learning Introduction

Uploaded by

2.0 Machine Learning Introduction

Uploaded by

Machine Learning

• Machine Learning corresponds to computer

• Surge of Big data in our lives : We

We say a system is learning when while performing task T, the performance

Case in point: Task = To make better route decisions

Depending on various features learning can go in either direction.

The problem statement is to detect object present in a scene.

The idea is to output the coordinates of pixels which form closest

If the system is learning ideally it should return correct classes with

Sample : Sample is any subset of the population.

Measurement : A measurement is a number or attribute computed for each member of a

• Unsupervised Learning (No output labels are given)

• Reinforcement Learning (Works on reward policy feedback system)

Dependent Variable Coefficient/parameter

In statistics regression is defined as a measure of the relation between the

Honestly regression finds its usage anywhere we want to find correlations:

Temperature vs. Number of cones sold at ice cream store

• In machine learning and

Classification of people based on their eating habits:

You might also like