0% found this document useful (0 votes)

21 views43 pages

Intro ML 1 Day

The document provides an overview of machine learning including definitions of key concepts like supervised and unsupervised learning. It also discusses machine learning models and algorithms like linear regression and decision trees. Types of errors in machine learning like bias, variance and overfitting are also explained.

Uploaded by

ravinyse

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views43 pages

Intro ML 1 Day

Uploaded by

ravinyse

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 43

Machine Learning

Written by Abhishek kaushik (Abhi)

~ Material is for educational purpose for the specific audience.

Distribution in any form is not allowed.
OVERVIEW
● About Me
● Data Incorporate
AI
● Data Description
Techniques
● Types of Learning
● ML Models
● Workflow Models
● Errors
● My Workflow
● Evaluation This Photo by Unknown Author is licensed
under CC BY-NC-ND

● Application
● Conclusion
ABOUT ME
● Qualification
PhD track (Ireland), M.Sc. (Germany) and
Bachelor of Technology (India)
● Teaching Experience & Industrial Experience
Dublin Business School
Dublin City University And TU Dublin
National College of Ireland And American
College Dublin
Adapt Centre AI Labs (Ireland), EagleBurgmann
(Germany), Siemens AG (Germany) and
Consulting in start-up & TSYS (India)
● Research Interest
Information retrieval , Information seeking
behaviour, Chatbots, Machine learning, Deep
Learning and Conversational Information
retrieval
● Supervision
Exchange Interns (France), ICT and Masters
Dissertation
MY DBS PROFILE
● Subjects
Big Data Visualisation
Research Methods in Computing
Research Methods in FinTech
Machine Learning
Research methods Anaytics
● 7 Students represented DBS in IPRC conference under my supervision.
● Supervision (Master Dissertation & ICT)
50 (Completed) + 5 (In Progress)
● Paper Publication
4 Published
1 Accepted
2 In Progress
Paper Published
List of Paper Published (with DBS Students)

● Kaur, G., Kaushik, A. and Sharma, S., 2019. Cooking Is Creating Emotion: A
Study on Hinglish Sentiments of Youtube Cookery Channels Using
Semi-Supervised Approach. Big Data and Cognitive Computing, 3(3), p.37.
● Das, J., Sharma, S. and Kaushik, A., 2019. Views of Irish Farmers on Smart
Farming Technologies: An Observational Study. AgriEngineering, 1(2),
pp.164-187.
● Nair, S., Kaushik, A. and Dhoot, H., 2019. Conceptual framework of a
skill-based interactive employee engaging system: In the Context of
Upskilling the present IT organization. Applied Computing and Informatics.
● Ajumi, O., & Kaushik, A.,2019. Exchange Rates Prediction via Deep Learning
and Machine Learning : A Literature Survey on Currency Forecasting.
● Sentiment Analysis on Google Play Store Data using Deep Learning Accepted
in Springer (2019)
Evaluation
• Two assignments (30% each)
– Handed out on weeks 4 and 8
– Due two weeks later
– Main Exam (40%)
– Mix of:
• Implementing machine learning algorithms
• Applying them to real datasets
• Exercises
Source Materials

● Material provided my me
● Material provided in The class by Online Instructor
● Bishop, Christopher M. Pattern recognition and machine learning.
springer, 2006.
● Witten, Ian H., et al. Data Mining: Practical machine learning tools and
techniques. Morgan Kaufmann, 2016.
● Zhang, Cha, and Yunqian Ma, eds. Ensemble machine learning:
methods and applications. Springer Science & Business Media, 2012.
● Brownlee, Jason. "Machine learning mastery." URL:
http://machinelearningmastery.
com/discover-feature-engineering-howtoengineer-features-and-how-to-
getgood-at-it (2014).
A Few Quotes
DATA INCORPORATE AI
● Artificial Intelligence (AI)
Reproducing human intelligence
in machines, especially computer
systems through learning ,
reasoning and self-correction

● Machine Learning (ML)

Machine learning(ML) is a set of
statistical tools to learn from
data.
e.g. Model = Algorithm (Data)
------------(1) Source: Facebook Developer
Circles Lagos
Output = Model (New Data)
----------(2)
● Deep learning (DL)
Data goes through multiple number
of non-linear transformations to
obtain an output
● Data Science (DS)
Data science has an intersection
with artificial intelligence but
is not a subset of artificial
intelligence. Processing data,
analyzing and visualizing this
data, so as to make meaning out of
it for business strategies.

This Photo by Unknown Author is licensed

under CC BY-SA

DATA INCORPORATE AI
Data Computer Output
Program

Data
Computer Program
Output
Magic?

•
•
•
•
Sample Applications
ML in a Nutshell
Representation

• .
Evaluation
Optimization
Types of Learning
Inductive Learning
•
•
What We’may ll Cover*

* it may varies time to time depending upon the external

factors
MACHINE LEARNING (1)
● Machine learning is subdivided into three major parts
● Supervised
All data is labelled and the algorithm need to predict
the output from the input data such as Regression and
Classification
● Unsupervised
All data is unlabelled and the algorithm learns to
inherit structure from the input data such as
clustering and Associations
● Semi-Supervised
Some data is labelled but most of them are unlabelled
and a mixture of supervised and unsupervised
techniques can be used
MACHINE LEARNING (2) Featur Output/labelle
es d data
● Features vectors
or attribute
● Output value or
predictions or
Labelled data

This Photo by Unknown Author is licensed

under CC BY-SA
ML in Practice
DATA ANALYSIS TECHNIQUES (1)

There are major six data analysis techniques

● Descriptive
Describes a set of data
● Exploratory
An approach to analyzing data sets
to find previously unknown relationships.
● Inferential
Aims to test theories about the nature of
the world in general
DATA ANALYSIS TECHNIQUES (2)

● Predictive
Analyze current and historical facts to
make predictions about future events
● Causal
To find out what happens to one variable
when you change another.
● Mechanistic
Understand the exact changes in variables
that lead to changes in other variables for
individual objects.
MODELS
● Machine learning Models` are Parametric and Non Parametric
● Parametric Models
It summarizes the data with the set variables of fixed
size
Independent of number of training example
Y = MX + C ------------(3) where X is
Input variable, Y is Output predicted and C is Bias
Such as Logistic regression and Perceptron
● Non-Parametric Models
Don’t make the strong assumptions about mapping the
functions
Free to form any functional form
Such as Decision Tree and Support Vector Machine
● Benefits
Simpler (easier to
understand)
Speed (fast in Processing)

BENEFITS Less data (require less

data for training)
AND ● Limitations

LIMITATIONS Constrained functional form

(limited to specific
(PARAMETRIC functional form)

)
Limited Complexity (method
are more suited to simpler
problem)
Poor fit (In practise the
methods are unlikely to
match the mapping
functions)
● Benefits
Flexible (capable into
fitting into large data
set),

BENEFITS AND Power (no assumption

needed)

LIMITATIONS Performance (higher

performance on complex
(NON-PARAME model)
● Limitations
TRIC) More data,
Slower
Overfitting
● 200 year old method
● Model with Linear relationship
with input and predicted
variables
● Y = B0 + B1* x -------------(4)
where Y is predicted value, B0,
B1 are coefficient and x is
input variable or plane
● Linear Transformation
● Remove Noise
● Remove Collinearity
● Gaussian Distributions
This Photo by Unknown Author is
● Rescale inputs licensed under CC BY

LINEAR REGRESSION (PARAMETRIC)

CART OR DECISION TREE
(NON-PARAMETRIC)
● Know as
Classification
and Regression
tree
● Introduced by
Leo Breiman
● Ginni Index
method to split
● Greedy methods
● Pruning effect
This Photo by Unknown Author is licensed
under CC BY-SA
This Photo by Unknown Author is licensed
under CC BY-SA

WORKFLOW OF MACHINE LEARNING

ERRORS (1)

● Bias Error
Assumptions made by the Model to make
the target function easier to learn
● Variance Error
It is the amount to estimate the target
function with change in different
training data
● Irreducible Error
It can’t be reduce regardless of what
algorithm is used such as error caused
by unknow variables
ERRORS (2)

● Overfitting
Training data learn well but testing
data predict poorly
More with Non-parametric Algorithm
Remedy is to features selections
Cross Validation and Hold back
Validation dataset
● Underfitting
Failing to learn from the train data
Remedy is to try alternate algorithm
MY ML FLOWCHART
Text Cleaning
Splitting the Data (70% Training data and 30% testing data)
Implement Cross validation on training data using multiple algorithms
Variations in Parameters to study the effect of bias and variance
Choose the best classifier or regression model
Retrain the Model on 70% data
Validation test on testing data
Identify the underfitting and overfitting
Retrain the model on whole data set
Save the Model and build the API over it
Classification Accuracy

Logarithmic Loss

Confusion Matrix

Area under Curve

EVALUATION
F1 Score

Mean Absolute Error

Mean Squared Error

APPLICATION
● Chatbots
● Facial recognition
● Image tagging
This Photo by Unknown Author is licensed
under CC BY-NC

● Machine translation
● Sales prediction
● Self-driving cars
● Sentiment analysis
● Data is very powerful
● Patterns talk about the
personality
● ML and DL is having high
potential

CONCLUSION ● Understanding the Data and

Algorithm is the state of
Art
● Big Data needs ML
● Exploratory data Analysis
is a must for ML Scientist
THANK
YOU!!!!!!!!!
QUESTIONS?
APPENDIX
EVALUATION (1)
● Classification Accuracy

● Log Loss

● Confusion metrics
EVALUATION (2)
● Area under Curve

● F1 Score
EVALUATION (3)

Machine Learning?
100% (2)
Machine Learning?
114 pages
Data Management and Data Transformation, Introduction To Machine Learning
No ratings yet
Data Management and Data Transformation, Introduction To Machine Learning
54 pages
Engineer Being Machine Learning Notes
No ratings yet
Engineer Being Machine Learning Notes
95 pages
Roboguide Training Manual (FRDE) (Z KAE TRN Roboguide 1 01 En)
93% (30)
Roboguide Training Manual (FRDE) (Z KAE TRN Roboguide 1 01 En)
95 pages
K-Lite 3 & 5 Service Manual
100% (4)
K-Lite 3 & 5 Service Manual
24 pages
ML 1 PPT Unit 1
No ratings yet
ML 1 PPT Unit 1
93 pages
Global Superstore 2016
No ratings yet
Global Superstore 2016
6,865 pages
Fundamentals of Machine Learning: a Simplified Approach
From Everand
Fundamentals of Machine Learning: a Simplified Approach
Er. Sudhir Goswami
No ratings yet
Module 1 ML Mumbai University
No ratings yet
Module 1 ML Mumbai University
47 pages
Machine Learning: Sri Chandrasekharendra Saraswathi Viswa Mahavidyalaya
No ratings yet
Machine Learning: Sri Chandrasekharendra Saraswathi Viswa Mahavidyalaya
333 pages
Artificial Intelligence - Machine Learning Fundamentals
No ratings yet
Artificial Intelligence - Machine Learning Fundamentals
31 pages
Engineer Being Machine Learning Notes
No ratings yet
Engineer Being Machine Learning Notes
95 pages
ML Revision
No ratings yet
ML Revision
207 pages
DATA 2024 - Dist
No ratings yet
DATA 2024 - Dist
72 pages
1-Introduction To Machine Learning
No ratings yet
1-Introduction To Machine Learning
61 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
138 pages
ML Unit-1
No ratings yet
ML Unit-1
139 pages
EE653 OpenDSS Tutorial and Cases
100% (1)
EE653 OpenDSS Tutorial and Cases
403 pages
Engineer Being Machine Learning Notes
No ratings yet
Engineer Being Machine Learning Notes
95 pages
360DigiTmg E Book Data Science
100% (1)
360DigiTmg E Book Data Science
168 pages
Machine Learning
No ratings yet
Machine Learning
31 pages
Calculate Cable Trunking Size
100% (1)
Calculate Cable Trunking Size
2 pages
Social Media Analytics Techniques
No ratings yet
Social Media Analytics Techniques
77 pages
Basic Concepts of Machine Learning For Beginners 1732109263
No ratings yet
Basic Concepts of Machine Learning For Beginners 1732109263
102 pages
Machine: Learning ATO Z - I
No ratings yet
Machine: Learning ATO Z - I
131 pages
360DigiTMG Practical Data Science New
100% (1)
360DigiTMG Practical Data Science New
168 pages
AI and ML For Business Antim Prahar WITH ANSWERS
No ratings yet
AI and ML For Business Antim Prahar WITH ANSWERS
26 pages
DSF - UNIT III Notes
No ratings yet
DSF - UNIT III Notes
17 pages
Lecture 1 Machine Learning
No ratings yet
Lecture 1 Machine Learning
22 pages
Gradient Descent Algorithm
No ratings yet
Gradient Descent Algorithm
5 pages
Big Data Lecture # 08
No ratings yet
Big Data Lecture # 08
21 pages
Machine Learning
No ratings yet
Machine Learning
57 pages
Construction Specifications Writing Principles
No ratings yet
Construction Specifications Writing Principles
8 pages
July4 SaketAnand FriendlyIntroToML
No ratings yet
July4 SaketAnand FriendlyIntroToML
84 pages
Study Notes - Lesson 1 - 7 PDF
No ratings yet
Study Notes - Lesson 1 - 7 PDF
25 pages
Week 12 Intro To DS and ML
No ratings yet
Week 12 Intro To DS and ML
67 pages
Aiya Session 4
No ratings yet
Aiya Session 4
42 pages
Chapter 02 Overview - 4
No ratings yet
Chapter 02 Overview - 4
43 pages
Simatic S5: S5EPROM For USB Prommer
No ratings yet
Simatic S5: S5EPROM For USB Prommer
10 pages
Lecture 2
No ratings yet
Lecture 2
36 pages
Module 1 ML
No ratings yet
Module 1 ML
51 pages
AI Unit 1
No ratings yet
AI Unit 1
30 pages
Machine Learning
No ratings yet
Machine Learning
42 pages
Machine Learning
No ratings yet
Machine Learning
30 pages
Air Quality Prediction Using Machine Learning
No ratings yet
Air Quality Prediction Using Machine Learning
29 pages
Machine Learning - Introduction
No ratings yet
Machine Learning - Introduction
59 pages
Module 2
No ratings yet
Module 2
73 pages
UML Class Diagram Examples of Common Scenarios - EdrawMax
No ratings yet
UML Class Diagram Examples of Common Scenarios - EdrawMax
12 pages
Notes Unit 1-3 Part-II
No ratings yet
Notes Unit 1-3 Part-II
20 pages
Internatiional Financial Management: Unit I
No ratings yet
Internatiional Financial Management: Unit I
51 pages
Ass Bigd
No ratings yet
Ass Bigd
9 pages
Lecture - 2 Classification (Machine Learning Basic and KNN)
No ratings yet
Lecture - 2 Classification (Machine Learning Basic and KNN)
90 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
Chapter 01 Machine Learning
No ratings yet
Chapter 01 Machine Learning
22 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
Machine Learning For Data Science Unit-4
No ratings yet
Machine Learning For Data Science Unit-4
16 pages
Fam Question Bank CT
No ratings yet
Fam Question Bank CT
14 pages
Artificial Intelligence in Metallurgical Engineering
No ratings yet
Artificial Intelligence in Metallurgical Engineering
12 pages
BWMS-C20 A-10776
No ratings yet
BWMS-C20 A-10776
3 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
13 pages
Advance ML - Unit 1
No ratings yet
Advance ML - Unit 1
12 pages
Lecture Notes 1 2 Intro Python
No ratings yet
Lecture Notes 1 2 Intro Python
13 pages
Chapter 07
No ratings yet
Chapter 07
47 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Finance With Python and MPT
100% (1)
Finance With Python and MPT
31 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
Machine Learning1
100% (1)
Machine Learning1
11 pages
Assign
100% (1)
Assign
11 pages
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
ML Unit1
No ratings yet
ML Unit1
25 pages
Module - 1
No ratings yet
Module - 1
9 pages
Machine Learning - Course
No ratings yet
Machine Learning - Course
6 pages
Machine Learning in Unit-1
No ratings yet
Machine Learning in Unit-1
10 pages
Unit 3 - DS - 1st Year
No ratings yet
Unit 3 - DS - 1st Year
5 pages
From Field Problems To Machine Learning
No ratings yet
From Field Problems To Machine Learning
51 pages
Programmes Offered by Ksou: A Under Graduate Programmes - (05) Sl. No. Proogrammes Duration of The Programme
No ratings yet
Programmes Offered by Ksou: A Under Graduate Programmes - (05) Sl. No. Proogrammes Duration of The Programme
3 pages
Project - Scheduling Wild Air
No ratings yet
Project - Scheduling Wild Air
88 pages
Air Quality UCI
No ratings yet
Air Quality UCI
540 pages
iSTAR Ultra LT QSG 8200 1335 01 D0 en
No ratings yet
iSTAR Ultra LT QSG 8200 1335 01 D0 en
27 pages
Bag of Words
No ratings yet
Bag of Words
32 pages
Computer Organization and Architecture
No ratings yet
Computer Organization and Architecture
12 pages
All About Stock Market - Read It
No ratings yet
All About Stock Market - Read It
53 pages
Testing Class
No ratings yet
Testing Class
10 pages
Account Statement From 1 Jan 2020 To 13 Aug 2020: TXN Date Value Date Description Ref No./Cheque No. Debit Credit Balance
No ratings yet
Account Statement From 1 Jan 2020 To 13 Aug 2020: TXN Date Value Date Description Ref No./Cheque No. Debit Credit Balance
6 pages
Farhan Sir PDF
No ratings yet
Farhan Sir PDF
17 pages
M-Audio Profire 2626: Audio & MIDI Interface (Mac/PC)
No ratings yet
M-Audio Profire 2626: Audio & MIDI Interface (Mac/PC)
3 pages
机械工程作业帮助
100% (2)
机械工程作业帮助
4 pages
Relational Databases
No ratings yet
Relational Databases
88 pages
Get Dinosaurs The Textbook Spencer G. Lucas PDF Ebook With Full Chapters Now
100% (1)
Get Dinosaurs The Textbook Spencer G. Lucas PDF Ebook With Full Chapters Now
54 pages
Plagiarism - Report
No ratings yet
Plagiarism - Report
49 pages
Concise Introduction To Linguistics 5th A Rowe Bruce M Levine Diane P Instant Download
No ratings yet
Concise Introduction To Linguistics 5th A Rowe Bruce M Levine Diane P Instant Download
13 pages
Hep 0077
No ratings yet
Hep 0077
56 pages
Weighted Average Cost of Capital (WACC) - 2017 Value Weight Required Rate of Return
No ratings yet
Weighted Average Cost of Capital (WACC) - 2017 Value Weight Required Rate of Return
4 pages
Value Weight Required Rate of Return
No ratings yet
Value Weight Required Rate of Return
3 pages
Banna Leisure 111
No ratings yet
Banna Leisure 111
2 pages
Ann Oil Gas Industry Review
No ratings yet
Ann Oil Gas Industry Review
12 pages
Module 5 Assignment 1
No ratings yet
Module 5 Assignment 1
3 pages
4 - Designing Embedded Networking Applications
No ratings yet
4 - Designing Embedded Networking Applications
23 pages
ENGLISH 3RD TERM FINAL EXAM 2024 GRADE 2 (Salvo Automaticamente)
No ratings yet
ENGLISH 3RD TERM FINAL EXAM 2024 GRADE 2 (Salvo Automaticamente)
7 pages
CV AlessandroOliveira
No ratings yet
CV AlessandroOliveira
7 pages
Selling PointCompetitive Edge - 电子白板
No ratings yet
Selling PointCompetitive Edge - 电子白板
5 pages
Boston House Price Prediction
No ratings yet
Boston House Price Prediction
5 pages
Dbms Practical Question
No ratings yet
Dbms Practical Question
4 pages
Behind A Crypto Scam Case
No ratings yet
Behind A Crypto Scam Case
18 pages
Display LUMAscape
No ratings yet
Display LUMAscape
1 page