Machine Learning
By
Salman Sadullah Usmani
Apr 11, 2024
Quotes about ML
Just as electricity transformed almost
everything 100 years ago, today I actually
have a hard time thinking of an industry that I
don’t think AI (Artificial Intelligence) will
transform in the next several years
Andrew Ng
Artificial intelligence would be the ultimate version of Google.
The ultimate search engine that would understand everything on
the web. It would understand exactly what you wanted, and it
would give you the right thing. We’re nowhere near doing that
now. However, we can get incrementally closer to that, and that
is basically what we work on.
Larry Page
Artificial Intelligence, deep learning, machine
learning — whatever you’re doing if you don’t
understand it — learn it. Because otherwise you’re
going to be a dinosaur within 3 years
Mark Cuban
AI vs ML vs DL
Definition of ML
• Machine learning is an application of artificial intelligence (AI) that provides systems the
ability to automatically learn and improve from experience without being explicitly
programmed. Machine learning focuses on the development of computer programs that
can access data and use it to learn for themselves.
• Data is cheap and abundant (data warehouses, data marts); knowledge is expensive and
scarce.
• Example: People who bought “Da Vinci Code” also bought “The Five People You Meet in Heaven”
(www.amazon.com)
• DeepMind: Healthcare
Types of ML algorithms
• Supervised Learning
Classification
Regression
• Unsupervised Learning
• Reinforcement Learning
Unsupervised Learning
• Unsupervised learning uses machine learning algorithms to analyze and cluster unlabeled data
sets.
• These algorithms discover hidden patterns in data without the need for human intervention (hence, they are
“unsupervised”)
• Clustering
• Association
• Dimensionality reduction
Clustering
• Clustering is a data mining technique for grouping unlabeled data based on their
similarities or differences.
• For example, K-means clustering algorithms assign similar data points into groups,
where the K value specifies the number of groups and thus the granularity of the clustering.
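To make the K-means idea concrete, here is a minimal sketch using scikit-learn; the two-blob toy data and the choice K = 2 are assumptions made purely for illustration.

```python
import numpy as np
from sklearn.cluster import KMeans

# Toy 2-D data: two loose blobs (made up for illustration).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (50, 2)),    # blob around (0, 0)
               rng.normal(5, 1, (50, 2))])   # blob around (5, 5)

# K = 2: ask for two clusters; K controls the granularity of the grouping.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)

print(kmeans.cluster_centers_)   # one centroid per cluster
print(kmeans.labels_[:10])       # cluster assignments for the first 10 points
```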
Association
• Basket analysis:
P (Y | X ), the probability that somebody who buys X also buys Y, where X and Y are products/services.
Dimensionality Reduction
• It reduces the number of data inputs to a manageable size while also preserving the data
integrity.
• Often, this technique is used in the data pre-processing stage, such as when autoencoders
remove noise from visual data to improve picture quality.
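The slide mentions autoencoders; as a simpler classical sketch of the same idea, here is PCA from scikit-learn reducing ten correlated inputs to two components (the synthetic data and the choice of PCA are illustrative assumptions, not the slide's exact method).

```python
import numpy as np
from sklearn.decomposition import PCA

# 100 samples with 10 correlated features driven by 2 latent factors (synthetic).
rng = np.random.default_rng(1)
base = rng.normal(size=(100, 2))
X = base @ rng.normal(size=(2, 10)) + 0.05 * rng.normal(size=(100, 10))

# Project the 10 inputs down to 2 components while preserving most variance.
pca = PCA(n_components=2).fit(X)
X_reduced = pca.transform(X)

print(X_reduced.shape)                 # (100, 2)
print(pca.explained_variance_ratio_)   # share of variance kept per component
```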
Supervised Learning
• A machine learning approach that’s defined by its use of labeled datasets.
• These datasets are designed to train or “supervise” algorithms into classifying data or predicting
outcomes accurately.
• Using labeled inputs and outputs, the model can measure its accuracy and learn over time.
• Supervised learning can be separated into two types of problems when data mining:
classification and regression.
Supervised vs Unsupervised
Classification
• Classification problems use an algorithm to accurately assign test data into specific categories,
such as separating apples from oranges.
• Linear classifiers, support vector machines, decision trees and random forest are all common
types of classification algorithms.
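A minimal sketch of one of the listed classifiers, a decision tree, trained on scikit-learn's built-in iris dataset; the dataset and split are arbitrary choices for illustration.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# Labeled dataset: flower measurements (inputs) and species (categories).
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

# Fit a decision tree, one of the common classifiers listed above.
clf = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

# Held-out test data measures how well the model assigns categories.
print(accuracy_score(y_test, clf.predict(X_test)))
```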
Classification
• Face recognition: must handle pose, lighting, occlusion (glasses, beard), make-up, and hair style.
• Sensor fusion: combine multiple modalities, e.g., visual (lip image) and acoustic signals, for
speech recognition.
Regression
• Regression models are helpful for predicting numerical values based on different data points,
such as sales revenue projections for a given business.
• Some popular regression algorithms are linear regression, logistic regression and polynomial
regression (despite its name, logistic regression is most often used for classification).
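A minimal linear-regression sketch on synthetic numeric data; the underlying relationship y ≈ 3x + 2 and the noise level are made up for illustration.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Synthetic data: y is roughly 3x + 2 plus noise (made up for illustration).
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 1))
y = 3 * X[:, 0] + 2 + rng.normal(0, 1, size=100)

model = LinearRegression().fit(X, y)
print(model.coef_, model.intercept_)   # should be close to 3 and 2

# Predict a numerical value for a new data point.
print(model.predict([[4.0]]))          # roughly 3*4 + 2 = 14
```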
Classification vs Clustering
Reinforcement Learning
• Reinforcement Learning is a part of machine learning. Here, agents are self-trained on reward
and punishment mechanisms.
• It’s about taking the best possible action or path to gain maximum rewards and minimum
punishment through observations in a specific situation. It acts as a signal to positive and
negative behaviors.
• Through a series of Trial and Error methods, an agent keeps learning continuously in an
interactive environment from its own actions and experiences. The only goal of it is to find a
suitable action model which would increase the total cumulative reward of the agent. It learns via
interaction and feedback.
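Here is a minimal tabular Q-learning sketch on a made-up one-dimensional corridor (states 0 to 4, reward only at the right end); the environment, hyperparameters, and episode count are all simplifying assumptions, not a standard benchmark.

```python
import numpy as np

# Corridor: states 0..4; action 0 = left, 1 = right; reward 1 at state 4.
n_states, n_actions = 5, 2
Q = np.zeros((n_states, n_actions))
alpha, gamma, epsilon = 0.1, 0.9, 0.2   # learning rate, discount, exploration

rng = np.random.default_rng(0)
for episode in range(500):
    s = 0
    while s != 4:                        # episode ends at the goal state
        # Trial and error: mostly exploit the current estimate, sometimes explore.
        a = rng.integers(n_actions) if rng.random() < epsilon else Q[s].argmax()
        s_next = max(0, s - 1) if a == 0 else s + 1
        r = 1.0 if s_next == 4 else 0.0  # reward signal from the environment
        # Update the action-value estimate toward reward plus discounted future value.
        Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])
        s = s_next

print(Q.argmax(axis=1)[:4])   # learned policy: action 1 (right) in every non-terminal state
```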
Workflow of the Machine Learning Process
Data preprocessing
Feature Selection
Resampling Techniques
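To make the workflow steps above concrete, here is a rough sketch of preprocessing, feature selection, and resampling chained together in a scikit-learn pipeline; the dataset, scaler, k-best selection, and 5-fold cross-validation are all illustrative assumptions.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_breast_cancer(return_X_y=True)

pipe = Pipeline([
    ("scale", StandardScaler()),               # data preprocessing
    ("select", SelectKBest(f_classif, k=10)),  # feature selection
    ("model", LogisticRegression(max_iter=1000)),
])

# Resampling: 5-fold cross-validation re-fits the whole pipeline on each split.
scores = cross_val_score(pipe, X, y, cv=5)
print(scores.mean())
```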
Evaluation Metrics
Importance: Evaluation metrics help measure the performance and
effectiveness of machine learning models. They provide a way to assess
how well the model performs on unseen data and guide improvements.
1. Accuracy
•Definition: The proportion of correct predictions out of total
predictions.
•Formula: (True Positives + True Negatives) / (True Positives + True
Negatives + False Positives + False Negatives)
•Example: If you classify 100 instances and correctly predict 90 of
them, the accuracy is 90%.
2. Precision
•Definition: The proportion of true positive predictions out of total
positive predictions made by the model.
•Formula: True Positives / (True Positives + False Positives)
•Example: In a binary classification task, if you predict 10 positives and
8 are true positives, precision is 80%.
3. Recall (Sensitivity)
•Definition: The proportion of actual positive instances that the model correctly identifies.
•Formula: True Positives / (True Positives + False Negatives)
•Example: In a medical diagnosis, if there are 50 sick patients and the model correctly identifies 45, recall is 90%.
4. F1-Score
•Definition: Harmonic mean of precision and recall. A balance between precision and recall.
•Formula: 2 * (Precision * Recall) / (Precision + Recall)
•Example: If precision is 80% and recall is 90%, the F1-score is about 84.7% (slightly below the arithmetic mean of 85%, as the harmonic mean penalizes imbalance).
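A sketch computing items 1-4 (accuracy, precision, recall, F1) with scikit-learn; the label vectors are made up so the counts can be checked by hand.

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

# Hypothetical binary labels: 1 = positive, 0 = negative.
# TP = 3, FN = 1, FP = 1, TN = 5 for these vectors.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

print(accuracy_score(y_true, y_pred))    # (TP + TN) / total = 8/10 = 0.8
print(precision_score(y_true, y_pred))   # TP / (TP + FP) = 3/4 = 0.75
print(recall_score(y_true, y_pred))      # TP / (TP + FN) = 3/4 = 0.75
print(f1_score(y_true, y_pred))          # harmonic mean of the two = 0.75
```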
5. Mean Squared Error (MSE)
•Definition: Measures the average squared difference between actual and predicted values.
•Formula: (Sum of squared errors) / Number of predictions
•Example: Used in regression tasks, a low MSE indicates that the model's predictions are close to the actual values.
6. Root Mean Squared Error (RMSE)
•Definition: The square root of MSE, giving a metric in the same units as the output.
•Example: If the MSE is 4, the RMSE is 2.
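A small numpy sketch of MSE and RMSE on made-up actual and predicted values:

```python
import numpy as np

y_true = np.array([3.0, 5.0, 2.5, 7.0])   # actual values (made up)
y_pred = np.array([2.5, 5.0, 4.0, 8.0])   # model predictions (made up)

mse = np.mean((y_true - y_pred) ** 2)     # average squared error = 0.875
rmse = np.sqrt(mse)                        # same units as the target, ~0.935

print(mse, rmse)
```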
7. Receiver Operating Characteristic (ROC) Curve
•Definition: A graph showing the trade-off between the true positive rate (recall) and the false positive rate (1-specificity) at different thresholds.
•Application: Helps evaluate classification models, particularly when classes are imbalanced.
8. Area Under the ROC Curve (AUC-ROC)
•Definition: Measures the area under the ROC curve. A higher AUC-
ROC value indicates better model performance.
•Example: An AUC-ROC of 0.5 means the model is random, while 1.0
indicates perfect discrimination.
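A sketch of the ROC trade-off and AUC using scikit-learn on hypothetical labels and scores:

```python
from sklearn.metrics import roc_curve, roc_auc_score

# Hypothetical true labels and predicted probabilities for the positive class.
y_true = [0, 0, 1, 1, 0, 1, 0, 1]
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.9, 0.6, 0.7]

fpr, tpr, thresholds = roc_curve(y_true, y_score)   # one point per threshold
print(list(zip(fpr, tpr)))                          # the ROC trade-off pairs
print(roc_auc_score(y_true, y_score))               # 1.0 = perfect, 0.5 = random
```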
9. Confusion Matrix
•Definition: A table showing the number of true positives, false
positives, true negatives, and false negatives.
•Use: Helps visualize the model's performance and identify potential
areas for improvement.
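A sketch of the confusion matrix for the same made-up labels used in the metrics example above:

```python
from sklearn.metrics import confusion_matrix

y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

# Rows are actual classes, columns are predicted: [[TN, FP], [FN, TP]].
print(confusion_matrix(y_true, y_pred))
# [[5 1]
#  [1 3]]
```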
10. Log Loss
•Definition: Measures the uncertainty of the model's predictions.
•Formula: Log Loss = −(1/N) Σᵢ [yᵢ · log(pᵢ) + (1 − yᵢ) · log(1 − pᵢ)], where yᵢ is the actual
label, pᵢ is the predicted probability, and N is the number of predictions.
•Example: Lower log loss indicates a better model.
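A sketch of log loss computed both directly from the formula above and via scikit-learn, on made-up labels and probabilities:

```python
import numpy as np
from sklearn.metrics import log_loss

y_true = np.array([1, 0, 1, 1])      # actual labels (made up)
p = np.array([0.9, 0.2, 0.7, 0.6])   # predicted probabilities of class 1

# Direct application of the formula above.
manual = -np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))
print(manual, log_loss(y_true, p))   # the two values agree, ~0.3
```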
11. Mean Absolute Error (MAE)
•Definition: MAE quantifies the average distance between actual values and predicted values, taking the absolute difference between them. It
provides an idea of how close the predictions are to the actual values.
•Formula: MAE = (1/n) Σᵢ |yᵢ − ŷᵢ|
• Where:
• n is the number of data points.
• yᵢ is the actual value of the i-th data point.
• ŷᵢ is the predicted value for the i-th data point.
• The absolute difference |yᵢ − ŷᵢ| is calculated for each data point, and the mean is taken.
•Interpretation:
• A lower MAE indicates that the predictions are closer to the actual values.
• MAE provides a clear and intuitive measure of the model's performance.
•Advantages:
• Easy to understand and interpret.
• Not as sensitive to outliers as some other metrics, such as mean squared error (MSE).
•Disadvantages:
• Does not differentiate between positive and negative errors, treating all errors equally.
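The MAE formula above, applied directly in numpy to the same made-up values as the MSE example:

```python
import numpy as np

y_true = np.array([3.0, 5.0, 2.5, 7.0])   # actual values (made up)
y_pred = np.array([2.5, 5.0, 4.0, 8.0])   # predictions (made up)

mae = np.mean(np.abs(y_true - y_pred))    # (1/n) * sum of |y_i - y^_i|
print(mae)   # (0.5 + 0 + 1.5 + 1) / 4 = 0.75
```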
12. Gini Coefficient
•Definition: The Gini coefficient measures the impurity of a data set or node in a decision tree. It calculates the probability
that a randomly chosen instance from the data set will be misclassified if it were to be randomly labeled.
•Formula:
• Given a set of classes in a data set, the Gini coefficient is calculated as:
• Gini = 1 − Σⱼ pⱼ² (summing over the k classes)
• Where:
• k is the number of classes.
• pⱼ is the probability (or proportion) of class j in the data set.
•Range:
• The Gini coefficient ranges from 0 to 1 − 1/k.
• A value of 0 indicates perfect purity, where all instances belong to one class.
• The maximum value, 1 − 1/k, indicates maximum impurity, where the instances are equally distributed among the k classes.
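A direct implementation of the Gini formula above; the class proportions passed in are hypothetical, chosen to show the two extremes and a middle case.

```python
import numpy as np

def gini(proportions):
    """Gini impurity: 1 minus the sum of squared class proportions."""
    p = np.asarray(proportions, dtype=float)
    return 1.0 - np.sum(p ** 2)

print(gini([1.0, 0.0]))   # 0.0: perfectly pure node
print(gini([0.5, 0.5]))   # 0.5: maximum impurity for k = 2 (i.e., 1 - 1/2)
print(gini([0.7, 0.3]))   # 0.42: somewhere in between
```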
Confusion Matrix
Model Overfitting and Underfitting
In machine learning, finding the right balance between fitting a model too closely or too loosely to the data is essential for good
model performance. Understanding overfitting and underfitting helps in selecting the most appropriate model complexity.
Overfitting
A model is overfitted when it captures not only the underlying patterns in the data but also the noise and fluctuations in the training
set.
•Causes:
• Excessive model complexity (e.g., too many parameters, high-degree polynomial)
• Insufficient training data relative to model complexity
•Consequences:
• Poor generalization to new, unseen data
• High variance and low bias in predictions
• High training accuracy but low testing accuracy
•Solutions:
• Simplify the model (e.g., reduce the number of parameters)
• Use regularization techniques (e.g., L1 or L2 regularization)
• Cross-validation to select appropriate model complexity
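A sketch of the overfitting symptom (high training accuracy, low testing accuracy) and an L2-regularization fix, using a deliberately over-complex polynomial model on noisy data; the degree, noise level, and alpha are arbitrary assumptions.

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.pipeline import make_pipeline
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import train_test_split

# Noisy samples from a simple underlying curve (made up for illustration).
rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(40, 1))
y = np.sin(X[:, 0]) + rng.normal(0, 0.3, size=40)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.5, random_state=0)

# Degree-15 polynomial: enough parameters to memorize the training noise.
over = make_pipeline(PolynomialFeatures(15), LinearRegression()).fit(X_tr, y_tr)
print(over.score(X_tr, y_tr), over.score(X_te, y_te))   # typically near-perfect train, poor test

# Same complexity plus L2 regularization (Ridge) generalizes better.
reg = make_pipeline(PolynomialFeatures(15), Ridge(alpha=1.0)).fit(X_tr, y_tr)
print(reg.score(X_tr, y_tr), reg.score(X_te, y_te))     # the train/test gap shrinks
```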
Underfitting
A model is underfitted when it is too simple and fails to capture the underlying patterns in the data.
•Causes:
• Insufficient model complexity (e.g., too few parameters)
• Lack of relevant features or poor feature engineering
• Inadequate training data for the model
•Consequences:
• Poor performance on both training and testing data
• High bias and low variance in predictions
• The model may fail to identify relationships in the data
•Solutions:
• Increase model complexity (e.g., add more features or layers)
• Use more sophisticated algorithms
• Feature engineering (e.g., creating new features or transformations)
• Increase the size of the training dataset
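The complementary sketch: a plain linear model underfits curved data, and adding polynomial features (more model complexity) fixes it; the data and the choice of degree 3 are illustrative assumptions.

```python
import numpy as np
from sklearn.preprocessing import PolynomialFeatures
from sklearn.pipeline import make_pipeline
from sklearn.linear_model import LinearRegression

# Curved data that a straight line cannot capture (made up for illustration).
rng = np.random.default_rng(1)
X = rng.uniform(-4, 4, size=(200, 1))
y = np.sin(X[:, 0]) + rng.normal(0, 0.1, size=200)

line = LinearRegression().fit(X, y)            # too simple: underfits
cubic = make_pipeline(PolynomialFeatures(3),   # added complexity via new features
                      LinearRegression()).fit(X, y)

# The straight line explains little of the variance; the cubic fits far better.
print(line.score(X, y), cubic.score(X, y))
```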
Take-away Message: Statistical Challenges in Machine Learning
•Challenges:
• Dealing with missing data
• Handling outliers
• Multicollinearity: Correlation among input features
• Imbalanced datasets: Unequal distribution of classes
•Solutions:
• Data imputation, scaling, and normalization
• Robust statistics for handling outliers
• Feature selection and engineering
• Class weighting or resampling for imbalanced classes (several of these are combined in the sketch below)
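A closing sketch tying the listed solutions together: median imputation for missing values, scaling, and class weighting for imbalance, in one scikit-learn pipeline; the synthetic data and every parameter choice are illustrative assumptions.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Synthetic imbalanced data with missing entries (made up for illustration).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = (rng.random(200) < 0.1).astype(int)   # ~10% positives: imbalanced classes
X[rng.random(X.shape) < 0.05] = np.nan    # ~5% of entries missing

pipe = Pipeline([
    ("impute", SimpleImputer(strategy="median")),           # missing data
    ("scale", StandardScaler()),                            # scaling/normalization
    ("model", LogisticRegression(class_weight="balanced")), # imbalance
])
pipe.fit(X, y)
print(pipe.predict(X[:5]))
```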