
Machine Learning

1
An Application
 A credit card company receives thousands of
applications for new cards. Each application
contains information about an applicant,
 age
 marital status
 annual salary
 outstanding debts
 credit rating
 etc.
 Problem: to decide whether an application
should be approved, or equivalently, to classify
applications into two categories: approved and
not approved.

2
Machine learning
 Like human learning from past experiences.
 A computer does not have “experiences”.
 A computer system learns from data, which
represent some “past experiences” of an
application domain.
 Our focus is to learn a target function that can be
used to predict the values of a discrete class
attribute; here, in the example, approved or
not approved.

3
Some more examples of tasks that are
best solved by using a learning algorithm
 Recognizing patterns:
 Facial identities or facial expressions
 Handwritten or spoken words
 Medical images
 Generating patterns:
 Generating images or motion sequences
 Recognizing anomalies:
 Unusual sequences of credit card transactions
 Unusual patterns of sensor readings in a nuclear power plant
or unusual sound in your car engine.
 Prediction:
 Future stock prices or currency exchange rates

4
Some web-based examples of machine
learning
 The web contains a lot of data. Tasks with very big
datasets often use machine learning
 especially if the data is noisy or non-stationary.

 Spam filtering, fraud detection:


 The enemy adapts so we must adapt too.

 Recommendation systems:
 Lots of noisy data

 Information retrieval:
 Find documents or images with similar content.

 Data Visualization:
 Display a huge database in a revealing way

5
Related Fields

[Diagram: Data Mining and Knowledge Discovery shown at the intersection of Machine Learning, Statistics, Databases, and Visualization]

6
Statistics, Machine Learning and
Data Mining
 Statistics:
 more theory-based
 more focused on testing hypotheses
 Machine learning
 more heuristic
 focused on improving performance of a learning agent
 also looks at real-time learning and robotics – areas not part of
data mining
 Data Mining and Knowledge Discovery
 integrates theory and heuristics
 focus on the entire process of knowledge discovery, including
data cleaning, learning, and integration and visualization of
results
 Distinctions are fuzzy

7
Types of learning task
 Supervised learning
 classification is seen as supervised learning from
examples.
 Supervision: The data (observations, measurements, etc.) are
labeled with pre-defined classes. It is as if a “teacher” gives
the classes (supervision).
 Test data are classified into these classes too.

 Unsupervised learning
 Class labels of the data are unknown
 Given a set of data, the task is to establish the
existence of classes or clusters in the data

8
Supervised learning process: two steps

 Learning (training): Learn a model using the training data


 Testing: Test the model using unseen test data to assess the
model accuracy

Accuracy = Number of correct classifications / Total number of test cases
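As a minimal sketch, the accuracy formula can be read off directly in Python (the labels below are made up purely for illustration):

```python
# Accuracy = number of correct classifications / total number of test cases
true_labels      = ["approved", "approved", "not-approved", "approved", "not-approved"]
predicted_labels = ["approved", "not-approved", "not-approved", "approved", "not-approved"]

correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
accuracy = correct / len(true_labels)
print(accuracy)  # 0.8 -> 4 of the 5 test cases were classified correctly
```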

9
What do we mean by learning?

 Given
 a data set D,
 a task T, and
 a performance measure P,
 a computer system is said to learn from D to perform the
task T if after learning the system’s performance on T
improves as measured by P.

 In other words, the learned model helps the system to


perform T better as compared to no learning

10
An Example
 Data: Loan application data
 Task: Predict whether a loan should be approved
or not.
 Performance measure: accuracy.

 No learning: classify all future applications (test data)
to the majority class (i.e., Yes):
Accuracy = 9/15 = 60%.
 We can do better than 60% with learning.
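A small sketch of this no-learning baseline using scikit-learn's DummyClassifier; the 9 Yes / 6 No labels come from the slide, while the feature matrix is a placeholder:

```python
import numpy as np
from sklearn.dummy import DummyClassifier

# 15 loan applications: 9 labeled Yes, 6 labeled No (as in the slide).
y = np.array(["Yes"] * 9 + ["No"] * 6)
X = np.zeros((15, 1))  # placeholder features; the baseline ignores them

baseline = DummyClassifier(strategy="most_frequent").fit(X, y)
print(baseline.score(X, y))  # 0.6 -> the 60% majority-class accuracy
```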

11
Fundamental assumption of learning

Assumption: The distribution of training examples is identical


to the distribution of test examples (including future
unseen examples).

 In practice, this assumption is often violated to a certain degree.
 Strong violations will clearly result in poor classification
accuracy.
 To achieve good accuracy on the test data, training
examples must be sufficiently representative of the test
data.

12
 Classifier accuracy measures and their evaluation.

13
Evaluating Classification/Prediction
methods
 Predictive accuracy
 Efficiency
 time to construct the model
 time to use the model
 Robustness
 handling noise and missing values
 Scalability
 efficiency in disk-resident databases
 Interpretability
 understandability of, and insight provided by, the model
 Compactness of the model
 size of the tree, or the number of rules.
14
Evaluation methods of
classifier/Predictor
 Holdout set: The available data set D is divided into
two disjoint subsets,
 the training set Dtrain (for learning a model)
 the test set Dtest (for testing the model)
 Important: training set should not be used in testing
and the test set should not be used in learning.
 Unseen test set provides an unbiased estimate of accuracy.
 The test set is also called the holdout set
 This method is mainly used when the data set D is
large.
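A minimal sketch of the holdout method with scikit-learn; the Iris dataset, the decision tree, and the 70/30 split are illustrative choices, not part of the slides:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)

# Disjoint Dtrain / Dtest: the test set is never touched during learning.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=0)

model = DecisionTreeClassifier().fit(X_train, y_train)
print(accuracy_score(y_test, model.predict(X_test)))  # accuracy on unseen data
```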

15
Holdout method
 The holdout method has two basic drawbacks
 In problems where we have a sparse dataset we may not be able
to afford the “luxury” of setting aside a portion of the dataset for
testing
 Since it is a single train-and-test experiment, the holdout estimate
of the error rate will be misleading if we happen to get an
“unfortunate” split
 The limitations of the holdout can be overcome with a family of
re-sampling (cross-validation) methods, at the expense of more computation:
 Random Sub-sampling
 N-Fold Cross-Validation
 Leave-one-out Cross-Validation

16
Evaluation methods

1. Random Subsampling
 The holdout method is repeated K times, performing K data
splits of the dataset
 Each split randomly selects a (fixed) number of examples
without replacement
 For each data split we retrain the classifier from scratch with
the training examples and estimate the error Ei with the test examples
 The true error estimate is obtained as the average of the
separate estimates Ei
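A sketch of random subsampling with scikit-learn's ShuffleSplit; K = 10, the 30% test size, and the classifier are illustrative assumptions:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import ShuffleSplit, cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# K = 10 independent random splits, each holding out 30% of the data for testing.
splits = ShuffleSplit(n_splits=10, test_size=0.3, random_state=0)
scores = cross_val_score(DecisionTreeClassifier(), X, y, cv=splits)

print(scores.mean())  # average of the K separate estimates
```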

17
Evaluation methods
2. N-fold cross-validation:
 The available data is partitioned into n equal-size disjoint
subsets.
 Use each subset as the test set and combine the rest n-1 subsets

as the training set to learn a classifier.


 The procedure is run n times, which gives n accuracies.

 The final estimated accuracy of learning is the average of the n

accuracies.
 10-fold and 5-fold cross-validations are commonly used.

 This method is used when the available data is not large.

 N-Fold Cross validation is similar to Random Subsampling


 The advantage of N-Fold Cross-Validation is that all the examples in
the dataset are eventually used for both training and testing
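A sketch of 10-fold cross-validation in the same style (dataset and classifier are again only placeholders):

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# Each of the 10 folds serves once as the test set; the other 9 train the model.
scores = cross_val_score(DecisionTreeClassifier(), X, y, cv=10)
print(scores.mean())  # final estimated accuracy = average of the 10 accuracies
```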
18
Evaluation methods

3. Leave-one-out Cross Validation :


 Leave-one-out is the degenerate case of N-Fold Cross
Validation, where N is chosen as the total number of
examples.
 For a dataset with N examples, perform N experiments;
for each experiment use N-1 examples for training and
the remaining example for testing.
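Leave-one-out can be sketched the same way; note it runs one experiment per example, so it is only practical for small datasets:

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import LeaveOneOut, cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# N experiments: train on N-1 examples, test on the single held-out example.
scores = cross_val_score(DecisionTreeClassifier(), X, y, cv=LeaveOneOut())
print(scores.mean())
```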

19
Bootstrap
 Select training samples uniformly with
replacement
 Each time a tuple is selected, it is equally likely to be
selected again and re-added to the training set.
 Suppose we have a dataset of d tuples. It is sampled d
times with replacement, resulting in a bootstrap sample.
Try this out several times.
 E.g., the .632 bootstrap:
 Each tuple has a probability of 1/d of being selected,
and (1 - 1/d) of not being selected.
 If d is large, the probability of never being selected
approaches (1 - 1/d)^d ≈ e^-1 = 0.368, so 36.8% of the tuples
will not be selected for training and the remaining
63.2% will form the training set. Repeat this sampling
procedure k times.
20
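A small numpy sketch of one bootstrap sample that checks the 63.2% / 36.8% split; the dataset size d is an arbitrary illustrative value:

```python
import numpy as np

rng = np.random.default_rng(0)
d = 10_000  # illustrative dataset size

# Sample d tuple indices uniformly with replacement -> one bootstrap sample.
sample = rng.integers(0, d, size=d)

in_sample = np.unique(sample).size / d
print(in_sample)      # ~0.632: distinct tuples that form the training set
print(1 - in_sample)  # ~0.368: tuples never selected (usable for testing)
```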
Evaluation methods
 Accuracy is a measure to evaluate the classifier
 Accuracy is not suitable in some applications.
 In text mining, we may only be interested in the
documents of a particular topic, which are only a small
portion of a big document collection.
 In classification involving skewed or highly imbalanced
data, e.g., network intrusion and financial fraud
detections, we are interested only in the minority class.
 High accuracy does not mean any intrusion is detected.
 E.g., 1% intrusion. Achieve 99% accuracy by doing nothing.
 The class of interest is commonly called the positive
class, and the rest are called negative classes.

21
Precision and recall measures
We use a confusion matrix to introduce them
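The standard layout, with the class of interest as the positive class:

                     Classified positive    Classified negative
Actual positive      TP (true positives)    FN (false negatives)
Actual negative      FP (false positives)   TN (true negatives)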

22
Precision and recall measures

p = TP / (TP + FP)        r = TP / (TP + FN)
 Precision p is the number of correctly classified
positive examples divided by the total number of
examples that are classified as positive.
 The term precision indicates the relevancy of the
prediction, as it represents, out of all samples
labeled as class A, what fraction actually belongs
to class A.
23
Precision and recall measures

 Recall r is the number of correctly classified


positive examples divided by the total number of
actual positive examples in the test set.
 Recall indicates the fraction of class A that the
classifier picks up out of all samples that belonged
to class A.
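A sketch of both measures with scikit-learn's metrics; the label vectors are made up so the counts are easy to verify by hand:

```python
from sklearn.metrics import confusion_matrix, precision_score, recall_score

# 1 = positive class of interest, 0 = negative class.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 1, 0, 0, 0, 0, 0]

print(confusion_matrix(y_true, y_pred))  # rows = actual, columns = predicted
print(precision_score(y_true, y_pred))   # TP / (TP + FP) = 2 / 3
print(recall_score(y_true, y_pred))      # TP / (TP + FN) = 2 / 4
```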

24
An example

 Consider a confusion matrix that gives
 precision p = 100% and
 recall r = 1%
because we classified only one positive example correctly
and no negative examples wrongly.
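One set of counts consistent with these numbers: TP = 1, FP = 0, FN = 99, which gives p = 1 / (1 + 0) = 100% and r = 1 / (1 + 99) = 1%.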

25
Receiver operating characteristic curve
 It is commonly called the ROC curve.
 It is a plot of the true positive rate (TPR) against the false
positive rate (FPR).
 True positive rate: TPR = TP / (TP + FN)
 False positive rate: FPR = FP / (FP + TN)
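A sketch of the curve with scikit-learn; the true labels and scores below are hypothetical classifier outputs:

```python
from sklearn.metrics import roc_curve, roc_auc_score

y_true  = [0, 0, 1, 1, 0, 1, 0, 1]                   # actual classes
y_score = [0.1, 0.4, 0.35, 0.8, 0.2, 0.7, 0.6, 0.9]  # predicted scores for the positive class

fpr, tpr, thresholds = roc_curve(y_true, y_score)
print(list(zip(fpr, tpr)))             # the (FPR, TPR) points of the ROC curve
print(roc_auc_score(y_true, y_score))  # area under the ROC curve
```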

26
Sensitivity and Specificity

 In statistics, there are two other evaluation measures:


 Sensitivity: Same as TPR
 Specificity: Also called True Negative Rate (TNR)

 Then we have TPR = Sensitivity and FPR = 1 - Specificity.
27
F1-value (also called F1-score)
 It is hard to compare two classifiers using two measures. The F1-score
combines precision and recall into one measure, their harmonic mean:

F1 = 2pr / (p + r)

 The harmonic mean of two numbers tends to be closer to the
smaller of the two.
 For the F1-value to be large, both p and r must be large.
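With the earlier example (p = 100%, r = 1%): F1 = 2 × 1.0 × 0.01 / (1.0 + 0.01) ≈ 0.02, i.e., about 2%, close to the smaller of the two values, as the harmonic mean suggests.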

28
libraries

• scipy
• numpy
• matplotlib
• pandas
• sklearn
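A quick sanity check that this stack is installed (version numbers will vary by environment):

```python
import scipy, numpy, matplotlib, pandas, sklearn

for lib in (scipy, numpy, matplotlib, pandas, sklearn):
    print(lib.__name__, lib.__version__)
```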

29
 Transfer learning generally refers to a process
where a model trained on one problem is used in
some way on a second related problem.

30
 Transfer learning has the benefit of decreasing
the training time for a neural network model and
can result in lower generalization error.
 The weights in re-used layers may be used as the
starting point for the training process and adapted in
response to the new problem. This usage treats transfer
learning as a type of weight initialization scheme. It is
useful when the first, related problem has much more
labeled data than the problem of interest, and the
similarity in the structure of the two problems makes the
learned weights useful in both contexts.
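A hedged Keras sketch of transfer learning as weight initialization; MobileNetV2, the 160×160 input size, and the 5-class head are illustrative assumptions, not something stated in the slides:

```python
import tensorflow as tf

# Re-use weights learned on ImageNet (the data-rich "first problem").
base = tf.keras.applications.MobileNetV2(
    input_shape=(160, 160, 3), include_top=False, weights="imagenet")
base.trainable = False  # freeze the re-used layers initially

# New classification head for the related problem of interest (assumed 5 classes).
model = tf.keras.Sequential([
    base,
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(5, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(new_task_images, new_task_labels, epochs=5)  # then optionally unfreeze and fine-tune
```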
31
