
UET (Since 2004)
ĐẠI HỌC CÔNG NGHỆ, ĐHQGHN
VNU-University of Engineering and Technology
INT3209 - DATA MINING


Week 5: Classification
Model Improvements
Duc-Trong Le

Slide credit: Vipin Kumar et al.
https://www-users.cse.umn.edu/~kumar001/dmbook

Hanoi, 09/2021
Outline

● Class Imbalance
● Model Underfitting, Overfitting
● Model Selection
● Model Evaluation
Class Imbalance Problem

● Lots of classification problems where the classes are skewed (more records from one class than another)
– Credit card fraud
– Intrusion detection
– Defective products in manufacturing assembly line
– COVID-19 test results on a random sample

● Key Challenge:
– Evaluation measures such as accuracy are not well-suited for imbalanced classes
Accuracy

                          PREDICTED CLASS
                          Class=Yes      Class=No
ACTUAL      Class=Yes     a (TP)         b (FN)
CLASS       Class=No      c (FP)         d (TN)

● Most widely-used metric:
   Accuracy = (a + d) / (a + b + c + d) = (TP + TN) / (TP + TN + FP + FN)
Problem with Accuracy
● Consider a 2-class problem
– Number of Class NO examples = 990
– Number of Class YES examples = 10
● If a model predicts everything to be class NO, accuracy
is 990/1000 = 99 %
– This is misleading because this trivial model does not detect any class
YES example
– Detecting the rare class is usually more interesting (e.g., frauds,
intrusions, defects, etc)

PREDICTED CLASS
Class=Yes Class=No
ACTUAL Class=Yes 0 10
CLASS Class=No 0 990
Which model is better?

PREDICTED
Class=Yes Class=No
A ACTUAL Class=Yes 0 10
Class=No 0 990

Accuracy: 99%

PREDICTED
B Class=Yes Class=No
ACTUAL Class=Yes 10 0
Class=No 500 490

Accuracy: 50%
Alternative Measures

                          PREDICTED CLASS
                          Class=Yes      Class=No
ACTUAL      Class=Yes     a (TP)         b (FN)
CLASS       Class=No      c (FP)         d (TN)

Precision p = a / (a + c)
Recall r = a / (a + b)
F-measure F = 2rp / (r + p) = 2a / (2a + b + c)
Alternative Measures

                          PREDICTED CLASS
                          Class=Yes      Class=No
ACTUAL      Class=Yes     10             0
CLASS       Class=No      10             980

Precision = 10/20 = 0.5,  Recall = 10/10 = 1.0,  F-measure = 0.667,  Accuracy = 0.99
Alternative Measures

                          PREDICTED CLASS
                          Class=Yes      Class=No
ACTUAL      Class=Yes     10             0
CLASS       Class=No      10             980

Precision = 0.5,  Recall = 1.0,  F-measure = 0.667,  Accuracy = 0.99

                          PREDICTED CLASS
                          Class=Yes      Class=No
ACTUAL      Class=Yes     1              9
CLASS       Class=No      0              990

Precision = 1/1 = 1.0,  Recall = 1/10 = 0.1,  F-measure = 0.182,  Accuracy = 0.991
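The precision, recall, and F-measure values above were shown as figures in the original slides; the short Python sketch below (not part of the deck, plain Python only) recomputes them from the confusion-matrix counts.

```python
# Accuracy, precision, recall and F1 from confusion-matrix counts
# (counts taken from the two matrices above).
def summarize(tp, fn, fp, tn):
    accuracy = (tp + tn) / (tp + fn + fp + tn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

print(summarize(tp=10, fn=0, fp=10, tn=980))  # (0.99, 0.5, 1.0, 0.667)
print(summarize(tp=1, fn=9, fp=0, tn=990))    # (0.991, 1.0, 0.1, 0.182)
```

Note that the second classifier has the slightly higher accuracy (0.991 vs 0.99) yet misses 9 of the 10 positive instances, which is exactly why precision, recall, and F-measure are reported for imbalanced problems.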
Measures of Classification Performance

                          PREDICTED CLASS
                          Yes            No
ACTUAL      Yes           TP             FN
CLASS       No            FP             TN

TPR = TP/(TP+FN)    FNR = FN/(TP+FN)    FPR = FP/(FP+TN)    TNR = TN/(FP+TN)

● α is the probability that we reject the null hypothesis when it is true.
  This is a Type I error or a false positive (FP).

● β is the probability that we accept the null hypothesis when it is false.
  This is a Type II error or a false negative (FN).
Alternative Measures

A PREDICTED CLASS
Class=Yes Class=No
ACTUAL Class=Yes 40 10
CLASS
Class=No 10 40

B PREDICTED CLASS
Class=Yes Class=No
ACTUAL Class=Yes 40 10
CLASS
Class=No 1000 4000
Which of these classifiers is better?

A PREDICTED CLASS
Class=Yes Class=No

ACTUAL Class=Yes 10 40
CLASS
Class=No 10 40

B PREDICTED CLASS
Class=Yes Class=No

ACTUAL Class=Yes 25 25
CLASS Class=No 25 25

C PREDICTED CLASS
Class=Yes Class=No

ACTUAL Class=Yes 40 10
CLASS
Class=No 40 10
ROC (Receiver Operating Characteristic)

● A graphical approach for displaying trade-off between detection rate and false alarm rate
● Developed in 1950s for signal detection theory
to analyze noisy signals
● ROC curve plots TPR against FPR
– Performance of a model represented as a point in an
ROC curve
ROC Curve

(TPR,FPR):
● (0,0): declare everything
to be negative class
● (1,1): declare everything
to be positive class
● (1,0): ideal

● Diagonal line:
– Random guessing
– Below diagonal line:
◆ prediction is opposite
of the true class
ROC (Receiver Operating Characteristic)

● To draw ROC curve, classifier must produce continuous-valued output
– Outputs are used to rank test records, from the most likely
positive class record to the least likely positive class record
– By using different thresholds on this value, we can create
different variations of the classifier with TPR/FPR tradeoffs
● Many classifiers produce only discrete outputs (i.e.,
predicted class)
– How to get continuous-valued outputs?
◆ Decision trees, rule-based classifiers, neural networks,
Bayesian classifiers, k-nearest neighbors, SVM
Example: Decision Trees

[Figure: a decision tree with continuous-valued outputs, e.g., Gini scores]
ROC Curve Example
- 1-dimensional data set containing 2 classes (positive and negative)
- Any point located at x > t is classified as positive

At threshold t:
TPR=0.5, FNR=0.5, FPR=0.12, TNR=0.88
How to Construct an ROC curve

● Use a classifier that produces a continuous-valued score for each instance
  • The more likely it is for the instance to be in the + class, the higher the score
● Sort the instances in decreasing order according to the score
● Apply a threshold at each unique value of the score
● Count the number of TP, FP, TN, FN at each threshold
  • TPR = TP/(TP+FN)
  • FPR = FP/(FP + TN)

Instance    Score    True Class
1           0.95     +
2           0.93     +
3           0.87     -
4           0.85     -
5           0.85     -
6           0.85     +
7           0.76     -
8           0.53     +
9           0.43     -
10          0.25     +
How to construct an ROC curve

[Table: TP, FP, TN, FN counts and the resulting TPR/FPR at each threshold ("Threshold >="), together with the resulting ROC curve]
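The threshold table and the resulting curve were figures in the original slides. The sketch below is a minimal Python reconstruction of the same threshold sweep over the ten scored instances listed above; the scikit-learn cross-check in the trailing comments is optional and assumed, not part of the deck.

```python
import numpy as np

# Scores and true labels of the 10 instances from the slide ('+' -> 1, '-' -> 0)
scores = np.array([0.95, 0.93, 0.87, 0.85, 0.85, 0.85, 0.76, 0.53, 0.43, 0.25])
labels = np.array([1, 1, 0, 0, 0, 1, 0, 1, 0, 1])

points = []
# Apply a threshold at each unique score (plus +inf, where nothing is predicted positive)
for t in np.append(np.unique(scores), np.inf):
    pred = (scores >= t).astype(int)
    tp = np.sum((pred == 1) & (labels == 1))
    fp = np.sum((pred == 1) & (labels == 0))
    fn = np.sum((pred == 0) & (labels == 1))
    tn = np.sum((pred == 0) & (labels == 0))
    points.append((fp / (fp + tn), tp / (tp + fn)))  # (FPR, TPR)

print(sorted(points))  # ROC points running from (0, 0) to (1, 1)

# Optional cross-check (assumed available):
# from sklearn.metrics import roc_curve, roc_auc_score
# fpr, tpr, thresholds = roc_curve(labels, scores)
# print(roc_auc_score(labels, scores))
```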
Using ROC for Model Comparison

● No model consistently outperforms the other
  – M1 is better for small FPR
  – M2 is better for large FPR

● Area Under the ROC curve (AUC)
  – Ideal: Area = 1
  – Random guess: Area = 0.5
Dealing with Imbalanced Classes - Summary

● Many measures exist, but none of them may be ideal in all situations
  – Random classifiers can have high values for many of these measures
  – TPR/FPR provides important information but may not be sufficient by itself in many practical scenarios
– Given two classifiers, sometimes you can tell that one of them is
strictly better than the other
◆ C1 is strictly better than C2 if C1 has strictly better TPR and FPR relative to C2 (or
same TPR and better FPR, and vice versa)

– Even if C1 is strictly better than C2, C1’s F-value can be worse than
C2’s if they are evaluated on data sets with different imbalances
– Classifier C1 can be better or worse than C2 depending on the
scenario at hand (class imbalance, importance of TP vs FP, cost/time
trade-offs)
Which Classifier is better?

T1 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 50 50
ACTUAL
CLASS Class=No 1 99

T2 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 99 1
ACTUAL
Class=No 10 90
CLASS

T3 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 99 1
ACTUAL
CLASS Class=No 1 99
Which Classifier is better? Medium Skew case

T1 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 50 50
ACTUAL
CLASS Class=No 10 990

T2 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 99 1
ACTUAL
Class=No 100 900
CLASS

T3 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 99 1
ACTUAL
CLASS Class=No 10 990
Which Classifier is better? High Skew case

T1 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 50 50
ACTUAL
CLASS Class=No 100 9900

T2 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 99 1
ACTUAL
Class=No 1000 9000
CLASS

T3 PREDICTED CLASS
Class=Yes Class=No

Class=Yes 99 1
ACTUAL
CLASS Class=No 100 9900
Improve Classifiers with Imbalanced Training Set

● Modify the distribution of training data so that the rare class is well-represented in the training set
  – Undersample the majority class
  – Oversample the rare class
  (a minimal sketch of both strategies follows below)
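The sketch below illustrates both strategies using scikit-learn's `sklearn.utils.resample` on synthetic data (the data and sizes are assumptions for illustration); dedicated packages such as imbalanced-learn wrap the same idea behind a single call.

```python
import numpy as np
from sklearn.utils import resample

rng = np.random.RandomState(0)
X = rng.randn(1000, 2)
y = np.array([1] * 10 + [0] * 990)      # 10 rare-class records, 990 majority records

X_rare, X_maj = X[y == 1], X[y == 0]

# Oversample the rare class: sample with replacement up to the majority size
X_rare_up = resample(X_rare, replace=True, n_samples=len(X_maj), random_state=0)

# Undersample the majority class: sample without replacement down to the rare size
X_maj_down = resample(X_maj, replace=False, n_samples=len(X_rare), random_state=0)

# e.g., an oversampled, balanced training set
X_bal = np.vstack([X_rare_up, X_maj])
y_bal = np.array([1] * len(X_rare_up) + [0] * len(X_maj))
```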
Classification Errors

● Training errors: Errors committed on the training set
● Test errors: Errors committed on the test set
● Generalization errors: Expected error of a model over random selection of records from the same distribution
Example Dataset

Two-class problem:
  + : 5400 instances
    • 5000 instances generated from a Gaussian centered at (10,10)
    • 400 noisy instances added
  o : 5400 instances
    • Generated from a uniform distribution

10% of the data used for training and 90% of the data used for testing
Increasing number of nodes in Decision Trees

Decision Tree with 4 nodes
[Figure: the 4-node decision tree and its decision boundaries on the training data]

Decision Tree with 50 nodes
[Figure: the 50-node decision tree and its decision boundaries on the training data]

Which tree is better?
Model Underfitting and Overfitting

• As the model becomes more and more complex, test errors can start increasing even though training error may be decreasing

• Underfitting: when the model is too simple, both training and test errors are large
• Overfitting: when the model is too complex, training error is small but test error is large
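A minimal sketch that reproduces this behaviour on synthetic data loosely modelled on the example dataset above; the Gaussian spread, the leaf-count grid, and the use of `max_leaf_nodes` as the complexity knob are assumptions, so exact numbers will differ from the slides.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.RandomState(1)
# '+' class: Gaussian centered at (10,10) plus uniform noise; 'o' class: uniform
X_pos = np.vstack([rng.normal(10, 1, size=(5000, 2)), rng.uniform(0, 20, size=(400, 2))])
X_neg = rng.uniform(0, 20, size=(5400, 2))
X = np.vstack([X_pos, X_neg])
y = np.array([1] * len(X_pos) + [0] * len(X_neg))

# 10% of the data for training, 90% for testing, as in the slides
X_tr, X_te, y_tr, y_te = train_test_split(X, y, train_size=0.1, random_state=1)

for n_leaves in [4, 8, 16, 50, 150, 500]:
    tree = DecisionTreeClassifier(max_leaf_nodes=n_leaves, random_state=1).fit(X_tr, y_tr)
    train_err = 1 - tree.score(X_tr, y_tr)   # keeps decreasing with complexity
    test_err = 1 - tree.score(X_te, y_te)    # eventually starts increasing
    print(n_leaves, round(train_err, 3), round(test_err, 3))
```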
Model Overfitting – Impact of Training Data Size

Using twice the number of data instances

• Increasing the size of training data reduces the difference between training and
testing errors at a given size of model
Model Overfitting – Impact of Training Data Size

Decision Tree with 50 nodes
[Figures: decision boundaries of a 50-node tree trained on the original data vs. on twice the number of data instances]

• Increasing the size of training data reduces the difference between training and testing errors at a given size of model
Reasons for Model Overfitting

● Not enough training data

● High model complexity


– Multiple Comparison Procedure
Effect of Multiple Comparison Procedure

● Consider the task of predicting whether the stock market will rise/fall in the next 10 trading days

● Random guessing: P(correct) = 0.5

● Make 10 random guesses in a row:

  Day 1    Up
  Day 2    Down
  Day 3    Down
  Day 4    Up
  Day 5    Down
  Day 6    Down
  Day 7    Up
  Day 8    Up
  Day 9    Up
  Day 10   Down
Effect of Multiple Comparison Procedure

● Approach:
  – Get 50 analysts
  – Each analyst makes 10 random guesses
  – Choose the analyst that makes the largest number of correct predictions

● Probability that at least one analyst makes at least 8 correct predictions:
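The closing probability appeared as a formula in the original slide; the arithmetic is just binomial counting and can be reproduced in a few lines of Python.

```python
from math import comb

# P(a single analyst gets at least 8 of 10 random guesses right)
p_one = sum(comb(10, k) for k in (8, 9, 10)) / 2**10   # 56/1024 ~= 0.0547

# P(at least one of 50 independent analysts does so)
p_any = 1 - (1 - p_one) ** 50                          # ~= 0.94
print(p_one, p_any)
```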
Effect of Multiple Comparison Procedure

● Many algorithms employ the following greedy strategy:


– Initial model: M
– Alternative model: M’ = M ∪ γ,
where γ is a component to be added to the model
(e.g., a test condition of a decision tree)
– Keep M’ if improvement, Δ(M,M’) > α

● Oftentimes, γ is chosen from a set of alternative components, Γ = {γ1, γ2, …, γk}

● If many alternatives are available, one may inadvertently add irrelevant components to the model, resulting in model overfitting
Effect of Multiple Comparison - Example

● Use additional 100 noisy variables generated from a uniform distribution, along with X and Y as attributes
● Use 30% of the data for training and 70% of the data for testing

[Figure: comparison with using only X and Y as attributes]
Notes on Overfitting

● Overfitting results in decision trees that are more complex than necessary

● Training error does not provide a good estimate of how well the tree will perform on previously unseen records

● Need ways for estimating generalization errors
Model Selection

● Performed during model building


● Purpose is to ensure that model is not overly
complex (to avoid overfitting)
● Need to estimate generalization error
– Using Validation Set
– Incorporating Model Complexity
Model Selection:
Using Validation Set
● Divide training data into two parts:
– Training set:
◆ use for model building
– Validation set:
◆ use for estimating generalization error
◆ Note: validation set is not the same as test set

● Drawback:
– Less data available for training
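A minimal scikit-learn sketch of validation-set model selection; the dataset, the candidate depths, and the split proportions are assumptions for illustration only.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=2000, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# Split the training data again: one part for model building, one for validation
X_tr, X_val, y_tr, y_val = train_test_split(X_train, y_train, test_size=0.25, random_state=0)

best_depth, best_val_acc = None, -1.0
for depth in [2, 4, 6, 8, 10, None]:
    model = DecisionTreeClassifier(max_depth=depth, random_state=0).fit(X_tr, y_tr)
    val_acc = model.score(X_val, y_val)       # estimate of generalization performance
    if val_acc > best_val_acc:
        best_depth, best_val_acc = depth, val_acc

# The untouched test set is used only once, for the final evaluation
final = DecisionTreeClassifier(max_depth=best_depth, random_state=0).fit(X_train, y_train)
print(best_depth, final.score(X_test, y_test))
```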
Model Selection:
Incorporating Model Complexity
● Rationale: Occam’s Razor
– Given two models of similar generalization errors,
one should prefer the simpler model over the more
complex model

– A complex model has a greater chance of being fitted accidentally (e.g., by errors in the data)

– Therefore, one should include model complexity when evaluating a model

  Gen. Error(Model) = Train. Error(Model, Train. Data) + α × Complexity(Model)
Estimating the Complexity of Decision Trees

● Pessimistic Error Estimate of decision tree T with k leaf nodes:

  egen(T) = err(T) + Ω × k / Ntrain

  – err(T): error rate on all training records
  – Ω: trade-off hyper-parameter (similar to α): relative cost of adding a leaf node
  – k: number of leaf nodes
  – Ntrain: total number of training records
Estimating the Complexity of Decision Trees: Example

e(TL) = 4/24
e(TR) = 6/24
Ω = 1
(TL has k = 7 leaf nodes; TR has k = 4 leaf nodes)

egen(TL) = 4/24 + 1 × 7/24 = 11/24 = 0.458
egen(TR) = 6/24 + 1 × 4/24 = 10/24 = 0.417
⇒ TR has the lower pessimistic estimate
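A tiny sketch that re-checks this arithmetic (the leaf counts 7 and 4 are taken from the computation above; the helper function is ours, not from the deck).

```python
def pessimistic_error(n_errors, n_leaves, n_train, omega=1.0):
    """egen(T) = (training errors + Omega * number of leaves) / N_train."""
    return (n_errors + omega * n_leaves) / n_train

print(pessimistic_error(n_errors=4, n_leaves=7, n_train=24))  # TL: 11/24 ~= 0.458
print(pessimistic_error(n_errors=6, n_leaves=4, n_train=24))  # TR: 10/24 ~= 0.417
```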


Estimating the Complexity of Decision Trees

● Resubstitution Estimate:
– Using training error as an optimistic estimate of
generalization error
– Referred to as optimistic error estimate
e(TL) = 4/24

e(TR) = 6/24
Minimum Description Length (MDL)

● Cost(Model, Data) = Cost(Data|Model) + α × Cost(Model)
  – Cost is the number of bits needed for encoding.
  – Search for the least costly model.
● Cost(Data|Model) encodes the misclassification errors.
● Cost(Model) uses node encoding (number of children) plus splitting condition encoding.
Model Selection for Decision Trees

● Pre-Pruning (Early Stopping Rule)


– Stop the algorithm before it becomes a fully-grown tree
– Typical stopping conditions for a node:
◆ Stop if all instances belong to the same class
◆ Stop if all the attribute values are the same
– More restrictive conditions:
  ◆ Stop if the number of instances is less than some user-specified threshold
  ◆ Stop if the class distribution of instances is independent of the available features (e.g., using the χ2 test)
  ◆ Stop if expanding the current node does not improve impurity measures (e.g., Gini or information gain)
  ◆ Stop if the estimated generalization error falls below a certain threshold
  (several of these conditions map onto common hyper-parameters; see the sketch below)
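A minimal sketch of how such stopping conditions surface as pre-pruning hyper-parameters in scikit-learn's `DecisionTreeClassifier`; the specific values are illustrative only, not recommendations.

```python
from sklearn.tree import DecisionTreeClassifier

# Pre-pruning via early-stopping hyper-parameters (illustrative values)
tree = DecisionTreeClassifier(
    min_samples_split=20,         # stop if a node holds fewer than 20 instances
    min_samples_leaf=5,           # every leaf must keep at least 5 instances
    min_impurity_decrease=1e-3,   # stop if a split barely improves impurity
    max_depth=10,                 # hard cap on tree depth
)
# tree.fit(X_train, y_train) would then grow the pre-pruned tree
```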
Model Selection for Decision Trees

● Post-pruning
– Grow decision tree to its entirety
– Subtree replacement
◆ Trim the nodes of the decision tree in a
bottom-up fashion
◆ If generalization error improves after trimming,
replace sub-tree by a leaf node
◆ Class label of leaf node is determined from
majority class of instances in the sub-tree
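scikit-learn does not expose this subtree-replacement procedure directly; its built-in post-pruning is cost-complexity pruning, sketched below as an approximation (the dataset and the validation split are assumed for illustration).

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.3, random_state=0)

# 1) Grow the decision tree to its entirety
full_tree = DecisionTreeClassifier(random_state=0).fit(X_train, y_train)

# 2) Candidate pruning strengths from the cost-complexity pruning path
ccp_alphas = full_tree.cost_complexity_pruning_path(X_train, y_train).ccp_alphas
candidates = [a for a in ccp_alphas if a >= 0]   # guard against tiny negative values

# 3) Keep the pruned tree that generalizes best on held-out (validation) data
best_alpha = max(
    candidates,
    key=lambda a: DecisionTreeClassifier(ccp_alpha=a, random_state=0)
                  .fit(X_train, y_train).score(X_val, y_val),
)
pruned = DecisionTreeClassifier(ccp_alpha=best_alpha, random_state=0).fit(X_train, y_train)
print(full_tree.get_n_leaves(), "->", pruned.get_n_leaves())
```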
Example of Post-Pruning

Before splitting:  Class = Yes: 20,  Class = No: 10
  Training error (before splitting) = 10/30
  Pessimistic error = (10 + 0.5)/30 = 10.5/30

After splitting into 4 child nodes:
  Node 1: Yes = 8, No = 4      Node 2: Yes = 3, No = 4
  Node 3: Yes = 4, No = 1      Node 4: Yes = 5, No = 1
  Training error (after splitting) = 9/30
  Pessimistic error (after splitting) = (9 + 4 × 0.5)/30 = 11/30

PRUNE! (the pessimistic error increases after splitting)
Examples of Post-pruning
Model Evaluation

● Purpose:
– To estimate performance of classifier on previously
unseen data (test set)
● Holdout
– Reserve k% for training and (100-k)% for testing
– Random subsampling: repeated holdout
● Cross validation
– Partition data into k disjoint subsets
– k-fold: train on k-1 partitions, test on the remaining one
– Leave-one-out: k=n
Cross-validation Example

● 3-fold cross-validation
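The 3-fold illustration was a figure in the original slides; a minimal scikit-learn sketch of the same procedure is shown below (the data and model are assumed for illustration).

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=600, random_state=0)

# 3-fold cross-validation: train on 2 partitions, test on the remaining one
scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=3)
print(scores, scores.mean())
```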
Variations on Cross-validation

● Repeated cross-validation
– Perform cross-validation a number of times
– Gives an estimate of the variance of the
generalization error
● Stratified cross-validation
– Guarantee the same percentage of class
labels in training and test
– Important when classes are imbalanced and
the sample is small
● Use nested cross-validation approach for model
selection and evaluation
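A minimal sketch of the nested approach, with stratified folds and an inner grid search for model selection; the dataset, class weights, and parameter grid are assumptions for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score
from sklearn.tree import DecisionTreeClassifier

# Imbalanced synthetic data (90% / 10%) to motivate stratified folds
X, y = make_classification(n_samples=600, weights=[0.9, 0.1], random_state=0)

# Inner loop: model selection (choose max_depth by grid search)
inner = GridSearchCV(DecisionTreeClassifier(random_state=0),
                     param_grid={"max_depth": [2, 4, 6, None]},
                     cv=StratifiedKFold(n_splits=3))

# Outer loop: model evaluation on folds never used for selection
outer_scores = cross_val_score(inner, X, y, cv=StratifiedKFold(n_splits=5))
print(outer_scores.mean())
```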
Summary

● Class Imbalance
● Model Underfitting, Overfitting
● Model Selection
● Model Evaluation
