
Improving Performance – Hacks & Tricks

Data Preparation
Data pre-processing techniques generally refer to the addition, deletion, or transformation of training set data.

Data Cleaning: Identifying and correcting mistakes or errors in the data.
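As an illustration only (not from the slides), a minimal data-cleaning sketch in Python, assuming pandas is available and a hypothetical file data.csv with hypothetical columns "label" and "age":

import pandas as pd

df = pd.read_csv("data.csv")                       # hypothetical input file
df = df.drop_duplicates()                          # remove duplicate rows
df = df.dropna(subset=["label"])                   # drop rows with a missing label
df["age"] = df["age"].fillna(df["age"].median())   # impute a numeric column
df = df[df["age"] >= 0]                            # remove obviously invalid values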


Data Transformation
Data Transforms: Changing the scale or distribution of variables.
Normalized to 0 to 1.
Rescaled to -1 to 1.
Standardized (rescaled to zero mean and unit variance).
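A minimal sketch of these three transforms, assuming scikit-learn is available and X is a numeric NumPy array:

import numpy as np
from sklearn.preprocessing import MinMaxScaler, StandardScaler

X = np.array([[10.0], [20.0], [30.0], [40.0]])                 # toy data

X_01  = MinMaxScaler(feature_range=(0, 1)).fit_transform(X)    # normalized to 0..1
X_11  = MinMaxScaler(feature_range=(-1, 1)).fit_transform(X)   # rescaled to -1..1
X_std = StandardScaler().fit_transform(X)                      # zero mean, unit variance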
Big Data

Data augmentation refers to techniques that increase the amount of training data by adding slightly modified copies of existing data, or newly created synthetic data derived from existing data.
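A minimal sketch of image-style augmentation using only NumPy (illustrative; real pipelines typically use library transforms, and the 'images' list is assumed to exist):

import numpy as np

def augment(image, rng):
    """Return a slightly modified copy of a (H, W) image array."""
    out = image.copy()
    if rng.random() < 0.5:
        out = np.fliplr(out)                         # random horizontal flip
    out = out + rng.normal(0.0, 0.01, out.shape)     # small Gaussian noise
    return np.clip(out, 0.0, 1.0)

rng = np.random.default_rng(0)
augmented = [augment(img, rng) for img in images]    # 'images' assumed to exist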
Data Split

70% train, 15% val, 15% test.


80% train, 10% val, 10% test.
60% train, 20% val, 20% test.
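A minimal sketch of a 70/15/15 split, assuming scikit-learn is available and arrays X, y already exist:

from sklearn.model_selection import train_test_split

# First carve out 70% for training, then split the remaining 30% in half.
X_train, X_tmp, y_train, y_tmp = train_test_split(X, y, test_size=0.30, random_state=42)
X_val, X_test, y_val, y_test = train_test_split(X_tmp, y_tmp, test_size=0.50, random_state=42)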
Loss Function

The function we want to minimize or maximize is called the objective function or criterion. When we are minimizing it, we may also call it the cost function, loss function, or error function.
Performance Metrics

Accuracy in classification problems is the number of correct predictions made by the model divided by the total number of predictions made.
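A minimal sketch of computing accuracy from predictions, using NumPy with toy labels:

import numpy as np

y_true = np.array([1, 0, 1, 1, 0])
y_pred = np.array([1, 0, 0, 1, 0])

accuracy = np.mean(y_pred == y_true)   # correct predictions / all predictions
print(accuracy)                        # 0.8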
Babysitting
Loss Optimization Function
Learning Rate

(Figure: Loss vs. Learning Rate)

Ideal Curves
Early Stopping

(Figure: validation loss curve used for early stopping)
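A minimal sketch of early stopping on the validation loss, assuming TensorFlow/Keras is available, an already compiled model, and the split arrays from earlier:

from tensorflow.keras.callbacks import EarlyStopping

stop = EarlyStopping(monitor="val_loss",          # watch the validation curve
                     patience=5,                  # allow 5 epochs without improvement
                     restore_best_weights=True)   # roll back to the best epoch

model.fit(X_train, y_train,
          validation_data=(X_val, y_val),
          epochs=200,
          callbacks=[stop])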
How many Neurons?

(Diagram: inputs X, weights W, outputs Y)

Finding the weights amounts to solving a system of simultaneous equations.


How many Neurons?

Size of Training Data

There should be x independent examples for each parameter in the model, where x could be in the tens (e.g. 10).
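A minimal sketch of this rule of thumb (x = 10 examples per parameter) for a small dense network with hypothetical layer widths:

# Parameters of a dense network: weights + biases per layer.
layers = [4, 8, 3]   # hypothetical widths: 4 inputs, 8 hidden, 3 outputs
params = sum(n_in * n_out + n_out for n_in, n_out in zip(layers, layers[1:]))
print(params)        # (4*8 + 8) + (8*3 + 3) = 67 parameters
print(10 * params)   # rule of thumb: roughly 670 independent training examples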
How many Neurons?

Problem Complexity

How many Layers?
Big Network
ImageNet Challenge
Over-fitting
Under-fitting
Good-fit
Big Network with Dropout
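A minimal sketch of a big network with dropout, assuming TensorFlow/Keras is available; the widths and dropout rates are illustrative:

from tensorflow.keras import Sequential
from tensorflow.keras.layers import Dense, Dropout

model = Sequential([
    Dense(512, activation="relu", input_shape=(100,)),
    Dropout(0.5),                      # randomly drop 50% of activations while training
    Dense(512, activation="relu"),
    Dropout(0.5),
    Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])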
Regularization
W1 = 0, W2 = 0, W3 = 0, …

(Regularization penalizes large weights, pushing unneeded weights toward zero.)
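A minimal sketch of weight regularization, assuming TensorFlow/Keras; L1 tends to push unneeded weights to exactly zero, while L2 shrinks them toward zero:

from tensorflow.keras import regularizers
from tensorflow.keras.layers import Dense

layer_l1 = Dense(64, activation="relu", kernel_regularizer=regularizers.l1(0.01))
layer_l2 = Dense(64, activation="relu", kernel_regularizer=regularizers.l2(0.01))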
Which Activation Function?
Recall: Back Propagation

Forward pass: x → f1 → y1 → f2 → y2 → … → J(w), with weights w1 and w2.

$$\frac{\partial J(w)}{\partial w_1} = \frac{\partial J(w)}{\partial y_2}\cdot\frac{\partial y_2}{\partial y_1}\cdot\frac{\partial y_1}{\partial w_1}$$
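A minimal NumPy sketch of this chain rule for a two-step chain x → y1 → y2 → J, using sigmoid activations and a squared-error objective as illustrative choices:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x, w1, w2, target = 0.5, 0.3, -0.7, 1.0

y1 = sigmoid(w1 * x)            # f1
y2 = sigmoid(w2 * y1)           # f2
J  = 0.5 * (y2 - target) ** 2   # simple squared-error objective

dJ_dy2  = y2 - target
dy2_dy1 = y2 * (1 - y2) * w2
dy1_dw1 = y1 * (1 - y1) * x

dJ_dw1 = dJ_dy2 * dy2_dy1 * dy1_dw1   # dJ/dw1 = dJ/dy2 * dy2/dy1 * dy1/dw1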
Which Activation Function?

Data Scaling: -1 to +1 (matches the output range of tanh)

Data Scaling: 0 to 1 (matches the output range of sigmoid)
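A minimal NumPy sketch of why the data/target scaling should match the output activation's range (sigmoid outputs values in (0, 1), tanh in (-1, +1)):

import numpy as np

z = np.linspace(-5, 5, 11)

sig  = 1.0 / (1.0 + np.exp(-z))   # values lie in (0, 1)  -> scale targets to 0..1
tanh = np.tanh(z)                 # values lie in (-1, 1) -> scale targets to -1..+1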
Weight Initialization?

In general practice, biases are initialized to 0 and weights are initialized with small numbers drawn randomly from a Gaussian or uniform distribution, e.g. in the range [0, 1], [-1, 1], or [-0.3, 0.3].
Weight Initialization?

Sigmoid / Tanh

In the Xavier technique, weights are initialized with small numbers drawn randomly from a uniform probability distribution in the range -(1/sqrt(n)) to 1/sqrt(n), where n is the number of inputs to the neuron.
Weight Initialization?

ReLU

In the He Normal technique, weights are initialized with small numbers drawn randomly from a Gaussian probability distribution with a mean of 0.0 and a standard deviation of sqrt(2/n), where n is the number of inputs to the neuron.
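A minimal NumPy sketch of the Xavier (uniform) and He (normal) schemes described above; n is the number of inputs feeding each neuron, and the layer sizes are illustrative:

import numpy as np

rng = np.random.default_rng(0)
n, m = 100, 64                                           # n inputs feeding m neurons

limit = 1.0 / np.sqrt(n)
w_xavier = rng.uniform(-limit, limit, size=(n, m))       # for sigmoid / tanh

w_he = rng.normal(0.0, np.sqrt(2.0 / n), size=(n, m))    # for ReLU

b = np.zeros(m)                                          # biases start at 0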
Weight Initialization?

Each time a neural network is initialized with a different set of weights, it starts from a different point and may converge to a different final set of weights with different performance characteristics.
Which Loss Function?

Pred = 0.8, Actual = 1


Which Loss Function?
$$J(w) = -\frac{1}{n}\sum_{i}\Big[\, y_i \log \hat{y}_i + (1 - y_i)\log(1 - \hat{y}_i) \Big]$$

Pred = 0.8, Actual = 1

Binary Cross Entropy will calculate a score that summarizes the average difference between the actual and predicted probability distributions for predicting class 1. The score is minimized, and a perfect cross-entropy value is 0.
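A minimal NumPy sketch of this formula for the single example above (Pred = 0.8, Actual = 1):

import numpy as np

y_true = np.array([1.0])
y_pred = np.array([0.8])

bce = -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))
print(bce)   # ~0.223; a perfect prediction would give 0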
Which Loss Function?

Multi-class Cross Entropy will calculate a score that summarizes the average difference between the actual and predicted probability distributions over all classes in the problem. The score is minimized, and a perfect cross-entropy value is 0.
Which Loss Optimization Function?
Pred = 0.8, Actual = 1 → J(W)
Batch

Pred = 0.5, Actual = 1 → J(W)
…
Use the average of J(W) over all training examples.
Mini-Batch

Pred = 0.6, Actual = 1 → J(W)
Pred = 0.7, Actual = 1 → J(W)
…
Use the average of J(W) over a small subset (mini-batch) of training examples.
Larger Batch Size

Pred = 0.6, Actual = 1 → J(W)
…
Use the average of J(W) over a larger subset of training examples.
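A minimal NumPy sketch of mini-batch averaging: each update uses the average loss over a small batch rather than a single example or the full training set (the predictions here are a stand-in for a real model):

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 5))
y = rng.integers(0, 2, size=1000)

batch_size = 32
idx = rng.permutation(len(X))

for start in range(0, len(X), batch_size):
    batch = idx[start:start + batch_size]
    X_b, y_b = X[batch], y[batch]
    preds = np.full(len(batch), 0.5)        # stand-in model predictions
    loss_b = np.mean((preds - y_b) ** 2)    # average J(W) over this mini-batch
    # a gradient step based on loss_b would go here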
Grid Search
Which hyperparameters can be grid searched? Enumerate grids of standard hyperparameter values to find good configurations, then repeat the process with finer and finer grids (a minimal sketch follows the list below).

1. Activation Functions
2. Network Topology
3. Batches and Epochs
4. Dropout
5. Optimization and Loss
6. Early Stopping
…
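A minimal sketch of enumerating a coarse hyperparameter grid, assuming scikit-learn's ParameterGrid and a hypothetical train_and_evaluate helper that returns a validation score:

from sklearn.model_selection import ParameterGrid

grid = ParameterGrid({
    "activation": ["relu", "tanh"],
    "batch_size": [32, 128],
    "dropout":    [0.0, 0.5],
    "epochs":     [20, 50],
})

best_score, best_params = -1.0, None
for params in grid:
    score = train_and_evaluate(**params)   # hypothetical helper (build, train, validate)
    if score > best_score:
        best_score, best_params = score, params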
Transfer Learning
Transfer learning generally refers to a process where a model trained on one
problem is used in some way on a second related problem.
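A minimal sketch of transfer learning, assuming TensorFlow/Keras: reuse a network pre-trained on ImageNet, freeze its weights, and train a new head for the second, related problem (the 10-class target is hypothetical):

from tensorflow.keras import Sequential
from tensorflow.keras.applications import VGG16
from tensorflow.keras.layers import Dense, Flatten

base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False                      # keep the pre-trained weights fixed

model = Sequential([
    base,
    Flatten(),
    Dense(256, activation="relu"),
    Dense(10, activation="softmax"),        # hypothetical 10-class target problem
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy", metrics=["accuracy"])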
