
Computational Complexity

Chittagong University of Engineering & Technology

Submitted To:
Dr. Kaushik Deb
Professor
Dept. of CSE, CUET

Submitted By:
Md. Al-Mamun Provath
22MCSE004
Contents
 Model Parameters

 Model Size

 FLOPs

 FLOPS

 MACs

 Inference Time



Model Parameters
 Model parameters
 configuration settings that determine the model's behavior and predictive capabilities
 are learned from the training data
 are adjusted during training to minimize the loss function

 Learnable Parameters
 the weights and biases of the model

 Model parameters influence the model's ability to generalize and make accurate predictions
 When the parameters are set optimally, the model fits the training data well and generalizes effectively to unseen data
 If the parameters are poorly chosen, the model may overfit or underfit



Model Parameters
 The ideal number of parameters depends on several factors:

 Data availability
 The more data available, the more complex a model can be used
 Insufficient data combined with a complex model leads to overfitting

 Model Complexity
 Simple problems can be addressed by less complex models
 Complex problems require a large number of parameters

 Computational Resources
 Training models with a large number of parameters is computationally expensive
 With limited computational resources, smaller models must be used



Calculating Model Parameters
 Feed Forward Neural Network (Dense Layer )
For one hidden layer,

#Parameters = connections between layers + biases in every layer

Example: a network with 3 inputs, 5 hidden units, and 2 outputs (see the sketch below)

Between the input layer and the hidden layer:

3 × 5 + 5 = 20

Between the hidden layer and the output layer:

5 × 2 + 2 = 12

Total: 20 + 12 = 32 parameters
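A minimal Python sketch of this count; the 3-5-2 layer sizes are taken from the example above, and the helper name is just illustrative:

def dense_params(layer_sizes):
    """Count weights + biases for a fully connected network."""
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out + n_out  # weights + one bias per output unit
    return total

print(dense_params([3, 5, 2]))  # (3*5 + 5) + (5*2 + 2) = 32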



Calculating Model Parameters
 CNNs

#Parameters = (Filter height × Filter width × Input channels + 1) × Number of filters

Example: an RGB image (3 input channels) with a 2×2 filter and a 1-channel output:

(2 × 2 × 3 + 1) × 1 = 13 parameters (see the sketch below)
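A small sketch of the same count, assuming a single 2×2 filter over a 3-channel input as in the example:

def conv_params(kernel_h, kernel_w, in_channels, num_filters):
    """Weights plus one bias per filter for a standard convolutional layer."""
    return (kernel_h * kernel_w * in_channels + 1) * num_filters

print(conv_params(2, 2, 3, 1))  # (2*2*3 + 1) * 1 = 13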





Model Size
 Model size measures the storage for the weights of the given neural network
 The common units for model size are: MB (megabyte), KB (kilobyte), bits.
 In general, if the whole neural network uses the same data type (e.g., floating-point),

Model Size = #Parameters × Bit Width

 Example: AlexNet has 61M parameters

 If all weights are stored as 32-bit numbers, total storage will be about
61M × 4 Bytes (32 bits) ≈ 244 MB (244 × 10⁶ Bytes)

 If all weights are stored as 8-bit numbers, total storage will be about
61M × 1 Byte (8 bits) ≈ 61 MB
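The same arithmetic as a quick sketch, using the AlexNet figures above:

def model_size_mb(num_params, bit_width):
    """Model storage in megabytes when every weight uses the same bit width."""
    return num_params * bit_width / 8 / 1e6

print(model_size_mb(61e6, 32))  # ~244 MB with 32-bit weights
print(model_size_mb(61e6, 8))   # ~61 MB with 8-bit weights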



FLOPs, FLOPS, MACs
 FLOPs
 Floating Point Operations
 the total number of computations the model will have to perform
 addition, subtraction, division, multiplication, or any other operation
that involves a floating point value
 FLOPS
 Floating Point Operations per Second
 tells us how fast the hardware is
 The more operations per second we can do, the faster the inference will be
 MACs
 Multiply-Accumulate Computations
 A MAC is an operation that does an addition and a multiplication,
so 2 operations
 Generally, 1 MAC ≈ 2 FLOPs



FLOPs, FLOPS, MACs
General idea

 We want a low number of FLOPs in our model, while keeping it complex enough to remain accurate

 We want a high number of FLOPS in our hardware

 Our role will be to optimize Deep Learning models to have a low number of FLOPs



Calculating FLOPs
Let's take the following model that performs classification on the MNIST dataset

 The Input Image is of size 28x28x1 (grayscale)

 We run 2 Convolutions of 5 kernels of size (3x3)

 We run a Fully Connected Layer of 128 Neurons

 We finish with a Fully Connected Layer of 10 Neurons: 1 per digit.
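A rough sketch of how these FLOPs can be tallied, assuming stride-1 convolutions without padding and counting 1 MAC as 2 FLOPs; padding and bias conventions vary, so the exact total may differ slightly from the figure used on the inference-time slide:

def conv_flops(k, c_in, c_out, h_out, w_out):
    """FLOPs for a standard convolution: 2 * MACs."""
    return 2 * k * k * c_in * c_out * h_out * w_out

def dense_flops(n_in, n_out):
    """FLOPs for a fully connected layer: 2 * MACs."""
    return 2 * n_in * n_out

total = (
    conv_flops(3, 1, 5, 26, 26)      # conv1: 28x28x1 -> 26x26x5
    + conv_flops(3, 5, 5, 24, 24)    # conv2: 26x26x5 -> 24x24x5
    + dense_flops(24 * 24 * 5, 128)  # flatten -> 128 neurons
    + dense_flops(128, 10)           # 128 -> 10 classes
)
print(total)  # ~1.06 million FLOPs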





Calculating MACs

For a convolutional layer:

MACs = k × k × (C_in / g) × C_out × h_o × w_o

Where, C = number of channels, k = kernel size, h_o = height of output,
w_o = width of output, g = number of groups

• Example, for a layer with 96 filters of size 11×11, 3 input channels, and a 55×55 output (g = 1):
MACs = 96 × 3 × 11 × 11 × 55 × 55 = 105,415,200
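A minimal helper that reproduces this count (the group count defaults to 1, matching the example; the function name is illustrative):

def conv_macs(k, c_in, c_out, h_out, w_out, groups=1):
    """Multiply-accumulate count for a (possibly grouped) k x k convolution."""
    return k * k * (c_in // groups) * c_out * h_out * w_out

print(conv_macs(11, 3, 96, 55, 55))  # 105,415,200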



Inference Time
 How long it takes to run one forward propagation

The inference time will be FLOPs/FLOPS

 Suppose, FLOPs = 1,060,400

 The CPU performs 1 GFLOPS (10⁹ FLOPS)

 Inference time = 1,060,400 / 1,000,000,000 ≈ 0.001 s, or about 1 ms
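The same division as a one-line sketch, with the hardware throughput assumed to be 1 GFLOPS as above:

flops = 1_060_400                # model cost from the example above
flops_per_second = 1e9           # assumed 1 GFLOPS CPU
print(flops / flops_per_second)  # ~0.00106 s, about 1 ms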



Model Optimization
 2 main ways to optimize a neural network:
1. Reducing model size
2. Reducing number of operations

 Reducing number of operations:

 Pooling
-subsampling layers

 Separable Convolutions
-split a standard convolution into a depthwise convolution (which doesn't change the depth) followed by a pointwise convolution, reducing the number of FLOPs
-a pointwise convolution is a 1x1 convolution (see the comparison sketch after this list)

 Model Pruning
-redundant network parameters are removed
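A small sketch comparing the MACs of a standard convolution with a depthwise-separable one; the layer shape (3x3 kernel, 32 input channels, 64 output channels, 56x56 output) is a hypothetical example, not taken from the slides:

def standard_conv_macs(k, c_in, c_out, h, w):
    return k * k * c_in * c_out * h * w

def separable_conv_macs(k, c_in, c_out, h, w):
    depthwise = k * k * c_in * h * w   # one k x k filter per input channel
    pointwise = c_in * c_out * h * w   # 1x1 convolution mixes the channels
    return depthwise + pointwise

std = standard_conv_macs(3, 32, 64, 56, 56)
sep = separable_conv_macs(3, 32, 64, 56, 56)
print(std, sep, round(std / sep, 1))  # the separable version is roughly 8x cheaper here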



Model Optimization

 Reducing model size:


 Quantization
 mapping values from a larger set to a smaller one
 quantization can be done on weights and on activations
 both reduce memory use and the complexity of computations (see the sketch after this list)

 Weight Sharing
 share weights between neurons
 so there are fewer of them to store
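A minimal sketch of symmetric linear quantization of weights to 8-bit integers; this is one common scheme, used here only as an illustration:

import numpy as np

def quantize_int8(weights):
    """Map float weights to int8 with a single scale factor."""
    scale = np.abs(weights).max() / 127.0
    q = np.round(weights / scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.random.randn(4, 4).astype(np.float32)
q, scale = quantize_int8(w)
print(np.abs(w - dequantize(q, scale)).max())  # small reconstruction error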



Model Optimization
 Knowledge Distillation
 try to transfer the knowledge learned by a large, accurate model (the teacher model) to
a smaller and computationally less expensive model (the student model)
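A minimal sketch of a typical distillation loss (softened teacher targets with a temperature T blended with the usual cross-entropy); PyTorch is assumed, and alpha and T are hypothetical hyperparameters, not values from the slides:

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    """Blend the soft-target KL term (teacher -> student) with hard-label cross-entropy."""
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradients stay comparable across temperatures
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard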



The End

Thank You

