Efficient Convolution Algorithms
Mr. Sivadasan E T
Associate Professor
Vidya Academy of Science and Technology, Thrissur
Naive convolution
Naive convolution refers to the straightforward, brute-force implementation of the convolution operation without optimizations.
When the kernel is separable, naive convolution is inefficient: the same result can be obtained by composing d one-dimensional convolutions, one with each of the vectors into which the kernel decomposes.
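As an illustration of the brute-force approach (not code from the original slides), here is a minimal NumPy sketch of naive 2D convolution with explicit nested loops; the function name naive_conv2d and the 'valid'-size output are assumptions made for this example.

import numpy as np

def naive_conv2d(image, kernel):
    """Brute-force 2D convolution with no padding ('valid' output)."""
    kh, kw = kernel.shape
    ih, iw = image.shape
    oh, ow = ih - kh + 1, iw - kw + 1
    # Flip the kernel so this is true convolution rather than cross-correlation.
    k = kernel[::-1, ::-1]
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            # Multiply the kernel against the overlapping image patch and sum.
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * k)
    return out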
Efficient Convolution Algorithms
Modern convolutional network applications often
involve networks containing more than one million
units.
It is also possible to speed up convolution by selecting an appropriate convolution algorithm.
Efficient Convolution Algorithms
Convolution is equivalent to converting both the input
and the kernel to the frequency domain using a Fourier
transform, performing point-wise multiplication
of the two signals, and converting back to the time
domain using an inverse Fourier transform.
For some problem sizes, this can be faster than the
naïve implementation of discrete convolution.
Fourier Transform
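The original slide's figure is not reproduced here. As a hedged sketch of the idea (the convolution theorem), the NumPy example below transforms a signal and a kernel, multiplies them point-wise in the frequency domain, inverts the transform, and checks the result against direct convolution; the signal and kernel names and sizes are arbitrary assumptions.

import numpy as np

# Hypothetical 1D signal and kernel; any real-valued arrays would do.
x = np.random.rand(64)   # input signal
k = np.random.rand(9)    # convolution kernel

# Zero-pad to the full linear-convolution length, transform with the FFT,
# multiply point-wise in the frequency domain, then invert the transform.
n = len(x) + len(k) - 1
fft_conv = np.fft.irfft(np.fft.rfft(x, n) * np.fft.rfft(k, n), n)

# Direct (naive) discrete convolution for comparison.
direct = np.convolve(x, k)

assert np.allclose(fft_conv, direct)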
d-dimensional Kernel
A kernel in convolution is a matrix (or tensor for
higher dimensions) used for feature extraction,
filtering, or pattern matching in image processing or
neural networks.
In a d-dimensional case, the kernel operates across all
d dimensions simultaneously.
Separable Kernel
A kernel is called separable if it can be decomposed
into the outer product of d one-dimensional vectors
(one for each dimension).
This decomposition allows the kernel to be represented
as:
K(x1, x2, ..., xd) = k1(x1) · k2(x2) · ... · kd(xd)
Separable Kernel
where each ki is a 1D vector applied along a specific dimension.
For example, in 2D:
K2D = k1 ⊗ k2 = k1 k2T
This means the 2D kernel can be expressed as the outer product of two 1D kernels.
Separable Kernel
Example: a 2D Gaussian blur kernel can be decomposed into the outer product of two 1D Gaussian vectors.
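The slide's original matrices are not shown here. As an illustrative sketch, assuming the standard 3×3 Gaussian-blur approximation, the 2D kernel factors as the outer product of the 1D vector [1, 2, 1]/4 with itself:

import numpy as np

# 1D Gaussian (binomial) blur vector.
k1 = np.array([1, 2, 1]) / 4.0

# The 2D kernel is the outer product of the 1D vector with itself:
# 1/16 * [[1, 2, 1],
#         [2, 4, 2],
#         [1, 2, 1]]
K2d = np.outer(k1, k1)
print(K2d)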
Separable Kernel
The composed approach is significantly faster than
performing one d-dimensional convolution with their
outer product.
The kernel also requires fewer parameters when represented as a set of vectors.
Naïve and Decomposed
• If the kernel is w elements wide in each dimension,
• then naive multidimensional convolution requires O(w^d) runtime and parameter storage space,
• while separable convolution requires O(w × d) runtime and parameter storage space (see the sketch below).
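As a hedged illustration of these savings (not code from the original slides), the SciPy-based sketch below applies the full w × w Gaussian kernel in one 2D convolution and then obtains the same result by composing two 1D convolutions; the image size, kernel, and choice of SciPy are assumptions.

import numpy as np
from scipy.signal import convolve2d

rng = np.random.default_rng(0)
image = rng.random((128, 128))        # hypothetical input image

k1 = np.array([1.0, 2.0, 1.0]) / 4.0  # 1D factor (w = 3 parameters)
K2d = np.outer(k1, k1)                # full kernel (w**2 = 9 parameters)

# Naive approach: one 2D convolution with the full w x w kernel.
full = convolve2d(image, K2d, mode="same")

# Separable approach: compose two 1D convolutions, one per dimension.
horizontal = convolve2d(image, k1.reshape(1, -1), mode="same")
separable = convolve2d(horizontal, k1.reshape(-1, 1), mode="same")

assert np.allclose(full, separable)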
Naïve and Decomposed
• Of course, not every convolution can be represented in this decomposed (separable) form.
• Devising faster ways of performing convolution or
approximate convolution without harming the
accuracy of the model is an active area of research.
Thank You!