Temporal Convolutional Network (TCN)
By,
Hira Khan
Supervised by,
Prof. Dr. Nadeem Javaid
Motivation
• Convolutional networks can achieve better performance than RNNs in
many tasks while avoiding common drawbacks of recurrent models,
such as the exploding/vanishing gradient problem and poor long-term
memory retention.
• Using a convolutional network instead of a recurrent one can lead to
performance improvements as it allows for parallel computation of
outputs.
Temporal Convolutional Network (TCN)
• The seminal work of Lea et al. (2016) first proposed Temporal
Convolutional Networks (TCNs) for video-based action segmentation.
• A Temporal Convolutional Network (TCN) is a type of neural network
architecture designed for sequence modeling tasks.
• TCNs leverage the power of convolutional operations to capture and
learn patterns within sequential data.
• It combines simplicity, autoregressive prediction, and very long memory.
• Applications: time series forecasting, natural language processing, and
more.
Working of TCN
• The TCN is designed from two basic principles:
• The architecture can take a sequence of any length and
map it to an output sequence of the same length, just as
with an RNN.
• The convolutions in the architecture are causal,
meaning that there is no information “leakage” from
future to past.
Working of TCN
• Input Sequence:
• The input to a TCN is sequential data.
• It could be a time series, a sentence in natural language,
or any other sequential data.
• For a TCN, the input and output sequences of the model
are of the same length.
Input vector (ri, ti, fi) → TCN → Output vector (ro, to, fo)
Where,
i = input vector
o = output vector
r = rows/batch size
t = timestamp/input length
f = features/input or output size
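As a rough illustration of this shape convention, here is a minimal sketch; the batch size, sequence length, and feature counts are made-up numbers, and the use of PyTorch tensors is an assumption for illustration only.

```python
import torch

# Shape convention from the slide: r = rows/batch size, t = timesteps, f = features.
r, t, f_in = 32, 100, 8           # e.g. 32 series, 100 timesteps, 8 features each
x = torch.randn(r, t, f_in)       # input tensor  (r_i, t_i, f_i)

# A TCN maps this to an output of the SAME length t; only the feature
# dimension may change (e.g. 1 predicted value per timestep).
f_out = 1
y_shape = (r, t, f_out)           # output tensor (r_o, t_o, f_o), with t_o == t_i
print(x.shape, y_shape)
```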
Working of TCN
• TCNs use a 1D fully convolutional network (FCN) to
process the input sequence.
• Standard 1D convolutional layer with a kernel size of 3.
Working of TCN
• 1D convolutional layer:
For the purposes of this forecasting model, the stride is always set to 1.
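A minimal PyTorch sketch of such a layer (channel counts and lengths are illustrative), showing that without padding a kernel of size 3 with stride 1 shortens the sequence:

```python
import torch
import torch.nn as nn

# Plain (unpadded) 1D convolution: kernel_size = 3, stride = 1.
conv = nn.Conv1d(in_channels=1, out_channels=1, kernel_size=3, stride=1)

x = torch.randn(1, 1, 10)         # (batch, channels, input_length = 10)
y = conv(x)
print(y.shape)                    # torch.Size([1, 1, 8]) -> 10 - 3 + 1 = 8, shorter than the input
```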
Working of TCN
• 1D convolutional layer with zero padding:
• To ensure the output tensor has the same length as the input tensor, zero padding is applied.
• Example: to get an output_length of 4 from an input_length of 4 with a kernel_size of 3, left zero-padding of 2 (i.e. kernel_size − 1) is applied, as sketched below.
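A small sketch of that padding arithmetic in PyTorch (tensor sizes mirror the 4-step example above; the library choice is an assumption):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

kernel_size = 3
conv = nn.Conv1d(1, 1, kernel_size)          # no built-in padding

x = torch.randn(1, 1, 4)                     # input_length = 4
x_padded = F.pad(x, (kernel_size - 1, 0))    # left zero-padding of 2, nothing on the right
y = conv(x_padded)                           # output_length = 4 + 2 - 3 + 1 = 4
print(y.shape)                               # torch.Size([1, 1, 4]) -> same length as the input
```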
Working of TCN
• Causal convolutional layer:
• Causal convolutional layers left-pad the input sequence with zeros, which preserves the causality of temporal dependencies during the convolution operation.
• The output at time T can only convolve with elements from time T and earlier in the previous layer.
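One way to check the "no leakage" property is to perturb a future input and confirm that earlier outputs do not change; a minimal sketch (the single-channel layer and sequence length are illustrative assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def causal_conv(x, conv, kernel_size):
    """Left-pad by kernel_size - 1, so output[t] depends only on input[<= t]."""
    return conv(F.pad(x, (kernel_size - 1, 0)))

torch.manual_seed(0)
conv = nn.Conv1d(1, 1, kernel_size=3)

x = torch.randn(1, 1, 10)
y1 = causal_conv(x, conv, 3)

x_future = x.clone()
x_future[..., 7:] += 100.0                   # change only timesteps 7..9 ("the future")
y2 = causal_conv(x_future, conv, 3)

# Outputs up to t = 6 are identical: no information leaks from future to past.
print(torch.allclose(y1[..., :7], y2[..., :7]))   # True
```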
Working of TCN
• Causal convolutional layer:
• Drawback:
• For long sequences, covering a large history requires either very large
filters or an extremely deep network (see the rough calculation below).
• Very deep stacks reintroduce the vanishing/exploding gradient problem.
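As a back-of-the-envelope check (the 1000-step history and kernel size of 3 are illustrative numbers, not from the slides), without dilation the number of stacked causal layers needed grows linearly with the history to be covered:

```python
# Receptive field of L stacked (non-dilated) causal convolutions with kernel size k:
#   receptive_field = L * (k - 1) + 1
# So covering `history` past steps needs roughly (history - 1) / (k - 1) layers.
k = 3
history = 1000                                   # illustrative: ~1000 past timesteps
layers_needed = -(-(history - 1) // (k - 1))     # ceiling division
print(layers_needed)                             # 500 layers -- hence the need for dilation
```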
Working of TCN
• Dilated causal convolutional layer:
• Dilated convolutions skip inputs with a spacing (dilation factor) that typically doubles at each layer (1, 2, 4, ...), so the receptive field grows exponentially with depth instead of requiring very large filters or extremely deep stacks.
• Pairs of dilated causal convolutions are wrapped in a residual block:
Y = Activation(x + f(x))
where x is the input, f(x) is the residual function (the stacked dilated convolutions), and Y is the output of the residual block.
• Receptive field = 1 + 2 · (k − 1) · s · (2^D − 1),
where D is the depth of dilated convolutional layers per residual block, k is the kernel size, and s is the number of residual blocks (the factor 2 reflects the two convolutional layers in each block).
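To make the block concrete, here is a minimal PyTorch sketch of a dilated causal residual block; the channel counts, ReLU activation, and four-block stack are illustrative assumptions, not taken from the slides:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DilatedCausalConv1d(nn.Module):
    """Causal convolution: left-pad by (kernel_size - 1) * dilation so no future leaks in."""
    def __init__(self, channels, kernel_size, dilation):
        super().__init__()
        self.pad = (kernel_size - 1) * dilation
        self.conv = nn.Conv1d(channels, channels, kernel_size, dilation=dilation)

    def forward(self, x):                       # x: (batch, channels, length)
        return self.conv(F.pad(x, (self.pad, 0)))

class ResidualBlock(nn.Module):
    """Two dilated causal convolutions plus a skip connection: Y = Activation(x + f(x))."""
    def __init__(self, channels, kernel_size, dilation):
        super().__init__()
        self.conv1 = DilatedCausalConv1d(channels, kernel_size, dilation)
        self.conv2 = DilatedCausalConv1d(channels, kernel_size, dilation)

    def forward(self, x):
        fx = self.conv2(F.relu(self.conv1(x)))  # residual function f(x)
        return F.relu(x + fx)                   # Y = Activation(x + f(x))

# Dilation doubles from block to block (1, 2, 4, 8); with kernel_size = 3 this stack
# sees 1 + 2*(3 - 1)*(1 + 2 + 4 + 8) = 61 past timesteps while being only 4 blocks deep.
tcn = nn.Sequential(*[ResidualBlock(8, kernel_size=3, dilation=2 ** d) for d in range(4)])
x = torch.randn(1, 8, 64)                       # (batch, channels, length)
print(tcn(x).shape)                             # torch.Size([1, 8, 64]) -- same length as input
```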
Working of TCN
• Output Layer:
• The final layer of the TCN produces the desired output
dimensions from the learned representations by passing
them through a series of dense layers.
• For regression tasks like time series forecasting, a linear
activation function might be used and the output could
be a single value or a sequence of predicted values.
• For classification tasks, a softmax or sigmoid activation
might be employed, depending on the number of classes and
the nature of the problem; the output might then be, for
example, classification labels in NLP tasks.
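A hedged sketch of the two kinds of output head described above; the layer sizes, the 64-channel TCN output, and the choice to classify from the last timestep are assumptions for illustration:

```python
import torch
import torch.nn as nn

tcn_features = torch.randn(32, 100, 64)      # illustrative TCN output: (batch, timesteps, channels)

# Regression head (e.g. time series forecasting): linear activation, one value per timestep.
regression_head = nn.Linear(64, 1)
forecast = regression_head(tcn_features)      # (32, 100, 1)

# Classification head (e.g. 5 classes): softmax over logits from the last timestep's features.
classification_head = nn.Linear(64, 5)
logits = classification_head(tcn_features[:, -1, :])
probs = torch.softmax(logits, dim=-1)         # (32, 5)

print(forecast.shape, probs.shape)
```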
Strengths and Drawback of TCN
• Strengths:
• TCNs are less computationally expensive, as they do not require the complex gating
mechanisms used in recurrent neural networks (RNNs).
• By using 1D convolutional layers, they can easily learn long-range dependencies, like
RNNs.
• They can take variable-length sequences and map them to output sequences of the
same length through the 1D FCN.
• They provide a large receptive field for learning relevant context that varies
significantly in scale (short- and long-range dependencies in the input sequence).
• Due to the use of residual blocks, they do not suffer from the vanishing gradient
problem that can affect RNNs, and they can capture more complex temporal dependencies.
• TCNs have been shown to be effective for a variety of time series tasks, such as
forecasting, classification, and segmentation.
• TCNs are capable of capturing different levels of abstraction by stacking multiple layers
of dilated convolutions.
Strengths and Drawback of TCN
• Drawbacks:
• For a large receptive field, TCNs can have high memory
requirements, especially when processing long sequences, because they
need to store the entire input sequence in memory.
• TCNs require more parameters to be tuned for a large receptive field, making them
computationally expensive.
• They require a large amount of training data to achieve good performance, which
can be a challenge for some time series tasks.
• TCNs can be difficult to interpret, which makes it hard to understand
how they arrive at their predictions.
References
[1] https://unit8.com/resources/temporal-convolutional-networks-and-forecasting/
[2] https://www.youtube.com/watch?v=TSGZBXILk14
[3] Oord, A. V. D., Dieleman, S., Zen, H., Simonyan, K., Vinyals, O., Graves, A., ...
& Kavukcuoglu, K. (2016). Wavenet: A generative model for raw audio. arXiv
preprint arXiv:1609.03499.
[4] Duc, T. N., Minh, C. T., Xuan, T. P., & Kamioka, E. (2020). Convolutional neural
networks for continuous QoE prediction in video streaming services. IEEE
Access, 8, 116268-116278.
[5] https://lukeguerdan.com/blog/2019/intro-to-tcns/
[6] Bai, S., Kolter, J. Z., & Koltun, V. (2018). An empirical evaluation of generic
convolutional and recurrent networks for sequence modeling. arXiv preprint
arXiv:1803.01271.