
Gated Recurrent Unit (GRU)

Gated Recurrent Unit (GRU) is a type of recurrent neural network (RNN) that
was introduced by Cho et al. in 2014 as a simpler alternative to Long Short-
Term Memory (LSTM) networks. Like LSTM, GRU can process sequential data
such as text, speech, and time-series data.
The basic idea behind GRU is to use gating mechanisms to selectively update
the hidden state of the network at each time step. The gating mechanisms
are used to control the flow of information in and out of the network. The
GRU has two gating mechanisms, called the reset gate and the update gate.
The reset gate determines how much of the previous hidden state should be
forgotten, while the update gate determines how much of the new input
should be used to update the hidden state. The output of the GRU is
calculated based on the updated hidden state.
The equations used to calculate the reset gate, update gate, and hidden state
of a GRU are as follows:
Reset gate: r_t = sigmoid(W_r * [h_{t-1}, x_t])
Update gate: z_t = sigmoid(W_z * [h_{t-1}, x_t])
Candidate hidden state: h_t' = tanh(W_h * [r_t * h_{t-1}, x_t])
Hidden state: h_t = (1 - z_t) * h_{t-1} + z_t * h_t'
where W_r, W_z, and W_h are learnable weight matrices, x_t is the input at
time step t, h_{t-1} is the previous hidden state, and h_t is the current hidden
state.
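As an illustration, a single GRU time step following these equations can be written in a few lines of NumPy. This is a minimal sketch, not a reference implementation: the weight shapes and the interpretation of [h_{t-1}, x_t] as vector concatenation are assumptions made for this example, and bias terms are omitted exactly as in the equations above.

import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def gru_step(x_t, h_prev, W_r, W_z, W_h):
    # One GRU time step following the equations above.
    # Each W has shape (hidden_size, hidden_size + input_size).
    concat = np.concatenate([h_prev, x_t])            # [h_{t-1}, x_t]
    r_t = sigmoid(W_r @ concat)                       # reset gate
    z_t = sigmoid(W_z @ concat)                       # update gate
    h_cand = np.tanh(W_h @ np.concatenate([r_t * h_prev, x_t]))  # candidate h_t'
    return (1 - z_t) * h_prev + z_t * h_cand          # new hidden state h_t

# Example: run a short random sequence through the cell (sizes are arbitrary).
hidden_size, input_size = 4, 3
rng = np.random.default_rng(0)
W_r, W_z, W_h = (0.1 * rng.standard_normal((hidden_size, hidden_size + input_size))
                 for _ in range(3))
h = np.zeros(hidden_size)
for x_t in rng.standard_normal((5, input_size)):
    h = gru_step(x_t, h, W_r, W_z, W_h)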

In summary, GRU networks are a type of RNN that use gating mechanisms to
selectively update the hidden state at each time step, allowing them to
effectively model sequential data. They have been shown to be effective in
various natural language processing tasks, such as language modeling,
machine translation, and speech recognition.

To solve the vanishing and exploding gradient problems often encountered during the operation of a basic Recurrent Neural Network, many variations were developed. One of the most famous variations is the Long Short-Term Memory network (LSTM). One of the lesser-known but equally effective variations is the Gated Recurrent Unit network (GRU).

Unlike the LSTM, the GRU uses only two true gates together with a candidate (current memory) state, and it does not maintain an internal cell state. The information that an LSTM recurrent unit stores in its internal cell state is instead incorporated into the hidden state of the Gated Recurrent Unit, and this collective information is passed on to the next Gated Recurrent Unit. The different components of a GRU are described below:

1. Update Gate (z): It determines how much of the past knowledge is carried forward into the future and how much of the candidate state is used to update the hidden state. Its role is comparable to the combined effect of the input and forget gates in an LSTM recurrent unit (a small numeric illustration follows this list).
2. Reset Gate (r): It determines how much of the past knowledge to forget when forming the candidate (current memory) state.
3. Current Memory Gate (the candidate hidden state h_t'): It is often overlooked in a typical discussion of the Gated Recurrent Unit network. It is driven by the reset gate, much as the input modulation gate is a sub-part of the input gate in an LSTM, and it introduces non-linearity into the input while also making it zero-mean through the tanh activation. Tying it to the reset gate also reduces the effect that previous information has on the information being passed into the future.
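As a small numeric illustration of these roles (the values below are arbitrary and chosen only for this example), the following sketch uses the hidden-state equation given earlier to show how the two gates behave at their extremes:

import numpy as np

h_prev = np.array([0.5, -0.3])                 # previous hidden state h_{t-1}
h_cand = np.array([0.9,  0.1])                 # candidate (current memory) state h_t'

# Update gate at its extremes: z = 0 keeps the past, z = 1 adopts the candidate.
for z in (np.zeros(2), np.ones(2)):
    h_t = (1 - z) * h_prev + z * h_cand
    print("z =", z, "-> h_t =", h_t)

# Reset gate at 0: the candidate is computed as if the past were forgotten,
# since r_t * h_{t-1} becomes the zero vector inside tanh(W_h * [r_t * h_{t-1}, x_t]).
r = np.zeros(2)
print("r * h_prev =", r * h_prev)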
The basic workflow of a Gated Recurrent Unit network is similar to that of a basic Recurrent Neural Network when illustrated; the main difference between the two lies in the internal working of each recurrent unit, since Gated Recurrent Unit networks consist of gates that modulate the current input and the previous hidden state.
Working of a Gated Recurrent Unit:
• Take the current input and the previous hidden state as input vectors.
• Calculate the values of the update and reset gates by following the steps given below:
1. For each gate, compute the parameterized current input and previous hidden state vectors by multiplying the concerned vectors with the respective weight matrix for that gate.
2. Apply the respective activation function element-wise to the parameterized vectors. The activation function used for each gate is listed below:
• Update Gate: sigmoid function
• Reset Gate: sigmoid function

• The process of calculating the Current Memory Gate (candidate hidden state) is a little different. First, the Hadamard product of the reset gate and the previous hidden state vector is calculated. This vector is then parameterized (multiplied by its weight matrix) and added to the parameterized current input vector, and the tanh activation is applied to the result.
• To calculate the current hidden state, first define a vector of ones with the same dimensions as the hidden state; this vector will be called ones and is denoted mathematically by 1. First, subtract the update gate from ones and calculate the Hadamard product of the resulting vector with the previous hidden state vector. Then calculate the Hadamard product of the update gate with the current memory gate. Finally, add the two vectors to obtain the current hidden state vector, matching the equation h_t = (1 - z_t) * h_{t-1} + z_t * h_t' given earlier.

• The working stated above is illustrated in the standard diagram of a GRU recurrent unit. In that diagram, the blue circles denote element-wise multiplication, the positive sign in a circle denotes vector addition, and the negative sign denotes vector subtraction (vector addition with a negated value). The weight matrix W contains different weights for the current input vector and the previous hidden state for each gate.

Just like Recurrent Neural Networks, a GRU network also generates an output at each time step, and this output is used to train the network using gradient descent.
Note that, just like the workflow, the training process for a GRU network is diagrammatically similar to that of a basic Recurrent Neural Network and differs only in the internal working of each recurrent unit.
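As a concrete illustration of training with gradient descent, the sketch below uses an off-the-shelf GRU implementation, PyTorch's nn.GRU, rather than the hand-written equations above. The task (predicting the next value of a toy sine wave), the network sizes, and the hyperparameters are all assumptions made purely for this example.

import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy data: predict the next value of a noisy sine wave (assumed task for illustration).
t = torch.linspace(0, 20, steps=201)
series = torch.sin(t) + 0.1 * torch.randn_like(t)
x = series[:-1].reshape(1, -1, 1)   # (batch, seq_len, input_size)
y = series[1:].reshape(1, -1, 1)    # target: the same sequence shifted by one step

gru = nn.GRU(input_size=1, hidden_size=16, batch_first=True)
head = nn.Linear(16, 1)             # map each hidden state to an output
params = list(gru.parameters()) + list(head.parameters())
optimizer = torch.optim.SGD(params, lr=0.05)
loss_fn = nn.MSELoss()

for epoch in range(200):
    optimizer.zero_grad()
    hidden_seq, _ = gru(x)          # hidden state (and hence output) at every time step
    pred = head(hidden_seq)
    loss = loss_fn(pred, y)
    loss.backward()                 # backpropagation through time
    optimizer.step()                # gradient-descent update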
