
Deep Learning

Course Code: WID 3011

By

Amarachukwu Felix E. (PhD)


Email: [email protected]

Introduction to Deep Learning
Topics and Activities
Week 1: Introduction to Deep Learning
Week 2: Vanilla Neural Networks I
Week 3: Vanilla Neural Networks II
Week 4: Multi-layer Perceptron I
Week 5: Multi-layer Perceptron II
Week 6: Convolutional Neural Networks I
Week 7: Convolutional Neural Networks II
Week 8: Recurrent Neural Network I
Week 9: Recurrent Neural Network II
Week 10: Generative Adversarial Network I
Week 11: Generative Adversarial Network II
Week 12: Deep Reinforcement Learning
Week 13: Good Practice in Deep Learning
Week 14: Deep Learning Project Presentation
Introduction to Deep Learning

Learning Outcomes
1. What is Deep Learning?
2. Why is Deep Learning important?
3. Differentiate between a simple Neural Network and a Deep Neural Network
4. Deep Learning techniques
- CNN
- RNN
- Transformer architecture
5. Conclusion
Introduction to Deep Learning
Key elements where the magic of deep learning takes place:

 Neural Network Architecture

The architecture of deep neural networks, characterized by multiple layers of interconnected neurons, allows for the modeling of intricate hierarchical patterns. Deeper layers capture increasingly abstract and complex representations.

 Activation Functions

These introduce non-linearities into the network. Without them, no matter how deep the
network, it would only be able to model linear relationships. Activation functions enable the
network to capture non-linear patterns and relationships in the data.
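
A minimal sketch of this point (in Python with NumPy, which the course does not prescribe): stacking two linear layers without an activation collapses into a single linear map, while inserting a ReLU between them does not.

import numpy as np

def relu(z):
    # ReLU: max(0, z) element-wise; this is the non-linearity.
    return np.maximum(0.0, z)

x = np.array([1.0, -2.0])
W1 = np.array([[0.5, -1.0], [1.5, 0.3]])
W2 = np.array([[1.0, -0.5]])

# Without an activation, two linear layers are equivalent to one: (W2 @ W1) @ x.
linear_stack = W2 @ (W1 @ x)
# With ReLU in between, the composition can no longer be written as a single matrix.
nonlinear_stack = W2 @ relu(W1 @ x)
print(linear_stack, nonlinear_stack)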
Introduction to Deep Learning
Key elements where the magic of deep learning takes place:

 Backpropagation

This is the cornerstone algorithm for training deep networks. By calculating the gradient of the
loss with respect to each weight and iteratively adjusting these weights, networks learn to make
better predictions over time.
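
As a toy illustration (all values made up), here is the backpropagation idea reduced to a single weight: compute the gradient of a squared-error loss with respect to the weight via the chain rule, then step downhill.

w = 0.0                      # initial weight
x, y_true = 1.5, 3.0         # one training example
lr = 0.1                     # learning rate

for step in range(50):
    y_pred = w * x                   # forward pass
    loss = (y_pred - y_true) ** 2    # squared-error loss
    grad = 2 * (y_pred - y_true) * x # dLoss/dw via the chain rule
    w -= lr * grad                   # gradient-descent update
print(w)                             # approaches y_true / x = 2.0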

 Large Datasets

One of the significant reasons deep learning models have become so powerful is the availability of
massive datasets. These datasets provide the necessary examples for networks to learn intricate
patterns.
Introduction to Deep Learning
Key elements where the magic of deep learning takes place:

 Computational Power

The rise of GPU computing has played a pivotal role in the current success of deep learning. GPUs can perform parallel computations efficiently, making it feasible to train large and deep networks in a reasonable amount of time.

 Regularization Techniques

Techniques like dropout, weight decay, and batch normalization prevent overfitting and help
models generalize better, especially when the networks are deep.
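
A minimal sketch, assuming PyTorch (the course does not fix a framework), of where two of these techniques plug in:

import torch.nn as nn
import torch.optim as optim

model = nn.Sequential(
    nn.Linear(784, 256),
    nn.ReLU(),
    nn.Dropout(p=0.5),        # dropout: randomly zeroes half the activations during training
    nn.Linear(256, 10),
)
# weight_decay adds an L2 penalty on the weights (weight decay) to the loss.
optimizer = optim.SGD(model.parameters(), lr=0.01, weight_decay=1e-4)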
Introduction to Deep Learning
Key elements where the magic of deep learning takes place:

 Transfer Learning

This involves using pre-trained models on large datasets and fine-tuning them for specific tasks. It
allows for achieving high accuracy even with smaller datasets by leveraging knowledge captured in
previously trained networks.

(For instance, one of the most common uses of transfer learning is in image classification tasks
using pre-trained neural networks. These networks have been trained on large datasets like
ImageNet and have learned useful features from these datasets. When we employ transfer
learning, we can leverage these learned features without starting the training process from
scratch.)
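
A hedged sketch of that workflow, assuming a recent PyTorch/torchvision and a hypothetical 5-class target task: load an ImageNet-pre-trained ResNet-18, freeze its feature extractor, and fine-tune only a new classifier head.

import torch.nn as nn
from torchvision import models

# Load a network pre-trained on ImageNet.
net = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Freeze the pre-trained feature extractor.
for param in net.parameters():
    param.requires_grad = False

# Replace the final classifier with one sized for our task (say, 5 classes);
# only this new layer is then trained on the smaller dataset.
net.fc = nn.Linear(net.fc.in_features, 5)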
Introduction to Deep Learning
Key elements where the magic of deep learning takes place:

 Representation Learning

Unlike traditional methods that rely heavily on feature engineering, deep learning models learn
representations from the data automatically. This self-learned feature extraction often surpasses
manually designed features.

(For instance, deep learning, especially with Convolutional Neural Networks (CNNs), is a classic example of representation learning. In image classification tasks using CNNs, the initial layers learn low-level features (like edges and textures), and as you go deeper into the network, the layers capture higher-level, abstract features specific to the classes in the dataset.)
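
One way to see this hierarchy (a sketch assuming PyTorch/torchvision; the hook labels are ours) is to record the shapes of the learned representations at an early and a late layer of a pre-trained ResNet-18:

import torch
from torchvision import models

net = models.resnet18(weights=models.ResNet18_Weights.DEFAULT).eval()
captured = {}

def save(name):
    def hook(module, inputs, output):
        captured[name] = output.shape  # record the shape of the representation
    return hook

net.layer1.register_forward_hook(save("early"))   # low-level features (edges, textures)
net.layer4.register_forward_hook(save("deep"))    # high-level, abstract features

with torch.no_grad():
    net(torch.randn(1, 3, 224, 224))              # a dummy image-sized input
print(captured)   # e.g. early: (1, 64, 56, 56), deep: (1, 512, 7, 7)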
Introduction to Deep Learning
Key elements where the magic of deep learning takes place:

 End-to-End Learning

Deep learning models can learn to map raw input data to desired outputs without the need for
explicit intermediate steps or processing, further simplifying model design and improving
performance.

(For instance, one of the most famous examples of end-to-end learning is the application of deep learning to speech recognition. Traditional speech recognition systems involve multiple steps: feature extraction (like MFCC, Mel-Frequency Cepstral Coefficients), acoustic modeling, and language modeling. In an end-to-end system, you feed the raw audio waveform (or minimally preprocessed input such as spectrograms) directly to the neural network, which then produces transcriptions without the traditional intermediate steps.)
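
A toy sketch of such a pipeline, assuming PyTorch (all sizes here are illustrative, not from a real system): spectrogram frames feed an LSTM whose per-frame character scores are trained with CTC loss, with no separate acoustic or language model.

import torch
import torch.nn as nn

T, N, n_mels, n_chars = 100, 1, 80, 29   # frames, batch, mel bins, characters (incl. blank)

encoder = nn.LSTM(input_size=n_mels, hidden_size=128)   # maps frames to hidden states
classifier = nn.Linear(128, n_chars)                    # per-frame character scores
ctc = nn.CTCLoss(blank=0)

spectrogram = torch.randn(T, N, n_mels)          # stand-in for preprocessed audio
hidden, _ = encoder(spectrogram)
log_probs = classifier(hidden).log_softmax(dim=-1)

target = torch.randint(1, n_chars, (N, 12))      # stand-in transcription (12 characters)
loss = ctc(log_probs, target,
           input_lengths=torch.full((N,), T),
           target_lengths=torch.full((N,), 12))
loss.backward()                                  # gradients flow through the whole pipeline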
Introduction to Deep Learning
Key elements where the magic of deep learning takes place:

 Innovative Architectures

Beyond standard feedforward networks, architectures like Convolutional Neural Networks (CNNs)
for image tasks and Recurrent Neural Networks (RNNs) for sequential data have paved the way for
state-of-the-art results in various domains.
Introduction to Deep Learning

NNs contain two phases:

1. Forward Propagation
2. Back Propagation

 During the training of a NN you perform both of these phases. When you would like the NN to make a prediction for an unlabeled example, you simply perform forward propagation.

 Back propagation is really the magic that allows NNs to perform so well on tasks that are traditionally very difficult for computers to carry out.

Imagine that we are tasked to write a computer program that can identify handwritten digits. Each image we receive will be 28 x 28 pixels, and we will also have access to the correct label for that image. The first thing we do is set up our neural network, as sketched below.
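
A minimal NumPy sketch of both phases on this digit setup (the hidden size, sigmoid activation, quadratic cost, and learning rate are illustrative choices, not mandated by the slide):

import numpy as np

rng = np.random.default_rng(0)

# A 28 x 28 image flattened to 784 inputs, one hidden layer, 10 digit classes.
W1 = rng.normal(0, 0.01, (30, 784)); b1 = np.zeros(30)
W2 = rng.normal(0, 0.01, (10, 30));  b2 = np.zeros(10)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = rng.random(784)               # stand-in for one digit image
y = np.zeros(10); y[3] = 1.0      # stand-in label: the digit "3"

# Forward propagation.
a1 = sigmoid(W1 @ x + b1)
a2 = sigmoid(W2 @ a1 + b2)        # the network's prediction

# Back propagation (quadratic cost), applying the chain rule layer by layer.
delta2 = (a2 - y) * a2 * (1 - a2)
delta1 = (W2.T @ delta2) * a1 * (1 - a1)

lr = 3.0                          # learning rate
W2 -= lr * np.outer(delta2, a1); b2 -= lr * delta2
W1 -= lr * np.outer(delta1, x);  b1 -= lr * delta1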
Introduction to Deep Learning

Connection between fields

-AI is a branch of computer science that aims to create machines/systems that have the ability to imitate intelligent human behavior, e.g. problem-solving, understanding natural language, recognizing patterns, and making decisions.

-ML is a subset of AI that focuses on the development of algorithms and statistical models. It relies on patterns and inference, specifically concentrating on the use of data and algorithms to simulate the learning process.

-DL is a subset of ML that focuses on algorithms inspired by the structure and function of the brain, called artificial neural networks. While neural networks have been around for a while, the term "deep" refers to the number of layers in the network. Modern networks can be deep, meaning they can have tens or even hundreds of layers.
Deep Learning techniques

-Convolutional Neural Networks (CNNs) for image tasks.


•Key Components:
• Convolutional Layers: Extract features by sliding a small window (or filter) over the
input data.
• Pooling Layers: Reduce the spatial size, making the network faster and more robust.
• Fully Connected Layers: Classify the extracted features into various classes.
•Applications: Image classification, object detection, facial recognition, etc.

•Purpose: Primarily designed for


image processing.

•Specialty: They automatically and


adaptively learn spatial hierarchies
of features from input images.

Source: Convolutional Neural Network: An Overview (analyticsvidhya.com)
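
Putting the three key components together, a minimal PyTorch sketch (the framework and layer sizes are our assumptions) for 28 x 28 grayscale images:

import torch
import torch.nn as nn

class SmallCNN(nn.Module):
    def __init__(self, n_classes=10):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1),  # convolutional layer: slides 3x3 filters
            nn.ReLU(),
            nn.MaxPool2d(2),                             # pooling layer: halves spatial size
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
        )
        self.classifier = nn.Linear(32 * 7 * 7, n_classes)  # fully connected layer

    def forward(self, x):
        x = self.features(x)
        return self.classifier(x.flatten(1))

logits = SmallCNN()(torch.randn(1, 1, 28, 28))   # one 28x28 grayscale image
print(logits.shape)                              # torch.Size([1, 10])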


The Deep Dive in Deep Learning

-Recurrent Neural Networks (RNNs) for sequential data


Purpose: Designed to handle sequential data.
Specialty: They have a "memory" which captures information about previous steps in the sequence. This makes
them suitable for tasks where context from earlier in the sequence is needed to understand the current input.

Challenge: They can struggle with long sequences due to the vanishing (or exploding) gradient problem.

To overcome this challenge:


•Types
• LSTM (Long Short-Term Memory): A type of RNN designed to remember long sequences without losing
track of the context.
• GRU (Gated Recurrent Units): A simplified version of LSTM with fewer parameters.

They incorporate mechanisms called gates to control the flow of information, making them more capable of
learning from long-term dependencies.

•Applications: Time series forecasting, machine translation, speech recognition, etc.
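
A minimal LSTM sketch for the time series forecasting case, assuming PyTorch (sequence length and feature counts are illustrative):

import torch
import torch.nn as nn

# A sequence of 24 time steps, each with 4 features (e.g. a multivariate time series).
seq = torch.randn(1, 24, 4)                       # (batch, steps, features)

# The gating mechanisms live inside nn.LSTM; it returns per-step hidden states
# plus the final hidden and cell states.
lstm = nn.LSTM(input_size=4, hidden_size=32, batch_first=True)
out, (h, c) = lstm(seq)

head = nn.Linear(32, 1)                           # forecast the next value
prediction = head(out[:, -1, :])                  # read off the last step's hidden state
print(prediction.shape)                           # torch.Size([1, 1])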


Transformer Architectures

•Impact: Revolutionized Natural Language Processing (NLP) tasks.

•Specialty: Unlike RNNs, transformers can pay selective attention to different parts of the input data, allowing
them to handle long sequences more effectively.
•Key Components:
• Attention Mechanisms: Allows the model to focus on different parts of the input for different tasks. The
self-attention mechanism lets it consider other words in the input sentence when encoding a particular
word.
• Positional Encodings: Since transformers do not process data in order (like RNNs), they need
information about the position of words in a sequence.

•Popular Models:
• BERT (Bidirectional Encoder Representations from Transformers): Designed to understand the
context of a word in search queries or other text by looking at the words before and after it.
• GPT (Generative Pre-trained Transformer): A model for generating human-like text by predicting the
next word in a sequence.

•Applications: Text classification, machine translation, question-answering systems, and even chatbots.
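
At the heart of these models is scaled dot-product self-attention. A minimal sketch, assuming PyTorch (the projection matrices here are random stand-ins for learned weights):

import math
import torch

def self_attention(X, Wq, Wk, Wv):
    # Every token attends to every token: scores = Q K^T / sqrt(d),
    # then a softmax-weighted sum of the values V.
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    d = Q.shape[-1]
    weights = torch.softmax(Q @ K.transpose(-2, -1) / math.sqrt(d), dim=-1)
    return weights @ V

tokens = torch.randn(5, 16)          # 5 "words", each a 16-dim embedding
Wq = torch.randn(16, 16); Wk = torch.randn(16, 16); Wv = torch.randn(16, 16)
print(self_attention(tokens, Wq, Wk, Wv).shape)   # torch.Size([5, 16])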
Conclusion

 Deep learning, a subset of machine learning, employs multi-layered neural networks to analyze various types of data.

 Offers unprecedented accuracy and automation capabilities.

 NNs contain two phases: Forward Propagation and Back Propagation.

 Its architectures, particularly Convolutional Neural Networks (CNNs) for image processing and Recurrent Neural
Networks (RNNs) for sequential data, have revolutionized fields ranging from computer vision to natural language
processing.

 Leveraging vast datasets and computational power, deep learning techniques have enabled breakthroughs in
applications such as image and speech recognition, medical diagnosis, financial forecasting, and autonomous vehicles,
solidifying its transformative role in today's technology landscape.