
EC 9170
Deep Learning for Electrical & Computer Engineers

Lecture 01: Deep feedforward networks
26th February 2024

Faculty of Engineering, University of Jaffna
• Are Artificial Intelligence, Machine Learning, and Deep Learning the same thing?
• Artificial Intelligence: the ability of a machine to imitate human intelligence
• Machine Learning: algorithms that incorporate intelligence into a machine by automatically learning from data
• Deep Learning: algorithms that mimic the human brain to incorporate intelligence into a machine
Artificial Intelligence
• AI is any technique, code or algorithm that enables machines to develop, demonstrate and mimic human cognitive behaviour or intelligence, hence the name "Artificial Intelligence."
• AI doesn't mean that machines will do everything. Rather, AI is better represented as "Augmented Intelligence", i.e. Man + Machine, to solve business problems better and faster.
• AI won't replace managers, but managers who use AI will replace those who don't.
• Some of the most successful applications of AI around us can be seen in Robotics, Computer Vision, Virtual Reality, Speech Recognition, Automation, Gaming and so on.
Machine learning
• Machine learning is the sub-field of AI which gives machines the ability to improve their performance over time without explicit intervention or help from a human being.
• In this approach machines are shown thousands or millions of examples and trained how to correctly solve a problem.
• Most of the current applications of machine learning leverage supervised learning.
• Other uses of ML can be broadly classified between unsupervised learning and reinforcement learning.
Deep learning
• Deep learning is a subfield of Machine Learning that very closely tries to mimic the working of the human brain using neurons.
Deep learning
• These techniques focus on building Artificial Neural Networks (ANN)
using several hidden layers.
• There are a variety of deep learning networks such as the Multilayer Perceptron (MLP), Autoencoders (AE), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), Deep Feedforward Networks, etc.
Why Deep learning?
Why Deep learning is growing?
• Processing power needed for deep learning is readily becoming available through GPUs, distributed computing and powerful CPUs.
• Moreover, deep learning models seem to outperform machine
learning models as the data grows.
• Explosion of features and datasets
• Focus on customisation and real-time decisions
• Uncover hard to detect patterns (using traditional techniques)
when the incidence rate is low
Why Deep learning is growing?
• Find latent features (super variables) without significant manual
feature engineering
• Real-time fraud detection and self-learning models using streaming
data (KAFKA, MapR)
• Ensure consistent customer experience and regulatory compliance
• Higher operational efficiency
Challenges with Deep learning
• Data Quality and Quantity: obtaining high-quality labeled data can be expensive and time-consuming. Additionally, the quality of the data can significantly impact the performance and robustness of the models.
• Computational Resources: training deep models needs substantial computational resources, including powerful GPUs or even specialized hardware like TPUs (Tensor Processing Units). This can be a barrier for smaller organizations or researchers with limited access to such resources.
• Overfitting: models can overfit, especially when trained on limited data or when the model capacity is too high relative to the complexity of the problem. Techniques like dropout, regularization, and data augmentation are commonly employed to mitigate this issue.
• Interpretability: Deep learning models are often considered "black boxes" due to their
complexity, making it challenging to understand how they arrive at a particular prediction. This
lack of interpretability can be problematic, especially in critical applications like healthcare or
finance, where understanding the reasoning behind a decision is crucial.
Build a Neural Network
• Neural Network: A computational model that works in a similar
way to the neurons in the human brain.
• Biological neurons are organized in a vast network of billions of neurons.
• Each neuron typically is connected to thousands of other neurons.
Build a Neural Network
• A biological neuron is composed of a Cell body, many dendrites (branching
extensions), one axon (long extension), synapses
• Biological neurons receive signals from other neurons via these synapses.

• When a neuron receives a sufficient number of signals within a few milliseconds, it fires its own signals.
• Comparison between biological neuron and artificial neuron
Neural Network
• A neural network consists of a large number of highly interconnected neurons.
• Each neuron takes an input, performs some operations, then passes the output to the following neuron.

Two-Layer Neural Network


Key Components
1. Layers
• Input layer: contains artificial neurons which receive the input data, which could be raw data (e.g., pixel values of an image). The number of input-layer neurons depends on the number of features.
• Output layer: the final layer in the neural network; contains artificial neurons that are responsible for producing the model's predictions or outputs. The number of output-layer neurons depends on the number of outputs.
• Hidden layers: layers of neurons that perform computations and transformations on the input data. They are called "hidden" because they are not directly observable as inputs or outputs of the system. Instead, they serve as intermediate layers between the input and output layers, capturing complex patterns and features in the data.
More neurons = More calculation = More time
Key Components Cont…
2. Neurons - the basic unit of a neural network. A neuron takes inputs from other neurons and gives the corresponding output. In the simplest model, the inputs and output can only be a binary number, i.e. 0 or 1.
3. Weights - the connection between every pair of neurons, representing the importance given to each input in computing the output. Typically chosen randomly in the first run and optimized using backward propagation.
Key Components Cont…
4. Activation Function - a function used to generate a neuron's output from the matrix multiplication of inputs and weights, along with a bias.
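As an illustration (not from the slides), a single neuron's activation can be sketched with NumPy; the sigmoid is one common choice of activation, and the inputs, weights, and bias below are made-up values:

```python
import numpy as np

def sigmoid(z):
    # Squash any real number into the range (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical inputs, weights, and bias for a single neuron
x = np.array([0.5, -1.0, 2.0])
w = np.array([0.4, 0.3, 0.1])
b = 0.1

z = np.dot(w, x) + b    # weighted sum of inputs plus bias
output = sigmoid(z)     # activation function generates the output
print(round(output, 2))
```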
Key Components Cont…
Ø Neural Network Notation
Key Components Cont…
5. Forward Propagation - weights for each input are initialized to make predictions and compute the error. The output from each layer is fed forward to the next layer.
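A minimal sketch of feeding an input forward through two layers (the layer sizes, random initialization scale, and sigmoid activation are illustrative assumptions, not from the slides):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)

x = np.array([1.0, 0.5])              # input vector with 2 features
W1 = rng.normal(0, 0.1, size=(3, 2))  # hidden layer: 3 neurons, random weights
b1 = np.zeros(3)
W2 = rng.normal(0, 0.1, size=(1, 3))  # output layer: 1 neuron
b2 = np.zeros(1)

h = sigmoid(W1 @ x + b1)       # hidden-layer output, fed forward...
y_hat = sigmoid(W2 @ h + b2)   # ...into the output layer's prediction
print(y_hat.shape)
```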
Key Components Cont…
6. Loss Function - computes the error between actual and predicted values and measures the model's performance. Hyperparameters are fine-tuned to minimize the loss function. Some common loss functions are Mean Square Error, Log Loss, and Cross Entropy.
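Two of the loss functions named above can be sketched directly (the labels and predictions here are made-up example values):

```python
import numpy as np

def mean_square_error(y, y_hat):
    # Average squared difference between actual and predicted values
    return np.mean((y - y_hat) ** 2)

def cross_entropy(y, y_hat):
    # Binary cross-entropy (log loss); clip predictions to avoid log(0)
    y_hat = np.clip(y_hat, 1e-12, 1 - 1e-12)
    return -np.mean(y * np.log(y_hat) + (1 - y) * np.log(1 - y_hat))

y = np.array([1.0, 0.0, 1.0])      # actual labels
y_hat = np.array([0.9, 0.2, 0.8])  # model predictions
print(mean_square_error(y, y_hat))
print(cross_entropy(y, y_hat))
```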
A Simple Artificial Neural Network
• One or more binary inputs and one binary output
• Activates its output when more than a certain number of its inputs are
active.
Ø Linear Threshold Unit (LTU)
• Inputs of an LTU are numbers (not binary).
• Each input connection is associated with a weight.
• Computes a weighted sum of its inputs and applies a step function to that sum.
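The LTU computation above can be sketched as a weighted sum followed by a step function (the weights and threshold are made-up values for illustration):

```python
import numpy as np

def ltu(x, w, threshold=0.0):
    # Weighted sum of numeric inputs, then a step function on the result
    z = np.dot(w, x)
    return 1 if z >= threshold else 0

w = np.array([0.5, 0.5])
print(ltu(np.array([1.0, 1.0]), w))   # weighted sum above threshold: fires
print(ltu(np.array([-1.0, 0.2]), w))  # weighted sum below threshold: silent
```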
Ø Perceptron
• The perceptron is a single layer of LTUs.
• The input neurons output whatever input they are fed.
• A bias neuron, which just outputs 1 all the time.
• If we use logistic function (sigmoid) instead of a step function, it
computes a continuous output.
Ø How is a Perceptron Trained?
• For an LTU to give an output it needs to know the values of the
weights w1, w2… wn.
• The Perceptron training algorithm is inspired by Hebb's rule.
• When a biological neuron frequently triggers another neuron, the connection between these two neurons grows stronger.
• Feed one training instance x at a time to each neuron j and make its prediction ŷ.
• Update the connection weights.
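The weight update in the classic Perceptron rule is w ← w + η(y − ŷ)x; a sketch on a tiny made-up dataset (the OR function, with an illustrative learning rate and epoch count):

```python
import numpy as np

def step(z):
    return np.where(z >= 0, 1, 0)

# Tiny linearly separable dataset: the OR function (made-up choice)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 1, 1, 1])

w = np.zeros(2)
b = 0.0
eta = 0.1  # learning rate

for epoch in range(20):
    for xi, yi in zip(X, y):
        y_hat = step(np.dot(w, xi) + b)
        # Hebb-inspired rule: strengthen weights in proportion to the error
        w += eta * (yi - y_hat) * xi
        b += eta * (yi - y_hat)

predictions = step(X @ w + b)
print(predictions)
```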
Ø Perceptron in Keras
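The slide's Keras code did not survive extraction; a hedged reconstruction of a perceptron-style model in Keras might look like the following (one Dense unit, with a sigmoid standing in for the step function since Keras has no step activation; layer sizes are assumptions):

```python
import numpy as np
from tensorflow import keras

# A single layer with a single unit: a perceptron-style model
model = keras.Sequential([
    keras.layers.Input(shape=(2,)),
    keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="sgd", loss="binary_crossentropy")

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y_hat = model.predict(X, verbose=0)
print(y_hat.shape)  # one probability per input row
```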
Multi-Layer Perceptron (MLP)
Perceptron Weakness
Incapable of solving some trivial problems, e.g., XOR classification problem. Why?
Multi-Layer Perceptron (MLP)
• The limitations of Perceptrons can be eliminated by stacking multiple
Perceptrons.
• The resulting network is called a Multi-Layer Perceptron (MLP) or deep
feedforward neural network.
• A feedforward neural network is composed of:
• One input layer
• One or more hidden layers
• One final output layer
Every layer except the output layer includes
a bias neuron and is fully connected to the
next layer
Ø How Does it Work?
• The model is associated with a directed acyclic graph describing how the functions are composed together.
• E.g., assume a network with just a single neuron in each layer.
Ø XOR with Feedforward Neural Network
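The XOR construction can be checked numerically with hand-picked weights (one of many possible choices, not the slide's specific values): one hidden unit computes OR, another computes AND, and the output unit fires when OR is active but AND is not.

```python
import numpy as np

def step(z):
    return (z >= 0).astype(int)

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])

# Hidden layer: first unit behaves like OR, second like AND
W1 = np.array([[1, 1],
               [1, 1]])
b1 = np.array([-0.5, -1.5])
# Output unit: fires when OR is on and AND is off
W2 = np.array([1, -2])
b2 = -0.5

H = step(X @ W1.T + b1)   # hidden-layer activations
y = step(H @ W2 + b2)     # XOR outputs for each input row
print(y)
```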
Ø How to Learn Model Parameters W?
Feedforward Neural Network - Cost Function

• We use the cross-entropy (minimizing the negative log-likelihood) between the training data y and the model's predictions ŷ as the cost function.

Ø Gradient-Based Learning
• The most significant difference between the linear models we have seen so
far and feedforward neural network?
• The non-linearity of a neural network causes its cost functions to become non-
convex
Ø Gradient-Based Learning Cont…
• Linear models, with convex cost function, guarantee to find global
minimum.
• Convex optimization converges starting from any initial parameters.

• Stochastic gradient descent applied to non-convex cost functions has no such convergence guarantee.
• It is sensitive to the values of the initial parameters.
• For feedforward neural networks, it is important to initialize all weights to
small random values.
• The biases may be initialized to zero or to small positive values.
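The initialization advice above can be sketched as follows (the layer sizes and the 0.01 scale are illustrative choices, not prescribed by the slides):

```python
import numpy as np

rng = np.random.default_rng(42)
layer_sizes = [2, 4, 1]  # input, hidden, output (made-up architecture)

weights, biases = [], []
for n_in, n_out in zip(layer_sizes[:-1], layer_sizes[1:]):
    # Small random weights break the symmetry between neurons in a layer
    weights.append(rng.normal(0.0, 0.01, size=(n_out, n_in)))
    # Biases may start at zero (or at small positive values)
    biases.append(np.zeros(n_out))

print([W.shape for W in weights])
```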
Training Feedforward Neural Networks
Training Feedforward Neural Networks Cont…
Hidden Units
Feedforward Network in Keras
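The slide's code did not survive extraction; a hedged sketch of a small feedforward network in Keras, trained on the XOR problem discussed earlier (the hidden-layer size, activation, optimizer, and epoch count are all illustrative assumptions):

```python
import numpy as np
from tensorflow import keras

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 1, 1, 0], dtype=float)  # XOR labels

model = keras.Sequential([
    keras.layers.Input(shape=(2,)),
    keras.layers.Dense(8, activation="tanh"),    # hidden layer
    keras.layers.Dense(1, activation="sigmoid"), # output layer
])
model.compile(optimizer=keras.optimizers.Adam(0.1),
              loss="binary_crossentropy")
model.fit(X, y, epochs=200, verbose=0)

# Threshold the sigmoid outputs to get class predictions
preds = (model.predict(X, verbose=0) > 0.5).astype(int).ravel()
print(preds)
```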
