Unit II
Data Size: ML models can perform well with smaller datasets, making them suitable for applications with limited data availability. DL models typically require large amounts of data to generalize effectively; they thrive when trained on big datasets.
Training Time: ML models often train faster than DL models. DL models can require prolonged training times, especially on large datasets.
Interpretability: Traditional ML models are often more interpretable because they rely on human-engineered features and simpler algorithms. DL models, particularly deep neural networks, are considered black boxes because understanding their decision-making processes can be challenging.
Feature Engineering: In traditional ML, a significant amount of time is spent on feature engineering, which involves selecting, transforming, and engineering relevant features from the raw data to improve model performance. DL models can automatically learn features from the raw data, reducing the need for extensive feature engineering. This is one of the key advantages of deep learning.
Width Vs. Depth of Neural Networks
Width and Depth are two important architectural aspects of neural networks that affect their capacity and performance.
The number of neurons (or units) in each layer of a neural network is known as its width; the number of layers is known as its depth.
Increasing the width of a neural network can increase its capacity to learn complex patterns in the data.
However, a very wide network may also require more training data and computational
resources and may be prone to overfitting.
Source: NPTEL IIT KGP
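To make the width/depth distinction concrete, here is a minimal sketch (not from the slides; PyTorch and the layer sizes are assumptions for illustration) that defines one wide, shallow MLP and one narrower, deeper MLP and compares their parameter counts.

```python
import torch
import torch.nn as nn

# Wide network: few layers, many neurons per layer.
wide_net = nn.Sequential(
    nn.Linear(784, 2048),   # one very wide hidden layer
    nn.ReLU(),
    nn.Linear(2048, 10),
)

# Deep network: more layers, fewer neurons per layer.
deep_net = nn.Sequential(
    nn.Linear(784, 256),
    nn.ReLU(),
    nn.Linear(256, 256),
    nn.ReLU(),
    nn.Linear(256, 256),
    nn.ReLU(),
    nn.Linear(256, 10),
)

def count_params(model):
    return sum(p.numel() for p in model.parameters())

print("wide:", count_params(wide_net))   # ~1.6M parameters
print("deep:", count_params(deep_net))   # ~0.3M parameters
```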
Sigmoid:
The sigmoid function compresses all inputs to the range (0, 1).
The gradient of the function is ∂σ(x)/∂x = σ(x)(1 − σ(x)).
Cons:
Saturation region: a sigmoid neuron is saturated when σ(x) = 1 or σ(x) = 0.
From the graph, the gradient at saturation is 0, so the update w = w − η∇w changes nothing because ∇w = 0.
A saturated neuron therefore causes the gradient to vanish.
Sigmoid is not zero centered.

Tanh:
The Tanh function compresses all inputs to the range (-1, 1).
The gradient of the function is ∂tanh(x)/∂x = 1 − tanh²(x).
Pros:
It is zero centered.
Cons:
The vanishing-gradient problem is still present.
Computationally expensive.
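As an illustration of the saturation behaviour described above, the following NumPy sketch (an assumption for demonstration, not part of the slides) evaluates both gradients at increasingly large inputs and shows them shrinking toward zero.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1.0 - s)          # dσ/dx = σ(x)(1 − σ(x))

def tanh_grad(x):
    return 1.0 - np.tanh(x) ** 2  # d tanh/dx = 1 − tanh²(x)

for x in [0.0, 2.0, 5.0, 10.0]:
    print(f"x={x:5.1f}  sigmoid'={sigmoid_grad(x):.6f}  tanh'={tanh_grad(x):.6f}")

# As |x| grows, both gradients approach 0, so the weight update
# w = w − η∇w becomes negligible (the vanishing-gradient problem).
```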
Rectified Linear Unit (ReLU)
ReLU stands for Rectified Linear Unit.
It is a non-linear activation function, defined as f(x) = max(0, x).
Pros:
It doesn't saturate in the positive region.
Computationally efficient.
Much faster than sigmoid/Tanh.
The derivative of ReLU is 1 for x > 0 and 0 for x < 0.
Cons:
For negative inputs the gradient is 0, which causes the dead neuron problem.
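A small NumPy sketch (illustrative; not from the slides) of ReLU and its derivative, showing how negative pre-activations yield a zero gradient, which is what leads to dead neurons.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)         # f(x) = max(0, x)

def relu_grad(x):
    return (x > 0).astype(float)      # 1 for x > 0, 0 for x <= 0

x = np.array([-3.0, -0.5, 0.0, 0.5, 3.0])
print(relu(x))       # [0.  0.  0.  0.5 3. ]
print(relu_grad(x))  # [0. 0. 0. 1. 1.]

# If a neuron's pre-activation stays negative for every input, its gradient
# is always 0 and its weights never update: the "dead neuron" problem.
```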
Leaky ReLU, Parametric ReLU & Exponential ReLU
Leaky ReLU: f(x) = x for x > 0 and f(x) = αx for x ≤ 0, where α is a small fixed slope (e.g., 0.01), so negative inputs keep a small non-zero gradient.
Parametric ReLU: the same form as Leaky ReLU, but the slope α is learned during training.
Exponential ReLU (ELU): f(x) = x for x > 0 and f(x) = α(eˣ − 1) for x ≤ 0, which saturates smoothly to −α for large negative inputs.
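The following NumPy sketch gives one possible implementation of the three variants; the α values used are common defaults assumed here for illustration, not values from the slides.

```python
import numpy as np

def leaky_relu(x, alpha=0.01):      # fixed small slope for x <= 0
    return np.where(x > 0, x, alpha * x)

def parametric_relu(x, alpha):      # same form, but alpha is learned
    return np.where(x > 0, x, alpha * x)

def elu(x, alpha=1.0):              # exponential (ELU) branch for x <= 0
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))

x = np.array([-2.0, -0.5, 0.0, 1.5])
print(leaky_relu(x))                 # small negative outputs instead of 0
print(parametric_relu(x, alpha=0.2))
print(elu(x))                        # smooth, saturates toward -alpha
```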
Expectations of an autoencoder:
Sensitive enough to the input for accurate reconstruction.
Insensitive enough that it doesn't memorize or overfit the training data.
Loss function ⇒ L(X, X̂) + Regularizer
Encoder: h = g(W Xᵢ + b)
Decoder: X̂ᵢ = f(W* h + c)
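Putting the encoder, decoder, and regularized loss together, here is a minimal PyTorch sketch (the framework, layer sizes, and the L2 regularizer weight are assumptions for illustration, not values from the slides).

```python
import torch
import torch.nn as nn

class Autoencoder(nn.Module):
    def __init__(self, in_dim=784, hidden_dim=64):
        super().__init__()
        self.encoder = nn.Linear(in_dim, hidden_dim)   # W, b
        self.decoder = nn.Linear(hidden_dim, in_dim)   # W*, c

    def forward(self, x):
        h = torch.sigmoid(self.encoder(x))        # h = g(W x + b)
        x_hat = torch.sigmoid(self.decoder(h))    # x̂ = f(W* h + c)
        return x_hat

model = Autoencoder()
x = torch.rand(32, 784)                            # dummy batch for illustration
x_hat = model(x)
recon = nn.functional.mse_loss(x_hat, x)           # L(X, X̂)
reg = 1e-4 * model.encoder.weight.pow(2).sum()     # Regularizer (L2 penalty on W)
loss = recon + reg
loss.backward()
```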