Deep Learning - Unit 2

The document provides an overview of various neural network architectures and optimization algorithms, including Deep Feedforward Neural Networks, Gradient Descent, and different types of auto-encoders. It discusses their definitions, architectures, formulas, advantages, and applications in machine learning. Additionally, it covers techniques for regularization and dataset augmentation to enhance model performance.

1. Deep Feedforward Neural Networks

• Definition: Deep Feedforward Neural Networks are a type of artificial neural network
where connections between the nodes do not form a cycle. This is the simplest form of
neural networks.

• Architecture: Consists of an input layer, several hidden layers, and an output layer.

• Activation Functions: Commonly used activation functions include Sigmoid, Tanh, and
ReLU.

• Forward Propagation: Involves calculating the output of each neuron layer by layer, from the input layer through the hidden layers to the output layer (a minimal code sketch follows this list).

• Use Cases: Image and speech recognition, language translation, and other applications
requiring pattern recognition.
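
A minimal forward-propagation sketch in NumPy, assuming one hidden layer with ReLU activation and randomly initialized weights; the layer sizes and variable names are illustrative, not taken from the notes:

import numpy as np

rng = np.random.default_rng(0)

def relu(z):
    return np.maximum(0.0, z)

# Illustrative sizes: 4 inputs -> 8 hidden units -> 3 outputs
W1, b1 = rng.normal(size=(4, 8)), np.zeros(8)
W2, b2 = rng.normal(size=(8, 3)), np.zeros(3)

def forward(x):
    h = relu(x @ W1 + b1)     # hidden-layer activations
    return h @ W2 + b2        # output-layer scores

x = rng.normal(size=(1, 4))   # one input example
print(forward(x))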

2. Gradient Descent (GD)

• Definition: An optimization algorithm that minimizes the cost function by iteratively adjusting the model parameters in the direction opposite to the gradient.

• Types of Gradient Descent:

o Batch Gradient Descent: Uses the entire dataset for each update.

o Stochastic Gradient Descent: Uses one training example at each update.

o Mini-batch Gradient Descent: Uses a small random subset of the dataset for each update (the three variants are sketched in code after this list).
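
A minimal NumPy sketch of the three variants for linear regression with squared error; the data, learning rate, and step count are assumed purely for illustration:

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                          # 100 examples, 3 features
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=100)

def gradient(theta, Xb, yb):
    # Gradient of mean squared error for a linear model on the given subset
    return 2.0 / len(yb) * Xb.T @ (Xb @ theta - yb)

theta, eta = np.zeros(3), 0.1

for step in range(100):
    # Batch GD: the full dataset
    g = gradient(theta, X, y)

    # Stochastic GD (alternative): one random example
    # i = rng.integers(len(y)); g = gradient(theta, X[i:i+1], y[i:i+1])

    # Mini-batch GD (alternative): a small random subset
    # idx = rng.choice(len(y), size=16, replace=False); g = gradient(theta, X[idx], y[idx])

    theta -= eta * g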

3. Momentum-Based GD

• Definition: Enhances gradient descent by adding a momentum term that accelerates convergence and damps oscillations.

• Formula: v(t) = γ·v(t−1) + η·∇J(θ), followed by the parameter update θ = θ − v(t)

o v(t): velocity (momentum term)

o γ: momentum hyperparameter

o η: learning rate

o ∇J(θ): gradient of the cost function
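
A sketch of one momentum step as a reusable function, assuming the gradient is supplied by the caller; the function name and hyperparameter values are illustrative:

import numpy as np

def momentum_step(theta, velocity, grad, gamma=0.9, eta=0.01):
    # v(t) = gamma * v(t-1) + eta * grad;  theta is then moved by -v(t)
    velocity = gamma * velocity + eta * grad
    return theta - velocity, velocity

theta, v = np.zeros(3), np.zeros(3)
grad = np.array([0.5, -1.0, 0.2])      # example gradient
theta, v = momentum_step(theta, v, grad)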

4. Nesterov Accelerated GD
• Definition: An improved version of momentum-based GD that looks ahead to the
estimated future position.

• Formula: v(t) = γ·v(t−1) + η·∇J(θ − γ·v(t−1)), followed by the parameter update θ = θ − v(t)
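
The look-ahead can be sketched by evaluating the gradient at the anticipated position θ − γ·v(t−1); grad_fn is an assumed caller-supplied gradient function:

import numpy as np

def nesterov_step(theta, velocity, grad_fn, gamma=0.9, eta=0.01):
    lookahead = theta - gamma * velocity              # estimated future position
    velocity = gamma * velocity + eta * grad_fn(lookahead)
    return theta - velocity, velocity

# Example with the quadratic cost J(theta) = ||theta||^2, whose gradient is 2*theta
theta, v = np.ones(3), np.zeros(3)
theta, v = nesterov_step(theta, v, lambda t: 2 * t)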

5. Stochastic Gradient Descent (SGD)

• Definition: An iterative method for optimizing an objective function using one training
example at a time.

• Advantages: Faster convergence for large datasets, reduced computational cost.

• Disadvantages: Can lead to noisy updates and require careful tuning of the learning rate.

6. AdaGrad

• Definition: An adaptive gradient algorithm that adjusts the learning rate for each
parameter based on historical gradient information.

• Formula: θ(t+1) = θ(t) − (η / √(G(t) + ϵ))·∇J(θ(t))

o G(t): sum of the squares of the past gradients

o ϵ: small constant to avoid division by zero
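
A minimal AdaGrad sketch in NumPy; G accumulates squared gradients per parameter, so frequently updated parameters receive smaller effective learning rates. Names and values are illustrative:

import numpy as np

def adagrad_step(theta, G, grad, eta=0.1, eps=1e-8):
    G = G + grad ** 2                              # running sum of squared gradients
    theta = theta - eta / np.sqrt(G + eps) * grad  # per-parameter scaled step
    return theta, G

theta, G = np.zeros(3), np.zeros(3)
grad = np.array([0.5, -1.0, 0.2])                  # example gradient
theta, G = adagrad_step(theta, G, grad)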

7. Adam

• Definition: Combines the advantages of AdaGrad and RMSProp, using adaptive learning
rates and momentum.

• Parameters: β1 (decay rate for the first moment), β2 (decay rate for the second moment), ϵ (small constant).

• Formula: m(t) = β1·m(t−1) + (1 − β1)·∇J(θ(t)) and v(t) = β2·v(t−1) + (1 − β2)·(∇J(θ(t)))², followed by the bias-corrected update θ(t+1) = θ(t) − η·m̂(t) / (√v̂(t) + ϵ), where m̂(t) = m(t)/(1 − β1^t) and v̂(t) = v(t)/(1 − β2^t).
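
A sketch of one Adam step with bias correction, assuming the commonly used defaults β1 = 0.9, β2 = 0.999, ϵ = 1e-8:

import numpy as np

def adam_step(theta, m, v, grad, t, eta=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    m = beta1 * m + (1 - beta1) * grad            # first moment (mean of gradients)
    v = beta2 * v + (1 - beta2) * grad ** 2       # second moment (uncentered variance)
    m_hat = m / (1 - beta1 ** t)                  # bias correction for early steps
    v_hat = v / (1 - beta2 ** t)
    theta = theta - eta * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

theta, m, v = np.zeros(3), np.zeros(3), np.zeros(3)
grad = np.array([0.5, -1.0, 0.2])                 # example gradient
theta, m, v = adam_step(theta, m, v, grad, t=1)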

8. RMSProp

• Definition: An optimization algorithm that adjusts the learning rate by dividing the
gradient by a running average of its recent magnitude.

• Formula: θ(t+1) = θ(t) − (η / √(E[g²](t) + ϵ))·∇J(θ(t))
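
A minimal RMSProp sketch; E[g²] is the exponentially decaying average of squared gradients, and the decay rate 0.9 is an assumed common default:

import numpy as np

def rmsprop_step(theta, avg_sq, grad, eta=0.01, decay=0.9, eps=1e-8):
    avg_sq = decay * avg_sq + (1 - decay) * grad ** 2   # running average of g^2
    theta = theta - eta / np.sqrt(avg_sq + eps) * grad
    return theta, avg_sq

theta, avg_sq = np.zeros(3), np.zeros(3)
grad = np.array([0.5, -1.0, 0.2])                       # example gradient
theta, avg_sq = rmsprop_step(theta, avg_sq, grad)
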
9. Auto-encoder

• Definition: An unsupervised learning model used to encode input data into a compressed representation and then decode it back to reconstruct the input.

• Architecture: Consists of an encoder (compresses the input) and a decoder (reconstructs the input), as sketched in code after this list.

• Applications: Dimensionality reduction, feature learning, and anomaly detection.
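
A minimal encoder/decoder sketch in PyTorch, assuming 784-dimensional inputs (e.g. flattened 28×28 images) and a 32-dimensional code; the sizes are illustrative:

import torch
import torch.nn as nn
import torch.nn.functional as F

encoder = nn.Sequential(nn.Linear(784, 32), nn.ReLU())     # compresses the input to a 32-d code
decoder = nn.Sequential(nn.Linear(32, 784), nn.Sigmoid())  # reconstructs the 784-d input

x = torch.rand(16, 784)          # a batch of 16 flattened inputs
x_hat = decoder(encoder(x))      # reconstruction
loss = F.mse_loss(x_hat, x)      # reconstruction error to minimize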

10. Regularization in Auto-encoders

• Purpose: Prevent overfitting and improve generalization by adding constraints to the model.

• Techniques:

o L1 Regularization: Adds the sum of the absolute values of the weights to the loss function.

o L2 Regularization: Adds the sum of the squared weights to the loss function.
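
A sketch of adding an L1 or L2 weight penalty to the reconstruction loss, continuing the PyTorch encoder/decoder above; the penalty weight 1e-4 is an assumed example value:

import torch
import torch.nn.functional as F

def regularized_loss(x, x_hat, params, lam=1e-4, kind="l2"):
    recon = F.mse_loss(x_hat, x)
    if kind == "l1":
        penalty = sum(p.abs().sum() for p in params)    # L1: sum of |w|
    else:
        penalty = sum((p ** 2).sum() for p in params)   # L2: sum of w^2
    return recon + lam * penalty

# Usage with the encoder/decoder above:
# loss = regularized_loss(x, x_hat, list(encoder.parameters()) + list(decoder.parameters()))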

11. Denoising Auto-encoders

• Definition: Train on a noisy version of the input data and aim to reconstruct the clean
input.

• Objective: Improve the model's robustness and ability to capture relevant structures in
the data.
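
The denoising objective can be sketched by corrupting the input before encoding while measuring the loss against the clean input; Gaussian noise with standard deviation 0.1 is an assumed choice of corruption:

import torch
import torch.nn.functional as F

def denoising_loss(x, encoder, decoder, noise_std=0.1):
    x_noisy = x + noise_std * torch.randn_like(x)   # corrupt the input
    x_hat = decoder(encoder(x_noisy))               # reconstruct from the corrupted version
    return F.mse_loss(x_hat, x)                     # compare against the clean input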

12. Sparse Auto-encoders

• Definition: Apply sparsity constraints on the hidden layer activations to encourage learning a compact and efficient representation.

• Technique: Use an additional sparsity penalty term in the loss function.
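
One common sparsity penalty is an L1 term on the hidden activations added to the reconstruction loss; a sketch, with the penalty weight 1e-3 assumed purely as an example:

import torch
import torch.nn.functional as F

def sparse_autoencoder_loss(x, encoder, decoder, sparsity_weight=1e-3):
    h = encoder(x)                       # hidden-layer activations (the code)
    x_hat = decoder(h)
    recon = F.mse_loss(x_hat, x)
    sparsity = h.abs().mean()            # L1 penalty pushes most activations toward zero
    return recon + sparsity_weight * sparsity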

13. Contractive Auto-encoders

• Definition: Penalize the gradient of the encoder's activations with respect to the input to
make the learned representation robust to small variations.

• Formula: Add a term to the loss function proportional to the squared Frobenius norm of the Jacobian of the hidden representation with respect to the input.
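
The penalty can be sketched with PyTorch autograd as below; this straightforward formulation is memory-hungry for large models or batches, and the penalty weight is an assumed example value:

import torch
import torch.nn.functional as F
from torch.autograd.functional import jacobian

def contractive_loss(x, encoder, decoder, lam=1e-4):
    # x: a single input vector; the full Jacobian grows quickly for big models or batches
    x_hat = decoder(encoder(x))
    recon = F.mse_loss(x_hat, x)
    J = jacobian(encoder, x, create_graph=True)   # d(code)/d(input), kept in the graph
    penalty = (J ** 2).sum()                      # squared Frobenius norm
    return recon + lam * penalty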

14. Variational Auto-encoder

• Definition: A generative model that learns a probabilistic distribution over the latent
space, allowing for the generation of new data samples.
• Objective: Maximize the Evidence Lower Bound (ELBO) to ensure the latent variables
follow a desired distribution.
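
A compact sketch of the VAE objective in PyTorch, assuming a Gaussian latent with a standard-normal prior; the encoder outputs a mean and log-variance, and the loss is the negative ELBO (reconstruction error plus KL divergence). Layer sizes are illustrative:

import torch
import torch.nn as nn
import torch.nn.functional as F

enc = nn.Linear(784, 64)                               # encoder trunk (sizes are illustrative)
to_mu, to_logvar = nn.Linear(64, 32), nn.Linear(64, 32)
dec = nn.Sequential(nn.Linear(32, 784), nn.Sigmoid())  # decoder

def vae_loss(x):
    h = torch.relu(enc(x))
    mu, logvar = to_mu(h), to_logvar(h)
    z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)    # reparameterization trick
    x_hat = dec(z)
    recon = F.mse_loss(x_hat, x, reduction="sum")              # reconstruction term
    kl = -0.5 * torch.sum(1 + logvar - mu ** 2 - logvar.exp()) # KL(q(z|x) || N(0, I))
    return recon + kl                                          # negative ELBO (to minimize)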

15. Auto-encoders relationship with PCA and SVD

• PCA: Auto-encoders can be seen as a non-linear extension of PCA, which performs linear
dimensionality reduction.

• SVD: Singular Value Decomposition can be used to compute PCA and to analyze the linear transformations performed by linear auto-encoders.
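
As a concrete illustration of the SVD route, PCA can be computed from the SVD of the centered data, and a linear auto-encoder with squared-error loss learns the same subspace as the top principal directions; the data below are random, purely for demonstration:

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))                 # 200 examples, 10 features
Xc = X - X.mean(axis=0)                        # center the data

U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
k = 3
components = Vt[:k]                            # top-k principal directions

codes = Xc @ components.T                      # "encoder": project onto the subspace
X_recon = codes @ components + X.mean(axis=0)  # "decoder": map back and un-center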

16. Dataset Augmentation

• Definition: Techniques to artificially increase the size and diversity of a dataset by applying transformations such as rotation, scaling, flipping, and adding noise.

• Purpose: Improve the model's generalization by providing more varied training examples.
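
A minimal NumPy sketch of simple image augmentations (flip, 90-degree rotation, additive Gaussian noise); real pipelines typically use a library such as torchvision, but the operations below rely only on NumPy, and the noise level is an assumed example value:

import numpy as np

rng = np.random.default_rng(0)
image = rng.random((28, 28))                   # stand-in for a grayscale image

flipped = np.fliplr(image)                     # horizontal flip
rotated = np.rot90(image)                      # 90-degree rotation
noisy = np.clip(image + 0.05 * rng.normal(size=image.shape), 0.0, 1.0)  # additive noise

augmented_batch = np.stack([image, flipped, rotated, noisy])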
