Deep Learning Assignment 01
The document discusses advanced neural network architectures, focusing on Convolutional Neural Networks (CNNs) for image processing and Recurrent Neural Networks (RNNs) for sequential data. It also covers activation functions such as ReLU and Tanh, highlighting their importance in introducing non-linearity, and explores loss functions like Mean Squared Error and Cross-Entropy Loss used for optimization in regression and classification tasks. Real-world applications for each concept are provided, illustrating their significance in various fields.
Question 1: Advanced Neural Network Architectures

Neural networks have advanced beyond simple fully connected architectures. Two commonly used advanced models are Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs).

1. Convolutional Neural Networks (CNNs)
● Definition: CNNs are specialized neural networks used primarily for image-processing tasks.
● How They Work:
o Convolutional layers apply filters (kernels) to an image to detect patterns such as edges, textures, and more complex features.
o Pooling layers (e.g., max pooling) reduce data dimensionality, making computations more efficient.
o Unlike traditional neural networks, CNNs do not fully connect every neuron; instead, they focus on local spatial features.
● Difference from Fully Connected Networks:
o CNNs take advantage of the spatial structure in images, making them more efficient by sharing weights and reducing the number of parameters.
o Fully connected networks treat each input value independently, which does not preserve the spatial relationships in an image.
● Real-World Applications:
o Image classification – used in self-driving cars, medical image analysis, and facial recognition.
o Object detection – used in security surveillance and autonomous vehicles.
o Style transfer and image generation – used in AI-generated artwork and deepfake technology.

2. Recurrent Neural Networks (RNNs)
● Definition: RNNs are designed to handle sequential data by maintaining a hidden state that carries information from past inputs.
● How They Work:
o Unlike traditional networks, RNNs contain loops, allowing them to retain a memory of past inputs.
o This makes them useful for problems where context matters, such as language processing.
● Difference from Fully Connected Networks:
o Fully connected networks treat each input as independent, while RNNs maintain dependencies across a sequence.
o RNNs are best suited for tasks that require memory over time, such as speech or text prediction.
● Real-World Applications:
o Speech recognition – used in voice assistants such as Siri and Google Assistant.
o Machine translation – used by Google Translate to convert between languages.
o Stock price prediction – used in financial forecasting.
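To ground these descriptions, the NumPy sketch below shows one convolution pass with a single filter and one recurrent update step. It is an illustrative toy, not part of the original assignment; the array sizes, the edge-detection kernel, and the function names (conv2d_single_filter, rnn_step) are assumptions chosen for the example.

import numpy as np

def conv2d_single_filter(image, kernel):
    # Slide one filter over a 2-D image (stride 1, no padding) and record
    # the filter's response at every position - the core CNN operation.
    h, w = image.shape
    kh, kw = kernel.shape
    out = np.zeros((h - kh + 1, w - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    # One recurrent update: the new hidden state mixes the current input
    # with the previous hidden state, which is how an RNN carries memory.
    return np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)

image = np.random.rand(6, 6)                      # toy grayscale "image"
kernel = np.array([[1.0, -1.0], [1.0, -1.0]])     # crude vertical-edge detector
print(conv2d_single_filter(image, kernel).shape)  # (5, 5) feature map

x_t = np.random.rand(1, 4)      # one time step with 4 input features
h_prev = np.zeros((1, 8))       # hidden state with 8 units
W_xh = np.random.rand(4, 8)
W_hh = np.random.rand(8, 8)
b_h = np.zeros(8)
print(rnn_step(x_t, h_prev, W_xh, W_hh, b_h).shape)  # (1, 8) new hidden state

Stacking many such filters together with pooling layers gives a CNN; applying rnn_step repeatedly over a sequence, feeding each output back in as h_prev, gives an RNN.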
Question 2: Beyond Sigmoid - Activation Functions in Neural Networks

Activation functions are essential in deep learning because they introduce non-linearity, allowing neural networks to model complex relationships. Two widely used activation functions beyond Sigmoid are ReLU and Tanh.

1. Rectified Linear Unit (ReLU)
● Definition: f(x) = \max(0, x)
● How It Works:
o If the input value is positive, it passes through unchanged.
o If the input value is negative, the output is zero.
● Advantages:
o Mitigates the vanishing gradient problem that affects Sigmoid and Tanh.
o Computationally efficient, making it faster than most other activation functions.
● Common Usage:
o Used in almost all modern deep neural networks for tasks such as image recognition, object detection, and deep reinforcement learning.
● Limitation:
o Dying ReLU problem – some neurons may permanently output zero if their weights stop being updated.

2. Hyperbolic Tangent (Tanh)
● Definition: f(x) = \frac{e^x - e^{-x}}{e^x + e^{-x}}
● How It Works:
o Outputs values between -1 and 1, making it centered around zero.
o Helps in cases where both negative and positive inputs are important.
● Advantages:
o Provides better convergence than Sigmoid because its outputs are zero-centered.
o Helps networks learn patterns involving both positive and negative values.
● Common Usage:
o Frequently used in Recurrent Neural Networks (RNNs) due to better gradient flow than Sigmoid.
● Limitation:
o Still suffers from the vanishing gradient problem, although less severely than Sigmoid.
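A minimal sketch of the two activation functions, assuming NumPy and a few hand-picked input values purely for illustration:

import numpy as np

def relu(x):
    # ReLU: keep positive values, clamp negatives to zero.
    return np.maximum(0, x)

def tanh(x):
    # Tanh: squashes inputs into (-1, 1), centered at zero.
    # np.tanh computes (e^x - e^-x) / (e^x + e^-x).
    return np.tanh(x)

x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(relu(x))  # [0.  0.  0.  0.5 2. ]
print(tanh(x))  # roughly [-0.96 -0.46  0.    0.46  0.96]

Note how ReLU discards negative inputs entirely, while Tanh preserves their sign; large-magnitude inputs saturate Tanh near -1 or 1, which is where its remaining vanishing-gradient issue comes from.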
Question 3: Exploring Loss Functions
Loss functions measure how well a neural network's predictions match the actual values. Two commonly used loss functions are Mean Squared Error (MSE) and Cross-Entropy Loss.

1. Mean Squared Error (MSE)
● Formula: MSE = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2
● Usage: Used in regression problems, where predictions are continuous values.
● Why It's Suitable: Penalizes larger errors more heavily and provides a smooth gradient for optimization.
● Real-World Applications:
o Predicting house prices, weather forecasting, and stock market trends.

2. Cross-Entropy Loss (for Multi-Class Classification)
● Formula: L = -\sum_{i} y_i \log(\hat{y}_i)
● Usage: Used in classification problems where multiple categories exist.
● Why It's Suitable: Works directly with softmax outputs, which form a valid probability distribution over the classes.
● Real-World Applications:
o Image classification (e.g., identifying objects in an image).
o Spam detection, sentiment analysis, and language modeling.
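The sketch below computes both losses with NumPy; the numbers (two "house prices" and a three-class softmax output) are made-up values for illustration only, not data from the assignment.

import numpy as np

def mse(y_true, y_pred):
    # Mean Squared Error: average of squared differences.
    return np.mean((y_true - y_pred) ** 2)

def cross_entropy(y_true_onehot, y_pred_probs, eps=1e-12):
    # Cross-entropy for one sample: -sum(y_i * log(p_i)).
    # eps guards against log(0) when a predicted probability is exactly zero.
    return -np.sum(y_true_onehot * np.log(y_pred_probs + eps))

# Regression example: predicted vs. actual prices (arbitrary units).
print(mse(np.array([200.0, 310.0]), np.array([210.0, 300.0])))               # 100.0

# Classification example: true class is index 1, softmax output below.
print(cross_entropy(np.array([0.0, 1.0, 0.0]), np.array([0.1, 0.7, 0.2])))   # about 0.357

Because the true label is one-hot, only the predicted probability of the correct class contributes to the cross-entropy, so the loss shrinks as that probability approaches 1.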