Neural Networks - Test Questions

The document discusses various concepts related to artificial neurons and neural networks, including input patterns, activation functions, network architectures, training challenges, regularization techniques, hyperparameters, and evaluation methods. It also covers specific neural network types such as CNNs, RNNs, and GANs, as well as techniques for handling imbalanced datasets and improving training efficiency. Additionally, it explains the importance of activation functions, the bias-variance trade-off, and methods like batch normalization and softmax in neural network training.


1. Below is a diagram of a single artificial neuron (unit):

The node has three inputs x = (x1, x2, x3) that receive only binary signals (either 0 or 1). How many
different input patterns can this node receive? What if the node had four inputs? Or five inputs?

2. Suppose that the weights corresponding to the three inputs have the following values:

The activation of the unit is given by the step-function:

Calculate the output value y of the unit for each of
the following input patterns:

3. Find the output value y for each pattern for the following network, which represents the logical AND function
4. Suggest how the above network can be used to implement the logical OR function (true when at least one of
the arguments is true)

5. The following diagram represents a feed-forward neural network with one hidden layer:

The weight on the connection between nodes i and j is denoted by wij; for example, w13 is the weight on the
connection between nodes 1 and 3. The following table lists all the weights in the network.

Here v denotes the weighted sum of inputs to a node. Each of the input nodes (1 and 2) can only receive binary
values (either 0 or 1). Calculate the output of the network (y5 and y6) for each of the input patterns.

6. Which of the following are common techniques for dealing with vanishing or exploding gradients in RNNs?

A. LSTM (Long Short-Term Memory) networks
B. GRU (Gated Recurrent Unit) networks
C. Gradient clipping
D. Weight normalization
E. All of the above
F. None of the above

7. What are activation functions, and why are they important?


o Answer: Activation functions introduce non-linearity into the neural
network. They transform the weighted sum of inputs into an output value.
Common activation functions include:
o Sigmoid: Outputs values between 0 and 1, often used in binary
classification.
o ReLU (Rectified Linear Unit): Outputs the input if positive, 0
otherwise, known for its computational efficiency.
o Tanh (Hyperbolic Tangent): Similar to sigmoid, outputs values
between -1 and 1.
o Softmax: Used for multi-class classification, outputs probabilities
for each class summing to 1.
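
As a minimal illustration (the function definitions and test vector below are a sketch, not taken from the document), these activations can be written in a few lines of NumPy:

```python
import numpy as np

def sigmoid(z):
    # Squashes each value into (0, 1)
    return 1.0 / (1.0 + np.exp(-z))

def relu(z):
    # Passes positive values through, zeros out the rest
    return np.maximum(0.0, z)

def tanh(z):
    # Squashes each value into (-1, 1)
    return np.tanh(z)

def softmax(z):
    # Subtract the max for numerical stability, then normalize
    e = np.exp(z - np.max(z))
    return e / e.sum()

z = np.array([-2.0, 0.0, 3.0])
print(sigmoid(z), relu(z), tanh(z), softmax(z))  # softmax output sums to 1
```
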

8. What are the different types of neural network architectures?


o Answer: There are various neural network architectures, each
suited for specific tasks:
 Feedforward Neural Networks: Information flows in one
direction, from input to output, without loops. Examples
include Multilayer Perceptrons (MLPs).
 Recurrent Neural Networks (RNNs): Process sequential
data by having feedback loops, enabling them to remember
past information. Examples include LSTMs and GRUs.
 Convolutional Neural Networks (CNNs): Designed for
image processing, they use convolutional layers to extract
features from spatial data.
 Generative Adversarial Networks (GANs): Composed of
two networks, a generator and a discriminator, competing
against each other to learn realistic data distributions.
 Autoencoders: Learn compressed representations of data
by encoding and decoding information, useful for
dimensionality reduction and anomaly detection.
9. What are the common challenges faced during neural network
training?
o Answer:
 Overfitting: The network learns the training data too well
and fails to generalize to unseen data.
 Underfitting: The network is not complex enough to learn
the underlying patterns in the data.
 Vanishing or Exploding Gradients: During
backpropagation, gradients can become extremely small or
large, hindering effective weight updates.
 Local Minima: Gradient descent can get stuck in local
minima, not reaching the global minimum of the loss
function.
 Data Imbalance: If the training data is unevenly distributed
across classes, the network may bias towards the majority
class.
10. Explain the concept of regularization in neural networks.
o Answer: Regularization techniques are used to prevent overfitting
by adding constraints to the network's learning process. Common
regularization methods include:
 L1 Regularization (Lasso): Adds a penalty proportional to
the absolute value of the weights, encouraging sparsity
(setting some weights to zero).
 L2 Regularization (Ridge): Adds a penalty proportional to
the squared value of the weights, reducing the magnitude of
the weights and preventing them from becoming too large.
 Dropout: Randomly drops out neurons during training,
preventing co-adaptation and encouraging the network to
learn more robust features.
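
A minimal NumPy sketch of how these ideas enter training; the task loss, penalty strength, and dropout rate below are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 3))           # example weight matrix
data_loss = 0.25                      # pretend task loss, for illustration only
lam = 0.01                            # regularization strength (a hyperparameter)

l1_penalty = lam * np.sum(np.abs(W))  # L1: encourages sparse weights
l2_penalty = lam * np.sum(W ** 2)     # L2: discourages large weights
total_loss = data_loss + l1_penalty + l2_penalty

# Dropout during training: randomly zero activations and scale the survivors
# ("inverted dropout") so the expected activation is unchanged at test time.
p_keep = 0.8
a = rng.normal(size=(5, 4))           # example activations
mask = (rng.random(a.shape) < p_keep) / p_keep
a_dropped = a * mask
```
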
11. What are hyperparameters in neural networks, and how are they
tuned?
o Answer: Hyperparameters are parameters that are not learned by
the network during training but are set beforehand. Examples
include:
 Learning rate: Controls the step size of weight updates.
 Number of layers and neurons: Determines the network's
complexity.
 Batch size: The number of training examples used in each
update step.
 Regularization parameters: Control the strength of
regularization.

Hyperparameters are tuned using techniques like:

 Grid search: Trying different combinations of
hyperparameters on a predefined grid.
 Random search: Randomly sampling hyperparameters from
a predefined distribution.
 Bayesian optimization: Using a Bayesian model to guide
the search for optimal hyperparameters.
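
A small sketch of grid search and random search over two assumed hyperparameters (learning rate and batch size); train_and_evaluate is a hypothetical placeholder for training a model and returning a validation score:

```python
import itertools
import random

def train_and_evaluate(lr, batch_size):
    # Hypothetical placeholder: train a model with these settings and
    # return a validation score (higher is better).
    return -abs(lr - 0.01) - abs(batch_size - 64) / 1000

# Grid search: try every combination on a predefined grid.
grid = {"lr": [0.1, 0.01, 0.001], "batch_size": [32, 64, 128]}
best_grid = max(itertools.product(grid["lr"], grid["batch_size"]),
                key=lambda cfg: train_and_evaluate(*cfg))

# Random search: sample configurations from predefined ranges.
samples = [(10 ** random.uniform(-4, -1), random.choice([32, 64, 128, 256]))
           for _ in range(20)]
best_random = max(samples, key=lambda cfg: train_and_evaluate(*cfg))
```
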
12. What are some popular frameworks for building and training
neural networks?
o Answer: Popular deep learning frameworks include:
 TensorFlow: Developed by Google, widely used for research
and production.
 PyTorch: Developed by Facebook, known for its flexibility
and ease of use.
 Keras: A high-level API that can run on top of TensorFlow or
Theano, simplifying neural network development.
 Caffe: Designed for image processing, known for its speed
and efficiency.
 MXNet: Developed by Apache, supports both CPU and GPU
computation.
13. What are the differences between batch gradient descent,
stochastic gradient descent, and mini-batch gradient descent?
o Answer:
 Batch Gradient Descent: Updates weights after processing
the entire training dataset. It is slow but often converges to
the global minimum.
 Stochastic Gradient Descent (SGD): Updates weights
after processing a single training example. It is faster but can
be noisy and oscillate around the minimum.
 Mini-batch Gradient Descent: Updates weights after
processing a small batch of training examples (typically 32-
256). It offers a balance between speed and stability, being
the most commonly used approach.
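
A minimal NumPy sketch of mini-batch gradient descent on a toy linear-regression problem (the data and learning rate are made up); setting batch_size to 1 gives SGD and setting it to len(X) gives batch gradient descent. The outer loop below is one epoch and the inner loop performs one update per batch, which also illustrates the epoch/batch distinction discussed in the next question.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=200)

def gradient(w, Xb, yb):
    # Gradient of the mean squared error for a linear model
    return 2 * Xb.T @ (Xb @ w - yb) / len(yb)

w = np.zeros(3)
lr = 0.1
batch_size = 32   # 1 -> SGD, len(X) -> batch GD, in between -> mini-batch

for epoch in range(20):                      # one epoch = one full pass over X
    idx = rng.permutation(len(X))            # shuffle examples each epoch
    for start in range(0, len(X), batch_size):
        batch = idx[start:start + batch_size]
        w -= lr * gradient(w, X[batch], y[batch])
```
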
14. Explain the difference between "epoch" and "batch" in neural
network training.
o Answer:
 Epoch: One complete pass through the entire training
dataset. Each epoch consists of multiple batches.
 Batch: A small subset of training examples used to update
the weights during one iteration of gradient descent. The size
of the batch can influence the speed and stability of training.
15. What are some common techniques for visualizing the internal
representations learned by neural networks?
o Answer: Techniques for visualizing internal representations
include:
 Activation maps: Showing the activation values of neurons
in different layers, providing insights into which features the
network is focusing on.
 t-SNE (t-Distributed Stochastic Neighbor
Embedding): Reducing the dimensionality of the latent
space to visualize the relationships between different data
points.
 Saliency maps: Highlighting the regions of the input image
that contribute most to the network's prediction.
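
For example, a hidden-layer representation can be projected to two dimensions with scikit-learn's t-SNE; the hidden_activations array below is an assumed placeholder for activations extracted from a trained network:

```python
import numpy as np
from sklearn.manifold import TSNE

# Assumed placeholder: activations of some hidden layer for 500 inputs
hidden_activations = np.random.default_rng(0).normal(size=(500, 64))

# Reduce to 2 dimensions for plotting; nearby points in the embedding
# tend to have similar internal representations.
embedding = TSNE(n_components=2, perplexity=30).fit_transform(hidden_activations)
print(embedding.shape)  # (500, 2)
```
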

16. What is the purpose of "padding" in convolutional layers?

o Answer: Padding is a technique used in convolutional layers to add
extra values (usually zeros) to the borders of the input image. This
helps to preserve the spatial resolution of the feature maps by
preventing the shrinking of the output size after convolution. Padding
can also help to capture features near the edges of the image, which
might be missed otherwise due to the limited receptive field of the
filters.
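
A small sketch of the output-size arithmetic, using the standard formula out = floor((n + 2p - k) / s) + 1, where n is the input size, k the filter size, p the padding on each side and s the stride:

```python
def conv_output_size(n, k, p=0, s=1):
    # n: input size, k: filter size, p: padding on each side, s: stride
    return (n + 2 * p - k) // s + 1

print(conv_output_size(32, 3, p=0))  # 30 -> the output shrinks without padding
print(conv_output_size(32, 3, p=1))  # 32 -> "same" padding preserves the size
```
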
17. Describe the different types of pooling layers commonly used in
CNNs.

o Answer: Common pooling layers in CNNs include:


 Max pooling: Takes the maximum value from a small region
(e.g., 2x2) in the feature map. This helps to reduce the number
of parameters and makes the network more robust to small
variations in the input.
 Average pooling: Calculates the average value from a small
region in the feature map. This can provide a smoother
representation of the features compared to max pooling.
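
A minimal NumPy sketch of 2x2 max pooling and average pooling with stride 2 on a toy single-channel feature map:

```python
import numpy as np

fmap = np.arange(16, dtype=float).reshape(4, 4)   # toy 4x4 feature map

# Split into non-overlapping 2x2 blocks (stride 2), then reduce each block
blocks = fmap.reshape(2, 2, 2, 2)        # (row block, row in block, col block, col in block)
max_pooled = blocks.max(axis=(1, 3))     # strongest response per block
avg_pooled = blocks.mean(axis=(1, 3))    # smoother summary per block

print(max_pooled)   # [[5, 7], [13, 15]]
print(avg_pooled)   # [[2.5, 4.5], [10.5, 12.5]]
```
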
18. What is the difference between "convolution" and "cross-
correlation" in CNNs?

o Answer: Convolution and cross-correlation are similar operations but
with a subtle difference:
 Convolution: The filter is flipped (rotated by 180 degrees)
before being slid over the input image; this is the operation as
defined in signal processing.
 Cross-correlation: The filter is applied directly to the input
image without flipping.

In practice, the term "convolution" is used loosely in deep learning to refer to
both operations: most frameworks actually implement cross-correlation, and
because the filters are learned, the missing flip makes no practical difference.
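
The distinction can be checked directly with SciPy, where the two operations are separate functions; convolution equals cross-correlation with a 180-degree-flipped kernel:

```python
import numpy as np
from scipy.signal import convolve2d, correlate2d

image = np.arange(9, dtype=float).reshape(3, 3)
kernel = np.array([[1.0, 0.0],
                   [0.0, -1.0]])

conv = convolve2d(image, kernel, mode='valid')       # kernel flipped 180 degrees
xcorr = correlate2d(image, kernel, mode='valid')     # kernel applied as-is
flipped = correlate2d(image, kernel[::-1, ::-1], mode='valid')

print(np.allclose(conv, flipped))   # True: convolution == correlation with a flipped kernel
```
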

19. Explain the concept of "stride" in convolutional layers.

o Answer: The stride is the step size by which the filter is moved
across the input image during convolution. A stride of 1 means the
filter moves one pixel at a time, while a larger stride (e.g., 2) means it
skips pixels. Using a stride larger than 1 reduces the size of the
output feature map, leading to a faster computation and potentially
coarser feature extraction.
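
Using the output-size formula from the padding example above: with a 32-pixel-wide input, a 3x3 filter, padding of 1 and stride of 2, the output width is floor((32 + 2 - 3) / 2) + 1 = 16, roughly halving the spatial size.
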
20. What are some common techniques for evaluating the performance
of neural networks?

o Answer: Common evaluation techniques for neural networks include:


 Accuracy: The proportion of correctly classified instances in a
classification task.
 Precision: The proportion of correctly predicted positive
instances among all instances predicted as positive.
 Recall: The proportion of correctly predicted positive instances
among all actual positive instances.
 F1-score: A harmonic mean of precision and recall, providing a
balanced measure of performance.
 AUC (Area Under the Curve): A measure of the classifier's
ability to distinguish between positive and negative instances.
 Mean Squared Error (MSE): A measure of the average
squared difference between predictions and true values in
regression tasks.
 Loss function value: The value of the loss function, which is
minimized during training, indicating the overall error of the
model.
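
As a sketch, most of these metrics are single calls in scikit-learn; the label and score arrays below are invented for illustration:

```python
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, roc_auc_score, mean_squared_error)

y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
y_score = [0.9, 0.2, 0.4, 0.8, 0.1, 0.7, 0.6, 0.3]   # predicted probabilities

print(accuracy_score(y_true, y_pred))    # fraction of correct predictions
print(precision_score(y_true, y_pred))   # TP / (TP + FP)
print(recall_score(y_true, y_pred))      # TP / (TP + FN)
print(f1_score(y_true, y_pred))          # harmonic mean of precision and recall
print(roc_auc_score(y_true, y_score))    # ranking quality of the scores
print(mean_squared_error([2.5, 0.0], [2.0, 0.5]))    # regression example
```
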
21. Describe the concept of a "confusion matrix" and its use in
evaluating classification models.

o Answer: A confusion matrix is a table that summarizes the
performance of a classification model by showing the number of
correct and incorrect predictions for each class. It helps to visualize
the model's accuracy, precision, recall, and other performance
metrics. The rows of the confusion matrix represent the actual class
labels, and the columns represent the predicted class labels. By
analyzing the different cells in the matrix, we can understand the
model's strengths and weaknesses in terms of classifying different
classes.
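
For instance, with scikit-learn (labels invented for illustration), the rows of the returned matrix are the actual classes and the columns are the predicted classes:

```python
from sklearn.metrics import confusion_matrix

y_true = ["cat", "cat", "dog", "dog", "dog", "bird"]
y_pred = ["cat", "dog", "dog", "dog", "cat", "bird"]

# Rows: actual class, columns: predicted class (in the order given by `labels`)
cm = confusion_matrix(y_true, y_pred, labels=["bird", "cat", "dog"])
print(cm)
```
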
22. Explain the concept of "bias-variance trade-off" in machine
learning, particularly in the context of neural networks.

o Answer: The bias-variance trade-off is a fundamental concept in
machine learning, including neural networks, where there is a trade-
off between bias and variance in model predictions:
 Bias: A model's tendency to underfit the data, making
systematic errors. High bias models are often simple and fail to
capture complex patterns in the data.
 Variance: A model's sensitivity to variations in the training
data. High variance models are often complex and can overfit
the training data, leading to poor generalization on unseen
data.

The goal is to find a balance between bias and variance to achieve
optimal performance. Techniques like regularization, early stopping,
and increasing the size of the training dataset can help to manage
this trade-off.

23. What are some common techniques for dealing with imbalanced
datasets in machine learning, specifically in the context of neural
networks?

o Answer: Techniques for handling imbalanced datasets in neural
networks include:
 Oversampling: Increasing the number of instances of the
minority class by replicating or generating synthetic samples.
 Undersampling: Reducing the number of instances of the
majority class.
 Cost-sensitive learning: Assigning different costs to errors
made on different classes, giving more weight to
misclassifications of the minority class.
 Ensemble methods: Combining multiple models trained on
different subsets of the data or with different weighting
schemes.
 Data augmentation: Applying transformations to the minority
class to generate more diverse samples.
 Class-balanced loss functions: Using loss functions that are
specifically designed to handle imbalanced data.
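
One minimal sketch of cost-sensitive learning with a class-weighted loss: each example is weighted inversely to its class frequency, so mistakes on the minority class cost more (the labels and predictions below are synthetic):

```python
import numpy as np

y = np.array([0] * 90 + [1] * 10)          # 90/10 imbalanced binary labels
p = np.clip(np.random.default_rng(0).random(100), 1e-7, 1 - 1e-7)  # predicted P(class 1)

# Weight each class inversely to its frequency
counts = np.bincount(y)
class_weights = len(y) / (2 * counts)      # the minority class gets the larger weight
w = class_weights[y]

# Weighted binary cross-entropy: minority-class mistakes contribute more to the loss
loss = -np.mean(w * (y * np.log(p) + (1 - y) * np.log(1 - p)))
print(class_weights, loss)
```
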
24. What is the purpose of a "softmax" activation function in neural
networks, and how does it work?

o Answer: The softmax activation function is typically used in the
output layer of a neural network for multi-class classification tasks. It
takes a vector of raw scores as input and converts it into a probability
distribution over the different classes, where the sum of the
probabilities across all classes is equal to 1. The softmax function
applies an exponential transformation to each score and then
normalizes the results by dividing by the sum of all exponentiated
scores. This ensures that the output values represent probabilities
and are interpretable as the likelihood of belonging to each class.
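
As a small worked example, raw scores (2, 1, 0) give exponentials e^2 ≈ 7.389, e^1 ≈ 2.718 and e^0 = 1, which sum to about 11.107; dividing each by this sum yields probabilities of roughly 0.665, 0.245 and 0.090, which add up to 1.
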
25. Explain the concept of "batch normalization" and its benefits in
neural network training.

o Answer: Batch normalization is a technique used to normalize the
activations of neurons in a neural network during training. It involves
standardizing the outputs of each layer by subtracting the mean and
dividing by the standard deviation of the activations within a batch of
training examples. This helps to:
 Reduce internal covariate shift: Stabilize the distribution of
activations across layers, preventing the network from
becoming overly sensitive to small changes in the input data.
 Accelerate training: Enable the use of higher learning rates
without causing instability, leading to faster convergence.
 Improve generalization: Regularize the network, preventing
it from overfitting and improving performance on unseen data.
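
A minimal NumPy sketch of the normalization step for one layer's activations over a batch; gamma and beta are the learned scale and shift parameters (initialized to their usual defaults here), and eps avoids division by zero:

```python
import numpy as np

x = np.random.default_rng(0).normal(loc=3.0, scale=2.0, size=(32, 16))  # batch of 32, 16 features

mean = x.mean(axis=0)                     # per-feature mean over the batch
var = x.var(axis=0)                       # per-feature variance over the batch
eps = 1e-5
x_hat = (x - mean) / np.sqrt(var + eps)   # standardized activations

gamma, beta = np.ones(16), np.zeros(16)   # learned scale and shift
out = gamma * x_hat + beta

print(out.mean(axis=0).round(3), out.std(axis=0).round(3))  # ~0 and ~1 per feature
```
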
