2mrk Answers
PART-A
1. What is Deep Learning?
Deep learning is a subset of machine learning that uses neural networks with many layers (hence the term "deep") to
model and understand complex patterns in data. Deep learning algorithms automatically learn feature
representations from raw data such as images, text, and speech, and are particularly useful in tasks like image
recognition, natural language processing, and autonomous driving.
2. What are the main differences between AI, Machine Learning, and Deep Learning?
Artificial Intelligence (AI): The broader field that involves creating machines or systems that can perform
tasks requiring human intelligence, such as decision-making, problem-solving, and understanding natural
language.
Machine Learning (ML): A subset of AI focused on building algorithms that allow computers to learn from
and make predictions or decisions based on data, without being explicitly programmed.
Deep Learning (DL): A further subset of ML that uses artificial neural networks with many layers (deep neural
networks) to model complex patterns in large datasets. It is particularly useful for tasks like image
recognition, speech recognition, and natural language processing.
Speech Recognition: Virtual assistants like Siri, Google Assistant, and speech-to-text systems.
Autonomous Vehicles: Self-driving cars use deep learning for object detection and decision-making.
Robotics: Deep learning helps robots understand and interact with their environment.
Gaming: AI models used in game development to create more realistic non-playable characters (NPCs) and
adversarial agents.
Vector: A one-dimensional array or list of numbers, which represents a point in space or a set of values (e.g.,
position in 2D space as [x, y]).
Matrix: A 2D array of numbers arranged in rows and columns, used to represent data or transformations
(e.g., image data in grayscale).
Tensor: A generalization of matrices to higher dimensions. A 3D tensor could represent a colored image with
width, height, and RGB color channels. In deep learning, tensors are used to represent multidimensional
data.
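For illustration, here is a minimal NumPy sketch (the shapes are arbitrary examples) showing how a scalar, vector, matrix, and 3-D tensor differ only in their number of dimensions:

```python
import numpy as np

scalar = np.float32(3.5)                      # a single number (0-D)
vector = np.array([1.0, 2.0])                 # 1-D array, e.g. a point [x, y]
matrix = np.zeros((28, 28))                   # 2-D array, e.g. a grayscale image
tensor = np.zeros((28, 28, 3))                # 3-D array, e.g. an RGB image (height, width, channels)

print(vector.ndim, matrix.ndim, tensor.ndim)  # 1 2 3
```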
Bayesian Methods: Many deep learning models (such as Bayesian Neural Networks) use probability
distributions to make inferences and predictions.
Learning from Data: Deep learning models typically learn to predict the probabilities of outcomes, rather
than making deterministic predictions. This helps in tasks like classification, where the goal is to predict the
likelihood of an event.
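As a small illustration of probabilistic outputs, the sketch below (plain NumPy, with made-up class scores) converts raw scores into a probability distribution using the softmax function, one common way classifiers express likelihoods rather than hard decisions:

```python
import numpy as np

def softmax(logits):
    # Subtract the maximum for numerical stability, then normalize to probabilities.
    exp = np.exp(logits - np.max(logits))
    return exp / exp.sum()

logits = np.array([2.0, 0.5, -1.0])   # hypothetical raw scores for 3 classes
probs = softmax(logits)               # roughly [0.79, 0.18, 0.04], sums to 1
print(probs, probs.sum())
```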
A random variable is a variable whose value is subject to random variation or uncertainty. It can take different values,
each with an associated probability. Random variables are classified as either discrete or continuous.
Discrete Random Variables take specific values (e.g., the number of heads in 10 coin tosses).
Continuous Random Variables can take any value within a range (e.g., the height of a person or the
temperature).
Overfitting: Occurs when the model is too complex and learns not only the underlying patterns in the data
but also the noise, leading to poor generalization to new data.
Underfitting: Happens when the model is too simple and cannot capture the underlying patterns in the data,
leading to poor performance even on the training data.
The capacity of a model refers to its ability to learn complex patterns and functions. A model with high capacity can
learn more intricate patterns but may also be more prone to overfitting. A model with low capacity may underfit and
fail to capture important patterns in the data.
Choosing simpler models (e.g., linear models instead of deep neural networks).
Limiting the number of parameters in the model (e.g., fewer layers or nodes in neural networks).
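A rough sketch of capacity using polynomial curve fitting (the data and degrees below are illustrative): a straight line has too little capacity for a sine-shaped signal and underfits, while a high-degree polynomial has enough capacity to fit the noise as well.

```python
import numpy as np

# Noisy samples from a simple underlying function.
rng = np.random.default_rng(0)
x = np.linspace(0, 1, 20)
y = np.sin(2 * np.pi * x) + 0.1 * rng.standard_normal(20)

low_capacity = np.polyfit(x, y, deg=1)    # a line: too simple, likely underfits
high_capacity = np.polyfit(x, y, deg=15)  # degree-15 polynomial: can also fit the noise

print(len(low_capacity), len(high_capacity))  # 2 vs 16 parameters
```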
Bayes error is the lowest possible error that can be achieved by any classifier on a given problem, assuming the true
underlying probability distribution is known. It represents the irreducible error in classification tasks due to the
inherent randomness in the data.
Hyperparameters in machine learning are settings that control the training process and model architecture, such as
learning rate, batch size, and number of epochs. Unlike model parameters, hyperparameters are not learned from
data but are set before training.
They are important because they directly affect model performance, learning efficiency, and generalization. Proper
tuning of hyperparameters can improve accuracy, prevent overfitting or underfitting, and help the model converge
more quickly. Hyperparameter tuning is typically done through methods like grid search or random search.
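A minimal sketch of how hyperparameters are fixed before training and tuned with a simple grid search; train_and_validate below is a hypothetical stand-in for a real training run:

```python
def train_and_validate(learning_rate, batch_size):
    # Stand-in for a real training run; returns a fake validation score so the
    # grid-search loop below is runnable. Replace with actual model training.
    return 1.0 / (1.0 + abs(learning_rate - 1e-3)) - 0.001 * batch_size

# Hyperparameters are set before training begins (values here are illustrative).
best = None
for lr in [1e-2, 1e-3, 1e-4]:
    for batch_size in [32, 64, 128]:
        score = train_and_validate(learning_rate=lr, batch_size=batch_size)
        if best is None or score > best[0]:
            best = (score, lr, batch_size)

print(best)   # the best validation score and the hyperparameters that produced it
```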
20. How do you solve the overfitting problem caused by learning hyperparameters on the training dataset?
Applying early stopping to halt training when performance on the validation set starts to degrade.
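A minimal early-stopping sketch (the train_one_epoch and evaluate callables are hypothetical stand-ins for a real training and validation loop):

```python
def fit_with_early_stopping(train_one_epoch, evaluate, max_epochs=100, patience=5):
    # Stop when the validation loss has not improved for `patience` consecutive epochs.
    best_val_loss = float("inf")
    epochs_without_improvement = 0
    for epoch in range(max_epochs):
        train_one_epoch()
        val_loss = evaluate()
        if val_loss < best_val_loss:
            best_val_loss = val_loss
            epochs_without_improvement = 0   # improvement: reset the counter
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                break                        # halt before overfitting worsens
    return best_val_loss
```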
Point estimators are statistics that estimate the value of a parameter (e.g., mean, variance) based on sample data.
They provide a single value (point) as an estimate of the true parameter.
Efficiency: The estimator has the smallest variance among all unbiased estimators.
Consistency: As the sample size increases, the estimator converges to the true value of the parameter.
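A quick worked example of point estimation with NumPy (the sample values are made up): the sample mean and sample variance are single-number estimates of the corresponding population parameters.

```python
import numpy as np

# A sample drawn from some population (illustrative values).
sample = np.array([4.8, 5.1, 5.0, 4.9, 5.3, 5.2])

mean_hat = sample.mean()       # point estimate of the population mean
var_hat = sample.var(ddof=1)   # unbiased point estimate of the population variance
print(mean_hat, var_hat)
```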
A deep feedforward network is a type of artificial neural network where information moves only in one direction—
from input to output—through multiple layers of neurons. It is used in supervised learning tasks such as classification
and regression.
In a feedforward neural network, the input is passed through a series of hidden layers where each neuron processes
the input through an activation function and passes it to the next layer. The output layer produces the final prediction
or classification.
Hidden layers: Layers that perform computations and learn features from the input data.
Output layer: The layer that produces the final output, which could be a classification or a regression value.
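A minimal feedforward network sketch, assuming PyTorch is available (the layer sizes are illustrative); data flows strictly forward from input to output:

```python
import torch
import torch.nn as nn

# A small feedforward (fully connected) network: input -> two hidden layers -> output.
model = nn.Sequential(
    nn.Linear(784, 128),   # input features -> first hidden layer
    nn.ReLU(),             # activation function
    nn.Linear(128, 64),    # second hidden layer
    nn.ReLU(),
    nn.Linear(64, 10),     # output layer, e.g. scores for 10 classes
)

x = torch.randn(32, 784)   # a batch of 32 flattened 28x28 inputs
logits = model(x)          # information moves only forward through the layers
print(logits.shape)        # torch.Size([32, 10])
```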
Regularization is a technique used to prevent overfitting by adding a penalty term to the loss function, which
discourages overly complex models. Common regularization methods include L2 regularization (Ridge) and L1
regularization (Lasso).
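A short sketch of both penalties, assuming PyTorch: L2 regularization applied through the optimizer's weight decay, and an explicit L1 penalty added to the loss (the coefficients are illustrative):

```python
import torch
import torch.nn as nn

model = nn.Linear(20, 1)
criterion = nn.MSELoss()

# L2 regularization (Ridge) via weight decay in the optimizer.
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, weight_decay=1e-4)

# L1 regularization (Lasso) as an explicit penalty added to the loss.
x, y = torch.randn(8, 20), torch.randn(8, 1)
l1_lambda = 1e-4
loss = criterion(model(x), y)
loss = loss + l1_lambda * sum(p.abs().sum() for p in model.parameters())
loss.backward()
```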
Dropout is a regularization technique where, during training, random units (neurons) are "dropped" or ignored in
each iteration. This helps prevent overfitting by ensuring that the network does not rely too heavily on any particular
neuron.
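A minimal dropout sketch, assuming PyTorch: during training random units are zeroed out, while at evaluation time dropout is switched off:

```python
import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)   # each unit is dropped with probability 0.5 during training
x = torch.ones(1, 10)

drop.train()               # training mode: random units are zeroed, the rest rescaled
print(drop(x))

drop.eval()                # evaluation mode: dropout is disabled, inputs pass unchanged
print(drop(x))
```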
Regularization:
Regularization refers to techniques used to prevent a model from overfitting to the training data by
introducing additional constraints or penalties. The goal is to create a simpler model that generalizes well to
unseen data. Common regularization methods include L1 (Lasso) and L2 (Ridge) regularization, where the
model's complexity is penalized based on the size of the coefficients. Regularization discourages the model
from assigning too much importance to any particular feature, which helps to prevent overfitting.
Optimization:
Optimization refers to the process of finding the best parameters (weights) for a machine learning model to
minimize or maximize an objective function (such as a loss or cost function). Optimization methods, such as
Gradient Descent, adjust the model's parameters iteratively to minimize the loss function. In contrast to
regularization, which specifically controls the model's complexity, optimization focuses on finding the best fit
for the data.
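A bare-bones gradient descent sketch on a one-parameter least-squares problem (the data and learning rate are illustrative), showing how optimization repeatedly moves the parameter against the gradient of the loss:

```python
import numpy as np

# Minimize the mean squared error of (w * x - y); the true w here is 2.
x, y = np.array([1.0, 2.0, 3.0]), np.array([2.0, 4.0, 6.0])
w, lr = 0.0, 0.1

for step in range(100):
    grad = 2 * np.mean((w * x - y) * x)   # gradient of the loss with respect to w
    w -= lr * grad                        # step against the gradient
print(w)                                  # converges towards 2.0
```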
30. How Splitting a Dataset into Train, Dev, and Test Sets Helps Identify Overfitting
When you split a dataset into three parts—training, development (validation), and test sets—it allows you to
identify if your model is overfitting, as follows:
Training Set: Used to train the model and adjust its parameters.
Development (Validation) Set: Used to tune hyperparameters (such as learning rate, model complexity, etc.)
and evaluate the model's performance during training.
Test Set: Used only after the model is trained and hyperparameters are finalized, providing an unbiased
estimate of the model's generalization performance.
Overfitting occurs when the model performs very well on the training data but poorly on unseen data (validation or
test set). By using a validation set, you can monitor if the model's performance is significantly better on the training
set than on the validation set. If the model has high training accuracy but low validation accuracy, it's likely
overfitting. If this discrepancy is large, techniques like regularization, pruning, or cross-validation might be used to
address the overfitting.
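A common way to produce the three splits, assuming scikit-learn is available (the 60/20/20 ratio is just one typical choice and the data below are synthetic):

```python
import numpy as np
from sklearn.model_selection import train_test_split

X, y = np.random.randn(1000, 10), np.random.randint(0, 2, 1000)

# First carve out the test set, then split the remainder into train and dev.
X_temp, X_test, y_temp, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_dev, y_train, y_dev = train_test_split(X_temp, y_temp, test_size=0.25, random_state=0)

print(len(X_train), len(X_dev), len(X_test))   # 600 / 200 / 200
```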
Definition:
SGD is an optimization algorithm where model parameters are updated based on the gradient of the loss function for
a single data point (or mini-batch), rather than the whole dataset.
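A per-sample SGD sketch on a synthetic linear-regression problem (the data, learning rate, and epoch count are illustrative): each update uses the gradient from a single example rather than the full dataset.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((500, 3))
true_w = np.array([1.0, -2.0, 0.5])
y = X @ true_w + 0.01 * rng.standard_normal(500)

w, lr = np.zeros(3), 0.01
for epoch in range(5):
    for i in rng.permutation(len(X)):          # visit samples in random order
        grad = 2 * (X[i] @ w - y[i]) * X[i]    # gradient from a single data point
        w -= lr * grad                         # noisy but cheap parameter update
print(w)                                        # close to [1.0, -2.0, 0.5]
```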
Merits:
1. Faster updates: Each update uses only a single data point (or a small mini-batch), so it is cheap to compute.
2. Memory efficient: Requires less memory, making it scalable for large datasets.
3. Helps avoid local minima: The noisy updates can help escape local minima.
Demerits:
1. Noisy updates: The model may oscillate around the optimal solution.