MLT Unit 3
NVIDIA CUDA (Compute Unified Device Architecture) is a parallel computing platform and
application programming interface (API) model created by NVIDIA. It allows developers to
leverage the power of NVIDIA GPUs (Graphics Processing Units) for general-purpose
processing tasks beyond just rendering graphics. CUDA enables efficient parallel computing
on NVIDIA GPUs, making it possible to accelerate a wide range of computationally intensive
applications.
1. Parallel Computing: CUDA provides a framework for harnessing the parallel processing
capabilities of GPUs. It allows developers to write programs that can perform thousands of
parallel tasks simultaneously, making it suitable for tasks that can be divided into many
smaller, independent operations.
2. C/C++ Language Extensions: CUDA extends the C and C++ programming languages
with special keywords and constructs to define and control parallel execution on the GPU.
This allows developers to write GPU-accelerated code using familiar programming languages.
3. Tools and SDK: NVIDIA offers a comprehensive suite of development tools and software
development kits (SDKs) to assist in CUDA development. These tools include profilers,
debuggers, and performance analysis tools.
4. Compatibility: CUDA is compatible with a wide range of NVIDIA GPUs, from entry-level to high-end models. It also supports Windows and Linux (macOS support was discontinued after CUDA 10.2).
5. Applications: CUDA has been widely adopted in various fields, including scientific
computing, machine learning, deep learning, computer vision, data analytics, and more. It has
played a crucial role in accelerating the performance of applications in these domains.
CUDA has had a significant impact on the field of high-performance computing and has
made GPU acceleration accessible to a broad range of developers and researchers. It
continues to be a vital technology for accelerating computationally intensive workloads across
different industries.
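As a quick sanity check of a CUDA setup, the short TensorFlow snippet below (a minimal sketch, assuming a GPU-enabled TensorFlow build is installed) reports whether the library was built with CUDA and which NVIDIA GPUs it can see.

```python
import tensorflow as tf

# Report whether this TensorFlow build includes CUDA support
print("Built with CUDA:", tf.test.is_built_with_cuda())

# List the NVIDIA GPUs that TensorFlow (via CUDA) can currently see
print("Visible GPUs:", tf.config.list_physical_devices("GPU"))
```

If no GPUs are listed, the CUDA driver/toolkit installation or the TensorFlow build itself is usually the first thing to check.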
NVIDIA cuDNN
The NVIDIA cuDNN (CUDA Deep Neural Network) toolkit is a GPU-accelerated library of
primitives for deep neural networks (DNNs). It is developed by NVIDIA and is designed to
improve the performance of deep learning frameworks that utilize GPUs (Graphics
Processing Units) for training and inference tasks.
1. Performance Boost: By utilizing cuDNN, deep learning models can achieve significantly
faster training and inference times compared to running on CPUs alone. This is particularly
important for large-scale neural networks and computationally intensive tasks.
2. DNN Primitives: cuDNN offers a set of low-level DNN primitives that include
convolution, pooling, normalization, activation, tensor operations, and more. These primitives
are building blocks for constructing deep learning models.
3. Cross-Platform: cuDNN itself runs only on NVIDIA GPUs, but deep learning frameworks such as TensorFlow support multiple backends, allowing users to switch between cuDNN and other libraries (for example, CPU kernels) for portability.
In summary, cuDNN is a critical tool for accelerating deep learning workloads on NVIDIA
GPUs. It enables researchers and practitioners to train and deploy deep neural networks more
efficiently, leading to faster model development and improved performance in a wide range
of machine learning and artificial intelligence applications.
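To confirm which CUDA and cuDNN versions a framework was compiled against, recent TensorFlow GPU builds expose this in their build info. The sketch below is illustrative and assumes such a build; the exact keys can vary between versions, so it falls back to "n/a" when a key is missing.

```python
import tensorflow as tf

# Build metadata for this TensorFlow installation (GPU builds report CUDA/cuDNN versions)
info = tf.sysconfig.get_build_info()
print("CUDA version: ", info.get("cuda_version", "n/a"))
print("cuDNN version:", info.get("cudnn_version", "n/a"))
```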
The loss function estimates how well a particular algorithm models the provided data. Loss functions fall into two classes based on the type of learning task: regression losses and classification losses.
Suppose you are building a linear regression model to predict the price of houses based on
their square footage. You have a dataset with the following data:
To calculate the MSE, you square the difference between each predicted and actual price and take the mean of these squared differences; no square root is involved (taking the square root of the MSE gives the RMSE, which is in the same units as the price). For this model, the resulting root mean squared error works out to $200,000.
A related metric is the Mean Absolute Error (MAE) = (1/n) * Σ |actual - predicted|, i.e., the average of the absolute differences between predictions and actual values.
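As a small illustration of both formulas, the numpy sketch below computes MSE, RMSE, and MAE on a few hypothetical house prices (these numbers are made up for the example and are not the dataset referenced above).

```python
import numpy as np

# Hypothetical actual and predicted house prices (in dollars)
actual    = np.array([250_000.0, 300_000.0, 420_000.0])
predicted = np.array([260_000.0, 310_000.0, 400_000.0])

errors = actual - predicted
mse  = np.mean(errors ** 2)      # mean of the squared differences
rmse = np.sqrt(mse)              # square root of MSE, same units as the price
mae  = np.mean(np.abs(errors))   # mean of the absolute differences

print(f"MSE:  {mse:,.0f}")
print(f"RMSE: {rmse:,.0f}")
print(f"MAE:  {mae:,.0f}")
```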
Suppose you are working on a binary classification problem where you predict whether an
email is spam (1) or not spam (0). You have the following data for four emails:
So, the average binary cross-entropy loss for this binary classification model is approximately 0.43. Note that the negative sign in the formula makes the loss non-negative, so the larger the value, the worse the model's performance.
Let's go through a numerical example of the Mean Squared Error (MSE) loss function, which is commonly used in regression tasks. In this example, we'll calculate the MSE for a simple linear regression problem with three data points.
Data point 1:
- True value (target): 10
- Predicted value: 12
Data point 2:
- True value: 15
- Predicted value: 18
Data point 3:
- True value: 20
- Predicted value: 22
To calculate the Mean Squared Error (MSE) for this dataset, follow these steps:
1. Calculate the squared error for each data point, i.e., the square of the difference
between the true value and the predicted value:
- Data point 1: (12 - 10)² = 4
- Data point 2: (18 - 15)² = 9
- Data point 3: (22 - 20)² = 4
2. Average the squared errors:
MSE = (4 + 9 + 4) / 3 = 17 / 3 ≈ 5.67
So, the Mean Squared Error (MSE) for this dataset is approximately 5.67. This value
represents the average squared difference between the predicted values and the true
target values. Lower MSE values indicate better model performance, as they signify
that the predictions are closer to the true values.
Keep in mind that in practice, machine learning libraries and frameworks handle the
calculation of loss functions automatically during the training process. The model's
objective during training is to minimize this loss function (MSE in this case) by
adjusting its parameters (weights and biases) through techniques like gradient descent.
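As a check on the arithmetic above, and to illustrate the point about frameworks handling the calculation, the Keras built-in MSE loss reproduces the same value for these three data points (a minimal sketch using `tf.keras.losses.MeanSquaredError`):

```python
import tensorflow as tf

y_true = [10.0, 15.0, 20.0]   # true target values from the example
y_pred = [12.0, 18.0, 22.0]   # predicted values from the example

mse = tf.keras.losses.MeanSquaredError()
print(float(mse(y_true, y_pred)))  # ≈ 5.67
```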
Here are solutions to a few more numerical examples on loss functions:
2. Cross-Entropy Loss:
True Labels: [1, 0, 1, 0]
Predicted Probabilities: [0.9, 0.2, 0.8, 0.3]
Cross-Entropy Loss = -(1/4) * [1 * log(0.9) + (1 - 0) * log(1 - 0.2) + 1 * log(0.8) + (1 - 0) * log(1 - 0.3)]
= -(1/4) * [(-0.1054) + (-0.2231) + (-0.2231) + (-0.3567)]
= -(1/4) * (-0.9083)
≈ 0.2271
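The same value can be checked with a few lines of numpy (a minimal sketch applying the binary cross-entropy formula to the labels and probabilities above):

```python
import numpy as np

y_true = np.array([1, 0, 1, 0])          # true labels
p      = np.array([0.9, 0.2, 0.8, 0.3])  # predicted probabilities of class 1

# Average binary cross-entropy over the four examples
loss = -np.mean(y_true * np.log(p) + (1 - y_true) * np.log(1 - p))
print(round(loss, 4))  # ≈ 0.2271
```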
3. Hinge Loss:
True Labels: [1, -1, 1, -1]
Model Scores: [0.5, -0.7, 0.9, -0.2]
Using hinge loss = max(0, 1 - y * score), averaged over the examples:
Hinge Loss = (1/4) * [max(0, 1 - (1)(0.5)) + max(0, 1 - (-1)(-0.7)) + max(0, 1 - (1)(0.9)) + max(0, 1 - (-1)(-0.2))]
= (1/4) * [0.5 + 0.3 + 0.1 + 0.8]
= (1/4) * 1.7
= 0.425
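A quick numpy check of the hinge loss computation (using labels in {-1, +1} and the scores above):

```python
import numpy as np

y      = np.array([1, -1, 1, -1])           # true labels
scores = np.array([0.5, -0.7, 0.9, -0.2])   # raw model scores

# Hinge loss: max(0, 1 - y * score), averaged over the examples
loss = np.mean(np.maximum(0.0, 1.0 - y * scores))
print(loss)  # 0.425
```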
4. Huber Loss:
Actual Values: [5, 8, 12, 18]
Predicted Values: [7, 9, 13, 15]
Delta = 2
Using the Huber definition (0.5 * error² when |error| ≤ delta, otherwise delta * (|error| - 0.5 * delta)):
- Errors: 7 - 5 = 2, 9 - 8 = 1, 13 - 12 = 1, 15 - 18 = -3
- Per-point losses: 0.5 * 2² = 2, 0.5 * 1² = 0.5, 0.5 * 1² = 0.5, 2 * (3 - 1) = 4
Huber Loss = (2 + 0.5 + 0.5 + 4) / 4 = 1.75
So, the average Huber loss for these predictions with a delta of 2 is 1.75.
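And the corresponding numpy check for the Huber loss (same piecewise definition as above, with delta = 2):

```python
import numpy as np

y_true = np.array([5.0, 8.0, 12.0, 18.0])
y_pred = np.array([7.0, 9.0, 13.0, 15.0])
delta  = 2.0

err = np.abs(y_true - y_pred)
# Quadratic for small errors, linear for large errors
per_point = np.where(err <= delta, 0.5 * err ** 2, delta * (err - 0.5 * delta))
print(per_point.mean())  # 1.75
```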
These are the solutions to the numerical examples involving different loss functions.
Cross-entropy loss, often referred to as log loss or logistic loss, is a commonly used
loss function in machine learning and deep learning, especially in binary and multi-
class classification problems. It measures the dissimilarity between the predicted
probabilities (or scores) and the actual true labels of the data.
For a single example, the binary cross-entropy is L(y, p) = -[y * log(p) + (1 - y) * log(1 - p)], and the multi-class form is L(y, p) = -Σ y_i * log(p_i).
Where:
- L(y, p) is the cross-entropy loss.
- y is the true class label (binary: 0 or 1) or a one-hot encoded vector of true labels (multi-class).
- p is the predicted probability of the positive class (binary) or a vector of predicted probabilities for each class, summing to 1 (multi-class).
In the multi-class case, the loss considers the logarithm of the predicted probability of
the true class label while penalizing deviations from the true distribution of class
probabilities.
The goal during training is to minimize the cross-entropy loss. This is typically done
using optimization algorithms like gradient descent, which iteratively adjusts the
model's parameters to reduce the loss, thereby improving the model's predictive
performance. Cross-entropy loss is a suitable choice for classification tasks because it
encourages the model to produce more confident and accurate class predictions.
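For the multi-class case, a minimal numpy sketch with a hypothetical 3-class prediction shows that only the predicted probability of the true class contributes to the loss:

```python
import numpy as np

y_true = np.array([0, 1, 0])          # one-hot true label (true class is index 1)
p      = np.array([0.2, 0.7, 0.1])    # predicted class probabilities (sum to 1)

# Multi-class cross-entropy: -sum(y_i * log(p_i)) reduces to -log(p of the true class)
loss = -np.sum(y_true * np.log(p))
print(round(loss, 4))  # -log(0.7) ≈ 0.3567
```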
--------------------------------------------------
1. Tokenization:
- Tokenization is the process of breaking down text into smaller units, such as words
or subword tokens. You can use libraries like TensorFlow's Tokenizer or popular NLP
libraries like NLTK or spaCy for this.
2. Vectorization:
- Convert your text data into numerical format that can be fed into a machine
learning model. Common approaches include:
- Bag of Words (BoW): Represents text as a vector of word frequencies.
- TF-IDF (Term Frequency-Inverse Document Frequency): Represents text based
on the importance of words in documents.
- Word Embeddings: Pre-trained word embeddings like Word2Vec, GloVe, or
FastText can be used to represent words as dense vectors.
3. Split Your Data:
- Split your dataset into training, validation, and testing sets. A common split is 70-
80% for training, 10-15% for validation, and the remaining for testing.
4. Training:
- Train your model on the training dataset using the `model.fit()` method (see the sketch after this list). You may need to experiment with hyperparameters like batch size, learning rate, and the number of epochs.
5. Evaluation:
- Evaluate your model's performance on the validation and test datasets using
metrics like accuracy, precision, recall, and F1-score.
6. Inference:
- Use your trained model to classify new text documents by feeding them through
the model's `predict()` method.
7. Deployment:
- Once you're satisfied with your model's performance, you can deploy it for real-
world use, such as integrating it into a web application or API.
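The sketch below strings the steps above together for a tiny, hypothetical spam example using Keras: `TextVectorization` for tokenization and vectorization, a small embedding classifier, `model.fit()` for training, and `predict()` for inference. The texts and labels are made up purely for illustration.

```python
import tensorflow as tf

# Hypothetical toy dataset; in practice you would load your own labeled documents
texts  = ["free money now", "meeting at noon", "win a prize today", "project status update"]
labels = tf.constant([1, 0, 1, 0])  # 1 = spam, 0 = not spam

# Tokenization + vectorization: map each document to a fixed-length sequence of word ids
vectorizer = tf.keras.layers.TextVectorization(max_tokens=1000, output_sequence_length=10)
vectorizer.adapt(texts)
X = vectorizer(tf.constant(texts))

# A small embedding + pooling classifier
model = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=1000, output_dim=16),
    tf.keras.layers.GlobalAveragePooling1D(),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# Training and inference
model.fit(X, labels, epochs=10, verbose=0)
print(model.predict(vectorizer(tf.constant(["claim your free prize"]))))
```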
-----------------------------------------------------
Image classification and visualization:
⚫ Image classification is a common computer vision task where you train a model
to classify images into predefined categories or classes.
⚫ TensorFlow is a powerful deep learning framework that can be used for image
classification tasks.
⚫ Additionally, visualization is crucial for understanding the model's performance
and gaining insights into its decision-making process.
⚫ Here's a step-by-step guide on how to perform image classification and
visualization using TensorFlow:
Image Classification:
1. Prepare Your Data:
- Collect and preprocess your image data. Ensure that your images are properly
labeled into different classes.
2. Data Augmentation (Optional):
- To improve the model's robustness and generalization, you can apply data
augmentation techniques to your images.
TensorFlow provides the `ImageDataGenerator` class for this purpose.
3. Evaluation:
- Evaluate your model's performance on the test dataset and analyze metrics like accuracy, precision, recall, and F1-score (a minimal end-to-end sketch follows this list).
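Below is a minimal end-to-end sketch of these steps using the built-in CIFAR-10 dataset (10 classes of 32x32 color images) and a small CNN; a real project would use your own labeled images and a larger model.

```python
import tensorflow as tf

# Load and scale the built-in CIFAR-10 dataset
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar10.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

# A small convolutional classifier
model = tf.keras.Sequential([
    tf.keras.layers.Conv2D(32, 3, activation="relu", input_shape=(32, 32, 3)),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

model.fit(x_train, y_train, epochs=3, validation_split=0.1)
print(model.evaluate(x_test, y_test))
```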
Visualization:
1. Confusion Matrix:
- Generate a confusion matrix to understand which classes the model is struggling to
classify correctly.
2. Visualization of Predictions:
- Visualize some example predictions along with their true labels to understand how well the model is performing (see the sketch below).
These visualization techniques can help you gain insights into your image
classification model's performance and make improvements as needed.
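A minimal sketch of both visualizations, assuming the `model`, `x_test`, and `y_test` from the classification sketch above, with scikit-learn for the confusion matrix and matplotlib for plotting:

```python
import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix

# Predicted class for each test image (assumes model/x_test/y_test from the sketch above)
y_pred = np.argmax(model.predict(x_test), axis=1)

# Confusion matrix as a heat map
cm = confusion_matrix(y_test.flatten(), y_pred)
plt.imshow(cm, cmap="Blues")
plt.xlabel("Predicted class")
plt.ylabel("True class")
plt.colorbar()
plt.show()

# A few example predictions next to their true labels
for i in range(3):
    plt.imshow(x_test[i])
    plt.title(f"predicted: {y_pred[i]}, true: {y_test[i][0]}")
    plt.axis("off")
    plt.show()
```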
Time series with Recurrent Neural Networks (RNN):
1. Architecture:
- In RNN, each time step is processed one at a time. At each time step, the RNN
takes two inputs: the current input data point and the hidden state from the previous
time step.
- The RNN updates its hidden state based on the current input and the previous
hidden state using a set of learned weights.
- The updated hidden state is then used to make predictions or is passed to the next
time step.
2. Training:
- RNNs are trained using backpropagation through time (BPTT). This means that
the model's weights are updated not only for the current time step but also for all
previous time steps in the sequence.
3. Applications:
- RNNs are widely used for various time series tasks, such as time series forecasting, anomaly detection, and sequence generation (a minimal forecasting sketch follows).
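A minimal forecasting sketch with a `SimpleRNN` on a synthetic sine wave (the data is generated just for illustration): each training example is a window of 20 past values and the target is the next value.

```python
import numpy as np
import tensorflow as tf

# Synthetic series: a sine wave sampled at regular intervals
series = np.sin(np.arange(0, 100, 0.1))
window = 20

# Sliding windows of 20 past values -> next value
X = np.array([series[i:i + window] for i in range(len(series) - window)])
y = series[window:]
X = X[..., np.newaxis]  # shape: (samples, timesteps, features)

model = tf.keras.Sequential([
    tf.keras.layers.SimpleRNN(32, input_shape=(window, 1)),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
model.fit(X, y, epochs=5, verbose=0)

# One-step-ahead forecast from the most recent window
print(model.predict(series[-window:].reshape(1, window, 1)))
```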
------------------------------------------------------
Text Generation with Recurrent Neural Networks:
1. Data Preparation:
- To train an RNN for text generation, you need a large corpus of text data as your
training dataset. This can be books, articles, poems, or any text source.
- Tokenize the text into words or characters and create a vocabulary.
2. Sequence Generation:
- Divide the text into sequences of fixed length (e.g., a sentence or a paragraph).
Each sequence will serve as an input to the RNN.
3. Model Architecture:
- You can use various types of RNN architectures for text generation, such as
vanilla RNNs, LSTM (Long Short-Term Memory), or GRU (Gated Recurrent Unit).
- The RNN takes a sequence of words or characters as input and learns to predict the
next word or character in the sequence.
4. Training:
- Train the RNN to minimize the prediction error by comparing the predicted word
or character to the actual next word or character.
- This process is similar to a classification problem, where you classify the next
token from the vocabulary.
5. Sampling:
- To generate text, you start with an initial seed sequence and use the trained RNN
to predict the next word or character.
- Append the predicted token to the sequence and repeat the process iteratively to
generate longer text.
6. Applications:
- Text generation with RNNs is used in various applications, including chatbots, creative writing assistance, and even generating code (a minimal character-level sketch follows this list).
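The sketch below puts steps 1-5 together at character level on a tiny made-up corpus; the corpus, sequence length, and model size are all illustrative, and real text generation needs far more data and training.

```python
import numpy as np
import tensorflow as tf

# Tiny hypothetical corpus; real training uses a large body of text
text = "hello world. hello there. hello again. "
chars = sorted(set(text))
char2idx = {c: i for i, c in enumerate(chars)}
idx2char = np.array(chars)

# Fixed-length input sequences and the character that follows each of them
seq_len = 10
X = np.array([[char2idx[c] for c in text[i:i + seq_len]] for i in range(len(text) - seq_len)])
y = np.array([char2idx[text[i + seq_len]] for i in range(len(text) - seq_len)])

# Embedding + LSTM that predicts the next character as a classification over the vocabulary
model = tf.keras.Sequential([
    tf.keras.layers.Embedding(len(chars), 16),
    tf.keras.layers.LSTM(64),
    tf.keras.layers.Dense(len(chars), activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
model.fit(X, y, epochs=50, verbose=0)

# Sampling: start from a seed and repeatedly append the predicted next character
generated = "hello worl"
for _ in range(40):
    ids = np.array([[char2idx[c] for c in generated[-seq_len:]]])
    probs = model.predict(ids, verbose=0)[0]
    probs = probs / probs.sum()  # guard against floating-point drift
    generated += idx2char[np.random.choice(len(chars), p=probs)]
print(generated)
```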