
Implementation of Deep Learning Models

Colaboratory: Tesla GPU based Free Cloud

Cost: INR 3.5 Lakh

https://colab.research.google.com
Cloud Configurations

# GPU count and name (SMI: System Management Interface)
!nvidia-smi -L
!nvidia-smi

# CPU model name
!lscpu | grep 'Model name'

# Number of sockets, i.e. available slots for physical processors
!lscpu | grep 'Socket(s):'

# Number of cores per physical processor
!lscpu | grep 'Core(s) per socket:'
Configuration: GPU Based Remote System

GPU: 1 x Tesla K80, compute capability 3.7, 2496 CUDA cores, 12 GB GDDR5 VRAM

CPU: 1 x single-core, hyper-threaded (1 core, 2 threads) Xeon processor @ 2.3 GHz (no Turbo Boost), 45 MB cache

RAM: ~12.6 GB available

Disk: ~320 GB available (OverlayFS)

Idle timeout: 90 minutes

Every 12 hours: disk, RAM, VRAM, CPU cache etc. on the allotted virtual machine are erased

https://colab.research.google.com/drive/151805XTDg--dgHb3-AXJCpnWaqRhop_2#scrollTo=vEWe-FHNDY3E
Extraction of Data Files

!apt-get install p7zip-full
!p7zip -d file_name.tar.7z
!tar -xvf file_name.tar

from google.colab import files

Sync Google Drive
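A minimal sketch of syncing Google Drive into a Colab notebook; the "DeepLearning" folder name is only a hypothetical example:

# Mount Google Drive into the Colab virtual machine
from google.colab import drive
drive.mount('/content/drive')

# List the contents of a hypothetical project folder on the mounted Drive
!ls "/content/drive/My Drive/DeepLearning"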
Deep Learning and Transfer Functions in Keras

1. An activation function (or transfer function) is used to determine the output of a node.

2. It determines the output of the neural network, e.g. a Yes or No decision.

3. It maps the resulting values into a range such as 0 to 1 or -1 to 1 (depending upon the function).
Categories of Activation / Transfer Functions

• Linear Activation Function
• Non-linear Activation Functions

If no activation function is applied, the output signal is simply a linear function (a polynomial of degree one), whereas deep networks need to model complex relationships.

Equation: f(x) = x
Range: (-infinity to infinity)

A linear activation is not fit for the complexity or the varied parameters of the usual (real-time) data fed to neural networks; a stack of purely linear layers collapses into one linear mapping, as the sketch below illustrates.
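A minimal numpy sketch (illustrative; the layer sizes and random weights are assumptions, not from the slides) showing that two linear layers compose into a single equivalent linear layer:

import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 8))          # small batch: 5 samples, 8 features

# Two "layers" with identity (linear) activation: y = x @ W + b
W1, b1 = rng.normal(size=(8, 12)), rng.normal(size=12)
W2, b2 = rng.normal(size=(12, 4)), rng.normal(size=4)

h = x @ W1 + b1                      # first linear layer
y = h @ W2 + b2                      # second linear layer

# The same mapping expressed as a single linear layer
W = W1 @ W2
b = b1 @ W2 + b2
y_single = x @ W + b

print(np.allclose(y, y_single))      # True: the two linear layers collapse into one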

Images have their encoding in the spatial domain rather than the frequency domain.
Non-linear Activation Functions

• The most widely used activation functions
• Make it easy for the model to adapt to / generalize over a variety of data and to differentiate between outputs

Key terminologies for understanding nonlinear functions:

• Derivative or differential: the change along the y-axis with respect to the change along the x-axis (the slope)
• Monotonic function: a function which is either entirely non-increasing or entirely non-decreasing

The nonlinear activation functions are mainly divided on the basis of their range or curves.
Activation Function: Sigmoid / Logistic

• Mathematical formula is f(x) = 1 / (1 + exp(-x)).
• Its output lies between 0 and 1, i.e. 0 < output < 1.
Activation Function: Tanh (Hyperbolic Tangent)

• Mathematical formula is f(x) = (1 - exp(-2x)) / (1 + exp(-2x)).

• Its output is zero-centered because its range lies between -1 and 1, i.e. -1 < output < 1.

• Optimization is easier with tanh, hence in practice it is generally preferred over the sigmoid function.
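A minimal numpy sketch (illustrative, not part of the slides) of the sigmoid and tanh formulas above; it assumes only numpy:

import numpy as np

def sigmoid(x):
    # Logistic function: squashes inputs into (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    # Hyperbolic tangent written as on the slide: zero-centered, range (-1, 1)
    return (1.0 - np.exp(-2.0 * x)) / (1.0 + np.exp(-2.0 * x))

x = np.array([-3.0, -1.0, 0.0, 1.0, 3.0])
print(sigmoid(x))   # values in (0, 1), equal to 0.5 at x = 0
print(tanh(x))      # values in (-1, 1), equal to 0 at x = 0
print(np.allclose(tanh(x), np.tanh(x)))  # matches numpy's built-in tanh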

The sigmoid and hyperbolic tangent activation functions cannot be used in networks with many layers due to the vanishing gradient problem.
Need for the Rectified Linear Unit (ReLU)

• Overcomes the vanishing gradient problem, allowing models to learn faster and perform better.

• ReLU is the default activation when developing MLPs and CNNs; the model takes less time to train and run.

• Since ReLU outputs 0 for all negative inputs, a given unit may not activate at all (useful for missing data or data sparsity).

• The downside of being zero for all negative values is a problem called the dying ReLU: affected neurons remain inactive no matter what input is supplied, and no gradient flows through them.

• Leaky ReLU adds a small slope a for negative inputs, which increases the range of the function. Usually the value of a is 0.01 or so; when a is not fixed at 0.01 but chosen randomly, it is called Randomized ReLU. The range of Leaky ReLU is therefore (-infinity to infinity). A small sketch of ReLU and Leaky ReLU follows this list.
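A minimal numpy sketch (illustrative, not from the slides) of ReLU and Leaky ReLU with the leak a = 0.01 mentioned above:

import numpy as np

def relu(x):
    # max(0, x): zero for all negative inputs
    return np.maximum(0.0, x)

def leaky_relu(x, a=0.01):
    # Same as ReLU for x > 0, but keeps a small slope a for x < 0
    # so the unit never becomes completely inactive
    return np.where(x > 0, x, a * x)

x = np.array([-5.0, -1.0, 0.0, 1.0, 5.0])
print(relu(x))         # [0.  0.  0.  1.  5.]
print(leaky_relu(x))   # [-0.05 -0.01  0.  1.  5.]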
Avoidance of Vanishing Gradient with ReLU

In back-propagation, while calculating gradients of the loss (error) with respect to the weights, the gradients tend to get smaller and smaller as we keep moving backward through the network. This means that the neurons in the earlier layers learn very slowly compared with the neurons in the later layers of the hierarchy, so the earlier layers in the network are the slowest to train. The numerical sketch below shows how this happens with sigmoid activations and why ReLU avoids it.
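An illustrative numpy sketch (the 10-layer chain and its pre-activation values are assumptions, not from the slides): the sigmoid derivative is at most 0.25, so the product of many such derivatives shrinks toward zero, while the ReLU derivative is 1 for positive inputs.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_grad(x):
    s = sigmoid(x)
    return s * (1.0 - s)          # maximum value is 0.25, at x = 0

def relu_grad(x):
    return (x > 0).astype(float)  # 1 for positive inputs, 0 otherwise

# Pre-activations of 10 stacked layers (toy values, all positive here)
pre_activations = np.full(10, 0.5)

# By the chain rule, the gradient reaching the earliest layer is (roughly)
# the product of the per-layer activation derivatives.
print(np.prod(sigmoid_grad(pre_activations)))  # about 5e-7: the gradient vanishes
print(np.prod(relu_grad(pre_activations)))     # 1.0: the gradient is preserved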
Using TensorFlow APIs in Keras

# Uploading dynamic files
from google.colab import files
uploaded = files.upload()

# Create MLP in Keras
from keras.models import Sequential
from keras.layers import Dense
import numpy

# fix random seed for reproducibility
numpy.random.seed(7)

Dense implements the operation: output = activation(dot(input, kernel) + bias)

• A dense layer is just a regular layer of neurons in a neural network.
• Each neuron receives input from all the neurons in the previous layer, thus it is densely connected.

Activation: element-wise activation function passed as the activation argument
Kernel: weights matrix created by the layer
bias: bias vector created by the layer (only applicable if use_bias is True)
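A minimal sketch (illustrative, not from the slides; the layer sizes and random input are assumptions) checking the Dense formula above by hand: the layer's output equals activation(dot(input, kernel) + bias).

import numpy as np
from keras.models import Sequential
from keras.layers import Dense

# One Dense layer with 3 units and ReLU activation on 4 input features
layer = Dense(3, input_dim=4, activation='relu')
model_check = Sequential([layer])

x = np.random.rand(2, 4).astype('float32')   # batch of 2 samples
out_keras = model_check.predict(x)

# Recompute the same output manually from the layer's weights
kernel, bias = layer.get_weights()
out_manual = np.maximum(0.0, x @ kernel + bias)   # relu(dot(input, kernel) + bias)

print(np.allclose(out_keras, out_manual, atol=1e-5))  # True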
# load dataset of scores
dataset = numpy.loadtxt("scores.csv", delimiter=",")

# split into input (X) and output (Y) variables


X = dataset[:,0:8]
Y = dataset[:,8]

# create model
model = Sequential()
model.add(Dense(12, input_dim=8, activation='relu'))
model.add(Dense(8, activation='relu'))
model.add(Dense(4, activation='relu'))
model.add(Dense(2, activation='relu'))
model.add(Dense(1, activation='sigmoid'))

# Compile model
model.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])
https://keras.io/activations/
# Fit the model
model.fit(X, Y, epochs=100, batch_size=10)

# Evaluate the model
scores = model.evaluate(X, Y)
print("\n%s: %.2f%%" % (model.metrics_names[1], scores[1]*100))

# Test Data
testdata = files.upload()
testdataset = numpy.loadtxt("testdata.csv", delimiter=",")
X2 = testdataset[:,0:8]
predictions = model.predict(X2)

# Round predictions
rounded = [round(x[0]) for x in predictions]
print(rounded)
Loss Functions

A loss function (also called an objective function or optimization score function) is one of the two parameters required to compile a model:

from keras import losses

model.compile(loss='mean_squared_error', optimizer='sgd')

• for binary_crossentropy: sigmoid activation, scalar target

• for categorical_crossentropy: softmax activation, one-hot encoded target

• If it is a multiclass problem, use categorical_crossentropy (see the sketch after this list for a one-hot encoded target).
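A minimal sketch (illustrative; the data shapes and class count are assumptions, not from the slides) of the multiclass setup described above: integer labels are one-hot encoded with to_categorical, the final layer uses softmax, and the model is compiled with categorical_crossentropy.

import numpy as np
from keras.models import Sequential
from keras.layers import Dense
from keras.utils import to_categorical

# Toy data: 8 input features, 3 possible classes
X_multi = np.random.rand(100, 8)
y_labels = np.random.randint(0, 3, size=100)        # integer class labels 0, 1, 2
y_onehot = to_categorical(y_labels, num_classes=3)  # e.g. 2 -> [0, 0, 1]

model_multi = Sequential()
model_multi.add(Dense(12, input_dim=8, activation='relu'))
model_multi.add(Dense(3, activation='softmax'))      # one output per class

model_multi.compile(loss='categorical_crossentropy', optimizer='adam',
                    metrics=['accuracy'])
model_multi.fit(X_multi, y_onehot, epochs=5, batch_size=10, verbose=0)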


Interpretation of Output

• Loss: a scalar value that we attempt to minimize during training of the model. The lower the loss, the closer our predictions are to the true labels.
• Both loss and val_loss should be decreasing, and accuracy (acc and val_acc) should be increasing.
• acc is the accuracy on the training set; val_acc measures how good the model's predictions are on held-out validation data.
• Training loss is the average of the losses over each batch of training data. These quantities can be read from the History object returned by model.fit, as in the sketch below.
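A minimal sketch (illustrative; the 20% validation split is an assumption) of reading loss, val_loss, acc and val_acc from the History object that model.fit returns for the model built earlier:

# Hold out 20% of the data for validation so val_loss / val_acc are reported
history = model.fit(X, Y, validation_split=0.2, epochs=100, batch_size=10)

# history.history maps each metric name to a list with one value per epoch
print(history.history.keys())           # e.g. dict_keys(['loss', 'acc', 'val_loss', 'val_acc'])
print(history.history['loss'][-1])      # final training loss
print(history.history['val_loss'][-1])  # final validation loss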
A function that transforms the values, or states the conditions for the decision of the output neuron, is known as an activation function:

• Sigmoid (an in-between answer such as "maybe", i.e. an intermediate prediction)
• Tanh
• Softmax
• and many others
Classification and Regression in Prediction

Classification -> the task of predicting a discrete class label (forecasting a target class)

Regression -> the task of predicting a continuous quantity (forecasting a value)

The sketch below contrasts the typical output layer and loss for each case.
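An illustrative sketch (the layer sizes are assumptions, not from the slides) of how the final layer and loss differ for the two tasks: a sigmoid output with binary_crossentropy for a binary class label versus a linear output with mean_squared_error for a continuous value.

from keras.models import Sequential
from keras.layers import Dense

# Binary classification: predict a discrete class label (0 or 1)
clf = Sequential()
clf.add(Dense(8, input_dim=8, activation='relu'))
clf.add(Dense(1, activation='sigmoid'))             # probability of the positive class
clf.compile(loss='binary_crossentropy', optimizer='adam', metrics=['accuracy'])

# Regression: predict a continuous quantity
reg = Sequential()
reg.add(Dense(8, input_dim=8, activation='relu'))
reg.add(Dense(1, activation='linear'))              # unbounded real-valued output
reg.compile(loss='mean_squared_error', optimizer='adam', metrics=['mae'])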
Metrics
A metric is a function that is used to judge the performance of your
model. Metric functions are to be supplied in the metrics parameter
when a model is compiled.

model.compile(loss='mean_squared_error',
optimizer='sgd',
metrics=['mae', 'acc'])

from keras import metrics

model.compile(loss='mean_squared_error',
optimizer='sgd',
metrics=[metrics.mae, metrics.categorical_accuracy])

A metric function is similar to a loss function, except that the results from evaluating a metric are not used when training the model.
GAURAV KUMAR
Magma Research and Consultancy Services
Ambala Cantt., Haryana, India
Mobile Numbers: +91-9416366178, +91-9034001978
E-mail: [email protected]
http://www.gauravkumarindia.com
