100% found this document useful (1 vote)

171 views132 pages

TensorFlow All-Around

The document outlines an agenda for a TensorFlow All-Around event at SUTD. It includes an introduction to Python from 10:00 am to 12:30 pm, followed by lunch and networking. From 1:30 pm to 3:30 pm there will be introductions to machine learning and TensorFlow 2.0, with a tea break in between. The event will conclude at 6:00 pm.

Uploaded by

Venkat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

171 views132 pages

TensorFlow All-Around

Uploaded by

Venkat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 132

TensorFlow All-Around @ SUTD

13 September 2019
10:00 am Introduction to Python

12:30 pm Lunch and Networking

01:30 pm Introduction to Machine Learning

Outline 03:30 pm Tea Break

04:00 pm Introduction to TensorFlow 2.0

06:00 pm End

2
Introduction to Python
Instructions
● Please move towards the front/center
● Try to ﬁnd seats near power plugs
● Try to seat nearer to the aisles
● WIFI: SUTD_Guest
You can ﬁnd today's material at http://bit.ly/tf_slides and this link.
Pigeonhole (to ask questions): https://pigeonhole.at/TENSORFLOW

3
Part 1
Introduction to Python
10:00 am - 12:30 pm

http://bit.ly/tf_slides

4
Outline

1. Preface
2. Python Basics
3. Python for Scientiﬁc Computing
4. Some additional tips (might skip depending on time)

http://bit.ly/tf_slides

5
Preface

Computers are useful but dumb:

● Useful to solve problems
● Dumb because you need to give them instructions ("programming")
● They will execute every instruction given, even bad ones!

6
Preface

● Programming in Python/XXX is just a tool for problem solving

● Programming is not going to be easy
○ Expect things to change all the time

○ Ask questions about things and Google for answers

○ Consistent learning and application is key!

7
Preface

● In this ~2h session, we are going to be learning about the basics of

programming using Python.
● During the "practical" parts, feel free to raise your hands and asking any of
the TAs for help, especially if you run into technical issues
● For conceptual questions, encouraged to ask on pigeonhole:
https://pigeonhole.at/TENSORFLOW
● Slides: http://bit.ly/tf_slides
8
What is Python?

● A programming language known for simplicity, yet

capable of building extremely complicated systems
● Open source project managed by the Python Software
Foundation
● Widely used by many people and companies for a
large variety of tasks, and usage is growing!

9
Python vs other programming languages

Easier Harder
10
Major Python Users

● Google (various tools and products)

“Python where we can, C++ where we must.” [1, 2]
● Instagram (most of the backend)
“Do the simple thing ﬁrst.” [1, 2]
● Spotify, Uber, Netﬂix, Dropbox, and many other companies

11
Python Advantages

● Clear and readable syntax

● Intuitive
● Natural expression
● Powerful
● Popular & Relevant

12
What does it mean to
"know" Python?

13
"Knowing English"

● The elements of English language

● The syntax of English grammar
● How to read sentences
● How to write meaningful sentences
Not trivial!
● How to combine sentences into a coherent passage

14
"Knowing Python"

● The elements of Python language

● The syntax of Python grammar
● How to read Python statements
● How to write meaningful Python statements
Not trivial!
● How to combine statements into a coherent program

15
How to get there?

● Know the elements/syntax/programming structure

○ This "crash course" will attempt to cover most of the basics

● The tools to help you

○ We'll show you some of them

● Consistent learning and application

○ Internet is your best friend :)

16
Tools for Programming in Python

● Simple editor for beginners:

IDLE, Thor
● More complex IDE-like environments:
VS Code, Pycharm (Pro edition free for students)
● Jupyter Notebook

17
Jupyter Notebook

● Web based development

environment
● File format is the
iPython notebook (.ipynb)

18
Jupyter Notebook

● Combines text, images, code,

output into a
computational narrative
● Preferred tool for data scientists
to do rapid iteration and share
results

19
Google Colab

Free hosted Jupyter notebooks

https://colab.research.google.com

20
Python Basics

● Statements
● Objects
● Functions
● Loops & Control Flow
● Object-oriented Programming

21
Python Statements

● Instructions that a Python interpreter can execute are called statements.

● Statements are usually contained within one line of code.
● (Refer to Notebook later)
● We'll go through the slides ﬁrst and come back to the notebook

22
Python Objects

● Almost everything in Python is an Object

● Common types
○ Text Type: str

○ Numeric Types: int, float

○ Sequence Types: list, tuple

○ Mapping Type: dict

23
Python Functions

● A function is a block of code which only runs when it is called.

● You can pass data, known as arguments, into a function.
○ Sometimes arguments are referred to as parameters.

● A function can return data as a result.

24
Python Functions

Arguments
Arguments with default value

def function(arg_1, arg_2, arg_3=default_value):

# do something
return some_data

25
Python Control Flow

● If-else-elif
○ Used in combination with logic to determine code path

● Try-except
○ Used to handle errors in code

● Loops
○ For-loop and while-loop

26
Python Logic

● True and False

● Equivalence operator: ==
○ not True == False

● Used to check if conditions are met. E.g.:

○ 1 == 2 -> False

○ 2 == 2 -> True

27
Notebook: Python Basics

● (Go to Notebook)
● http://bit.ly/tf_slides -> Slide 28

28
Checkpoint
Next: object oriented programming in Python

Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

29
Object Oriented Programming

● Object-oriented Programming, or OOP for short, is a programming

paradigm
● It provides a means of structuring programs so that properties and
behaviors are bundled into individual objects.

30
Object Oriented Programming

● An object could represent:

○ A type e.g. person

○ With properties (attributes) such as, age, address, etc.

○ With behaviors (methods! aka functions) like walking, talking, breathing, and running.

31
Python Classes

● Classes are used to create new user-deﬁned data structures that

encapsulates attributes and methods.
● In the case of an animal, we could create an Animal() class to track
properties about the Animal like the name and age.
● It may help to think of a class as an idea for how something should be
deﬁned.

32
Inheritance

● Inheritance is the process by which one class takes on the attributes and
methods of another.
● Newly formed classes are called child classes, and the classes that child
classes are derived from are called parent classes.
● It’s important to note that child classes override or extend the functionality
(e.g., attributes and behaviors) of parent classes.

33
Object Oriented Programming

● Notebook
● http://bit.ly/tf_slides -> Slide 34

34
Checkpoint
Next: scientiﬁc computing in Python

Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

35
Scientiﬁc Computing

● Scientiﬁc Computing is one of the most important uses of programming,

and spans many ﬁelds

36
Progress is "gated" by computation ability

The angular shape of the Have Blue prototype and production F-117 aircraft is a direct result of the lack
of computational ability to simulate more complex geometry
37
Scientiﬁc Computing

● Scientiﬁc Computing is one of the most important uses of programming,

and spans many ﬁelds
● The use of Python is common in some ﬁelds, primarily when machine
learning is involved

38
Climate Modelling (TF/Python @ Exascale)

Exascale Deep Learning for Climate Analytics (Kurth et al., 2018)

https://arxiv.org/abs/1810.01993 39
https://www.youtube.com/watch?v=e0QK5glozC8
Universe n-body simulation

Learning to predict the cosmological structure formation (He et al., 2019)

40
https://www.pnas.org/content/116/28/13825
Scaling in Scientiﬁc Computing

In scientiﬁc computing, we are often concerned about

● Running problems faster
● Running larger problems
Hence we get this notion of scaling:
● Strong scaling: how to run a single task faster with many processors
● Weak scaling: how to run more tasks with more processors

41
Python for Scientiﬁc Computing

● Arrays in Python
● Google Colab tricks
● Some other widely used libraries:
○ Matplotlib (Plotting)

○ OpenCV (Image Processing)

○ SciPy

42
Arrays

● An array, is a data structure consisting of a collection of elements, each

identiﬁed by at least one array index or key.
● Python List is somewhat similar conceptually, but is not an array!
● In Python, the most popular array format is the numpy array.

43
Array vs Python List

44
Numpy (TLDR version)

● Numpy gives you eﬃcient arrays in Python

○ Python lists are poor in terms of memory and speed

● Numpy allows you to write faster code with arrays

45
Numpy

Numpy is the fundamental package for scientiﬁc computing with Python.

It contains, among other things:

● a powerful N-dimensional array object

● sophisticated (broadcasting) functions
● tools for integrating C/C++ and Fortran code
● useful linear algebra, Fourier transform, and random number capabilities

46
Numpy

The main feature of NumPy is an ndarray object.

● All array elements have to be the same type (usually ﬂoat or integer);
● Array elements can be accessed, sliced, and manipulated in the same way
as the lists;
● Arrays can be N-dimensional;
● The number of elements in the array is ﬁxed;
● Shape of the array can be changed.

47
Brief History of Array Computing in Python
Pandas
Various incompatible libraries Manipulating numerical tables Dask
and time series
Numeric, numarray etc. Theano Dask scales Numpy workﬂows to
"Dark Ages" Manipulating and evaluating multiple cores or machines
mathematical expressions,
especially matrix-valued ones

2006 2012

< 2006 2008 2015

Numpy & Numpy API Numba

Numpy provides a standard Compiler that translates a subset

format and API for array of Python and NumPy into fast
computing in Python machine code using LLVM

48
Numpy

● Python vs Numpy Comparison:

https://colab.research.google.com/drive/1fEGWzxcdWyBNYleyUUX7FT-M
n906oSAw
● Good visual explanations (look at this in your spare time!):
http://jalammar.github.io/visual-numpy/
● Numpy 100 exercises (look at this in your spare time!):
https://github.com/rougier/numpy-100

49
Checkpoint
Next: "extras" - we'll skip if there's no time

Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

50
Google Colab Tricks

● Overview of Colab Features:

https://colab.research.google.com/notebooks/basic_features_overview.ip
ynb
● Loading Data:
https://colab.research.google.com/notebooks/io.ipynb

51
Plotting with Matplotlib

● Notebook:
https://colab.research.google.com/notebooks/charts.ipynb

52
Images in Python and OpenCV

● Notebook:
https://colab.research.google.com/github/OpenSUTD/machine-learning-w
orkshop/blob/master/labs/Lab%201D%20-%20Images.ipynb

53
SciPy

Python-based ecosystem of open-source software for mathematics, science,

and engineering, with modules for

● statistics, optimization;
● integration, interpolation;
● linear algebra, solving non-linear equations;
● Fourier transforms;
● ODE solvers, special functions;
● signal and image processing.
54
Checkpoint
Next: Introduction to Machine Learning

Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

55
Part 2
Introduction to Machine Learning
01:30 pm - 03:30 pm

Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

56
Preface

● This is a ~2h session that aims to give a crash course on deep learning
basics. Where appropriate, additional links are provided for you to learn
more on your own (highly encouraged!)
● For conceptual questions, encouraged to ask on pigeonhole
Do not ask the TA conceptual questions! (TA reject questions pls)
You're unlikely the only person asking the question, asking on pigeonhole
lets everybody beneﬁt and get their question answered.

57
Preface

● Slides: http://bit.ly/tf_slides
● Pigeonhole: https://pigeonhole.at/TENSORFLOW

58
What is Machine Learning?

● Machine learning allows us to solve problems

without explicitly knowing the solution
● "Automate creating rules to solve problems"
● "Data is the new source code"

59
Key reasons to use Machine Learning

● We cannot reasonably create "rules" to solve a problem

● We want to approximate ("curve ﬁt") certain diﬃcult problems

60
What does ML look like
Traditional Programming Machine Learning

INPUT
[Training]
RULES
Algorithm
INPUT
RESULTS
Program /
RESULTS
Algorithm
Store as parameters
RULES

[Inference]
INPUT Algorithm RESULTS
+ rules

61
What is Machine Learning?

● Interesting slide deck (~2h) by a Googler:

https://docs.google.com/presentation/d/1kSuQyW5DTnkVaZEjGYCkfOxvz
CqGEFzWBy4e9Uedd9k/edit
● Serves as a good complement to today's material
● Highly encourage you to set aside ~2h or so, go through the entirely of this,
and Google new concepts and terms that you don't understand!

62
Exponential Growth of ML

ArXiv papers about ML Google codebase FLOPS to train a model

2009 - 2017 2012 - 2017 2013 - 2019
(log scale!)

63
Exponential Growth of ML

3× 30× 3×
Engineers Compute Resource Used Data Used for ML
Facebook Ranking system Facebook Facebook

64
https://venturebeat.com/2019/07/11/facebook-vp-ai-has-a-compute-dependency-problem/
Machine Learning Algorithms

● Clustering, Curve ﬁtting etc. algorithms

● Many can be found in the Scikit-Learn (CPU) or cuML (GPU) and other
libraries

65
66
Machine Learning Algorithms

● Clustering, Curve ﬁtting etc. algorithms

● Many can be found in the Scikit-Learn (CPU) or cuML (GPU) and other
libraries
● Today, we be focused on a subset of ML known as Deep Learning

67
Deep Learning (DL)

● A subset of ML that deals with deep neural networks

● A key idea in deep learning is allowing the computer to build complex
concepts (representations) out of simpler concepts
● This is also known as learning a hierarchy of representations

68
Deep Learning

● Deep Learning is very hyped now because

we have great success applying it to many
previously "unsolved problems"
(e.g. ImageNet).
● These problems are mainly perception
problems on unstructured data

69
Deep Learning vs Neuroscience

● "Deep Learning is inspired by how the brain works"

● Deep Learning is very loosely inspired by some theories of how the brain
works, but these have little practical implication on how we use DL
● "Artiﬁcial neuron" bears very little similarity to biological neuron

70
Deep Learning vs Neuroscience

71
Applications of Deep Learning

Computer Vision

● Robotics & Autonomous Vehicles

● Intelligent Video Analytics

Natural Language Processing

● Social Media Analytics

● Chatbots & Conversational Agents

72
Autonomous Driving

73
74
Breaking down Deep Learning

Components

● Model ("deep neural network")

● Dataset
● Training procedure (gradient descend)

75
DL Model

● Models are design to reﬂect a certain inductive bias (assumption)

○ We'll highlight this for the models we show later

● Many different types of DNNs architectures (popular ave. for exploration)

○ Multilayer Perceptron (MLP) / fully connected network

○ Convolutional Neural Networks (CNN)

○ Recurrent Neural Network (RNN)

○ Transformer

76
DL Model

● Let's walk through a short summary of different types of DNNs

77
Multilayer Perceptron
● A hierarchy of relatively simple models
can model something more complex

y = σ( Σ(wixi) + b)

78
Convolutional Neural Network

● Relative positions of pixels (or any value) of an image (or other inputs)
encodes a certain meaning (feature)
○ Relative positions -> certain "formations" of pixels

○ Relative, not absolute! -> translational invariance

● We use layers of convolutional ﬁlters to create feature maps from inputs

○ http://setosa.io/ev/image-kernels/

79
Problem
The object is the same, but the
pixels are completely different!

80
Convolutional Neural Network

81
Convolutional Neural Network

● Relative positions of pixels (or any value) encodes a certain meaning

● We use convolutional ﬁlters to create feature maps from inputs
● https://tensorspace.org/html/playground/alexnet.html
(large ﬁle warning)
● In-depth lecture: https://www.youtube.com/watch?v=bNb2fEVKeEo

82
Computer Vision

83
Recurrent Neural Network

● There is a "time dependency" in modelling sequential data (e.g. text).

● Complete information at time step t requires accumulated knowledge from
previous time steps.
● In-depth lecture: https://www.youtube.com/watch?v=iWea12EAu6U

84
Recurrent Neural Network

Outputs

Hidden
State
(changes
over time)

Inputs

85
Transformer

● Dependencies between elements of a sequence are not encoded by

relative position
○ Great to model things like language and music

● Uses primarily self-attention based architecture

● In-depth lecture: https://www.youtube.com/watch?v=5vcj8kSwBCY

86
Dataset

● The larger, the better. Literally always.

○ Deep Learning Scaling is Predictable, Empirically (Hestness, 2017)
https://arxiv.org/abs/1712.00409

● Huge datasets like ImageNet kickstarted the DL revolution

87
Training

Two major components required:

1. Metric to measure performance

2. Optimization algorithm

88
Training Metric

● Value of loss function - quantiﬁcation of error produced by model

○ We'll see examples of metrics provided by TF during the next segment

● The training algorithm will attempt to minimise this error with respect to
the model parameters

89
Training Algorithm

Core algorithm for DL training

● Backpropagation (or gradient descend)
● Iteratively change parameters to minimise the loss function

90
Training Algorithm

● Adaptive vs "vanilla" versions

● Different empirical performance

● All attempt to converge to some good

minima on the loss landscape

91
Training Algorithm (Forward Pass)

Data Model Prediction

Loss

Ground
Truth

92
Training Algorithm (Backward Pass)

Model

Loss
minimise (model parameters) w.r.t. loss

93
Training Algorithm Visualisation

● https://playground.tensorﬂow.org/

94
Checkpoint
Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

95
Downsides of Deep Learning

More often than not, in the real world:

● You don't have enough (good) data

● Results are empirical (not theoretical)
○ Not easy to understand why something works, or doesn't work

● You don't have enough compute, so you can't experiment fast enough

96
97
98
99
Hardware Requirements for DL

● Deep Learning involves lots of

parallel computations and matrix-multiplications
● This makes it suitable for hardware accelerators
such as GPUs
● There are DL-speciﬁc hardware accelerators
starting to appear (e.g. TPU)

100
NVIDIA DGX SuperPOD (#22 on TOP500) 101
Google Cloud TPU v3 Pod 102
Hardware Requirements for DL

● The good news is for most "simple" tasks, 1 GPU is enough

Especially if you use transfer learning
● Throughout this workshop series, we will be using Google Colab, which
provides a free GPU for up to 12 hours
● Generally, you don't want to run things locally on a laptop.
Even a high-end gaming laptop!

103
NVIDIA Tensor Core GPUs
Delivering cutting-edge Deep Learning performance

NVIDIA Tesla T4
Single Precision: 8.1 TFLOPS
Mixed Precision: 65 TFLOPS
300GB/s VRAM (GDDR6)
Ultra-eﬃcient 70W

Free for education/research use

on Google Colab ( )

104
NVIDIA Tensor Core GPUs
Delivering cutting-edge Deep Learning performance

NVIDIA Tesla T4
(~ low power RTX 2070)

Can be 25% faster than

GTX 1080 Ti in training speed

45% more VRAM

Free for education/research use

on Google Colab ( )

105
Tip: Don't be a hero

● Use Google Colab

○ No need to set up environment

○ Powerful GPU/TPU for free (faster than your laptop GPU)

● Simpler is better
○ Keep up to date, but there's no need to chase the latest "state of the art"

○ Start simple, add on complexity one step at a time

106
Checkpoint
Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

107
Part 3
Introduction to TensorFlow 2.0
04:00 pm - 06:00 pm

Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

108
Keras in one slide

● Keras is a super simple Python library for building deep learning models
● Keras provides 99% of the building blocks you need
○ Layers, Activations
○ Optimizers
○ Nice extras
■ Datasets
■ Callbacks

● Keras is widely used in industry and research

109
TensorFlow in one slide

● TensorFlow is a Google "open source" library designed for graph execution

for deep learning research and production at scale
● High level Python API, low level C++ API, everything in between
○ "Next-generation" Swift API in development

● Train models and deploy on a variety of platform

● High degree of performance optimization from both Google and vendors

110
tf.keras

● TensorFlow is the "most popular" library used for deep learning

○ Most people absolutely hate the API of TensorFlow 1.x

○ But TensorFlow ecosystem for research/development/deployment is good

● For TensorFlow 2.0, the oﬃcial high-level API is now Keras

This is known as tf.keras,
and we will use this throughout our workshop for all code examples

111
TensorFlow Lattice
Components for interpolated
look-up tables in TensorFlow
TensorFlow Data Validation
Library for exploring and validating machine TensorFlow Federated
learning data Machine learning framework for
decentralized data
TensorFlow Transform
Library for preprocessing and manipulating TensorFlow Privacy
data with TensorFlow Training and analysing models with
differential privacy
TensorFlow Model Analysis
Evaluate models using a variety of metrics TensorFlow Probability
and techniques Probabilistic reasoning and
statistical analysis in TensorFlow
TensorFlow Model Optimization
Suite of tools for optimizing ML models for Mesh TensorFlow TensorFlow Agents
deployment and execution Large-scale distributed model Eﬃcient batched reinforcement
parallelism toolkit for TensorFlow learning in TensorFlow
TensorFlow Serving
Scalable, high-performance serving system TensorFlow Hub TensorFlow Graphics
for models Library for transfer learning by Differentiable graphics layers for
reusing parts of TensorFlow models TensorFlow
TensorFlow Extended (TFX)
TensorFlow Datasets TensorFlow Ranking
Provides various public datasets in Components for Learning-to-Rank
for form of tf.data.Datasets (LTR) techniques in TensorFlow
112
Tensor
a multidimensional array (aka ndarray)

Flow
a computational graph

113
import tensorflow as tf

mnist = tf.keras.datasets.mnist
Load Dataset (x_train, y_train),(x_test, y_test) = mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(input_shape=(28, 28)),
tf.keras.layers.Dense(128, activation='relu'),
Deﬁne Model tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation='softmax')
])

model.compile(optimizer='adam',
Compile into loss='sparse_categorical_crossentropy',
optimized graph
metrics=['accuracy'])

Train Model model.fit(x_train, y_train, epochs=5)

114
tf.keras

● "Low Bar"
Easy to piece together cutting edge models that work
● "High Ceiling"
Write custom pieces and training loops
Do cutting edge research work
https://www.tensorﬂow.org/beta/tutorials/quickstart/advanced

115
A bit more about Keras
model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(input_shape=(28, 28)),
tf.keras.layers.Dense(128, activation='relu'),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation='softmax')
])

model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

116
Dense layer - a fully connected layer

117
Dense layer - a fully connected layer

activation
function

weights
bias

118
A bit more about Keras
model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(input_shape=(28, 28)),
tf.keras.layers.Dense(128, activation='relu'),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation='softmax')
])

model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

119
Activation Functions

120
A bit more about Keras
model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(input_shape=(28, 28)),
tf.keras.layers.Dense(128, activation='relu'),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation='softmax')
])

model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

121
Dropout

122
A bit more about Keras
model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(input_shape=(28, 28)),
tf.keras.layers.Dense(128, activation='relu'),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation='softmax')
])

model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

123
Keras Optimizers

● SGD(lr=0.01, momentum=0.0, decay=0.0)

○ Vanilla, usually quite slow, might give the best ﬁnal result eventually

● Adam(lr=0.001, decay=0.0)
○ Good ﬁrst choice, just pick this

● RMSprop(lr=0.001, decay=0.0)
○ Maybe good choice for RNNs. Try Adam, and then this, and compare.

124
Keras Optimizers
● Learning Rate (lr)
○ Too small == no ﬁtting; too large == diverge/explode

○ Usually problem speciﬁc; Adam is less sensitive to LR

● Decay
○ Multiply learning rate by (1-rate) after every step (not epoch!!)

● Other parameters
○ Usually Optimizers have some other parameters. Unless you know why, there is no real
reason to tweak those. "Sane defaults" are set.
125
A bit more about Keras
model = tf.keras.models.Sequential([
tf.keras.layers.Flatten(input_shape=(28, 28)),
tf.keras.layers.Dense(128, activation='relu'),
tf.keras.layers.Dropout(0.2),
tf.keras.layers.Dense(10, activation='softmax')
])

model.compile(optimizer='adam',
loss='sparse_categorical_crossentropy',
metrics=['accuracy'])

126
Loss

Class Predictions Ground Truth

● Loss functions compare difference between class predictions and ground

truth values that you supply (Supervised learning)
● Smaller the error, the better the prediction

127
Loss

● binary_crossentropy
○ Possible to have multiple correct labels (e.g. tweet is both sad and angry)
Check if each category (individually) is correct

● categorical_crossentropy
○ Only one category is correct (e.g. picture is a cat, not a dog)
Check if the category is correct (only one out of many)

● mean_square_error
○ Compare input and output for differences (e.g. image reconstruction)

128
Dataset: MNIST

● Handwritten digits

● 28 x 28, single colour channel

● "Hello world"

● Generally not treated as signiﬁcant

anymore - simple models can easily solve
this challenge

129
Dataset: CIFAR

● 32 x 32 colour images of real-world

objects

● Much, much harder than MNIST

● Often used as a benchmark for

"okay, this is a model that works"

● Harder CIFAR100 version also exists

130
Checkpoint
Slides: http://bit.ly/tf_slides
Pigeonhole: https://pigeonhole.at/TENSORFLOW

131
Code Walkthrough

1. Colab - Checking for GPU

2. Colab - MNIST
3. Colab - Image Classiﬁcation
http://bit.ly/tf_slides -> slide 132

132

Danka T Mathematics of Machine Learning Master Linear Algebr
100% (2)
Danka T Mathematics of Machine Learning Master Linear Algebr
729 pages
SIC - C&P - Chapter 1. Unit 1
No ratings yet
SIC - C&P - Chapter 1. Unit 1
81 pages
Scikit Learn Docs PDF
No ratings yet
Scikit Learn Docs PDF
2,387 pages
Python Unit I
No ratings yet
Python Unit I
29 pages
Yolo v8
No ratings yet
Yolo v8
10 pages
SIC - C - P - Chapter 1. Programing Basic Concept and Starting Python - v1
100% (1)
SIC - C - P - Chapter 1. Programing Basic Concept and Starting Python - v1
545 pages
Fine Tuning Techniques For Large Language Models LLMs
No ratings yet
Fine Tuning Techniques For Large Language Models LLMs
15 pages
PyTorch - Advanced Deep Learning
No ratings yet
PyTorch - Advanced Deep Learning
237 pages
SIC Python Course Material
No ratings yet
SIC Python Course Material
74 pages
Pstricks: User'S Guide
100% (4)
Pstricks: User'S Guide
338 pages
Pythonlab 1
No ratings yet
Pythonlab 1
80 pages
Paper 17881
No ratings yet
Paper 17881
6 pages
Deep Learning (MODULE-3)
No ratings yet
Deep Learning (MODULE-3)
85 pages
Programming in Modern C++
No ratings yet
Programming in Modern C++
1 page
HHIH
No ratings yet
HHIH
61 pages
23 DeepLearning PDF
No ratings yet
23 DeepLearning PDF
74 pages
Conda Cheat Sheet: Bit - Ly/tryconda
No ratings yet
Conda Cheat Sheet: Bit - Ly/tryconda
2 pages
Artificial Intelligence in Medicine 1st Edition Thompson Stephan Instant Download
No ratings yet
Artificial Intelligence in Medicine 1st Edition Thompson Stephan Instant Download
78 pages
Shruti Black Book Soft
No ratings yet
Shruti Black Book Soft
95 pages
Week 1
No ratings yet
Week 1
49 pages
Ai CW2
No ratings yet
Ai CW2
68 pages
Python Programming Language Book - The Complete Gu
No ratings yet
Python Programming Language Book - The Complete Gu
131 pages
Writing Code For NLP Research PDF
No ratings yet
Writing Code For NLP Research PDF
254 pages
Lecture 26
No ratings yet
Lecture 26
17 pages
Python Main Report
No ratings yet
Python Main Report
41 pages
Thesis Meat Cutter Final Defense 5
No ratings yet
Thesis Meat Cutter Final Defense 5
54 pages
Pandas For Machine Learning: Acadview
No ratings yet
Pandas For Machine Learning: Acadview
18 pages
Image Enhancement Effect On The Performance of Convolutional Neural Networks
No ratings yet
Image Enhancement Effect On The Performance of Convolutional Neural Networks
40 pages
Traffic Report
No ratings yet
Traffic Report
68 pages
Pavan PPT ?
No ratings yet
Pavan PPT ?
27 pages
Introduction To Python Programming
No ratings yet
Introduction To Python Programming
95 pages
StatisticsMachineLearningPythonDraft PDF
100% (1)
StatisticsMachineLearningPythonDraft PDF
219 pages
Generalized Fringe-To-Phase Framework For Single-Shot 3D Reconstruction Integrating Structured Light With Deep Learning
No ratings yet
Generalized Fringe-To-Phase Framework For Single-Shot 3D Reconstruction Integrating Structured Light With Deep Learning
18 pages
Deep Learning Tutorial: Release 0.1
100% (1)
Deep Learning Tutorial: Release 0.1
137 pages
CNN3 Pooling and Fully Contected Layers
No ratings yet
CNN3 Pooling and Fully Contected Layers
21 pages
Exploratory Data Analysis With Matlab
No ratings yet
Exploratory Data Analysis With Matlab
360 pages
In Process Monitoring of Porosity During Laser
No ratings yet
In Process Monitoring of Porosity During Laser
30 pages
Deep End-To-End Learning For Price Prediction of S
No ratings yet
Deep End-To-End Learning For Price Prediction of S
29 pages
High Accur (Book学术 Www.booksci.cn)
No ratings yet
High Accur (Book学术 Www.booksci.cn)
11 pages
DL Lab Manual
No ratings yet
DL Lab Manual
65 pages
Aiiot 2025 Overview
No ratings yet
Aiiot 2025 Overview
3 pages
LLM 2
No ratings yet
LLM 2
12 pages
DL Question Paper Solved
No ratings yet
DL Question Paper Solved
12 pages
Naukri AkashJadhav (6y 0m)
No ratings yet
Naukri AkashJadhav (6y 0m)
3 pages
Deep Learning in Barcode Recognition: A Systematic Literature Review
No ratings yet
Deep Learning in Barcode Recognition: A Systematic Literature Review
27 pages
Numpy Ref
No ratings yet
Numpy Ref
1,128 pages
01 PythonIntro
No ratings yet
01 PythonIntro
46 pages
Module03-Introduction To Python
No ratings yet
Module03-Introduction To Python
40 pages
A Review On Speaker Recognition - Technology and Challenges
No ratings yet
A Review On Speaker Recognition - Technology and Challenges
14 pages
An End To End Method To Extract Information
No ratings yet
An End To End Method To Extract Information
10 pages
DSM and Word Embedding
No ratings yet
DSM and Word Embedding
116 pages
Super Cheatsheet Machine Learning
100% (1)
Super Cheatsheet Machine Learning
15 pages
Pix2code Generating Code From A Graphical User Int
No ratings yet
Pix2code Generating Code From A Graphical User Int
8 pages
An Introduction To Vision-Language Modeling: Aishwarya Agrawal Kate Saenko Asli Celikyilmaz Vikas Chandra
No ratings yet
An Introduction To Vision-Language Modeling: Aishwarya Agrawal Kate Saenko Asli Celikyilmaz Vikas Chandra
76 pages
Multi-Factor Based Stock Price Prediction Using Hybrid Neural Networks With Attention Mechanism
No ratings yet
Multi-Factor Based Stock Price Prediction Using Hybrid Neural Networks With Attention Mechanism
6 pages
Project Report
No ratings yet
Project Report
13 pages
Python Intro
No ratings yet
Python Intro
28 pages
MLT Numericals
No ratings yet
MLT Numericals
4 pages
Deep Learning For White Cabbage Seedling 2021 Computers and Electronics in A
No ratings yet
Deep Learning For White Cabbage Seedling 2021 Computers and Electronics in A
9 pages
ML Microsoft Course Overview: Machine Learning in Context
100% (1)
ML Microsoft Course Overview: Machine Learning in Context
53 pages
Pandas Tricks and Tips To Exceed
No ratings yet
Pandas Tricks and Tips To Exceed
12 pages
Cheet Sheet
No ratings yet
Cheet Sheet
47 pages
A Double Time-Scale CNN Solving Two-Dimensional Navier-Stokes Equationst
No ratings yet
A Double Time-Scale CNN Solving Two-Dimensional Navier-Stokes Equationst
7 pages
A Hybrid Intrution Detection Approach Based On Deep Learning
No ratings yet
A Hybrid Intrution Detection Approach Based On Deep Learning
16 pages
Data Science Deep Learning & Artificial Intelligence
No ratings yet
Data Science Deep Learning & Artificial Intelligence
9 pages
Introduction To Learning: Frederic Precioso 24/01/2019
No ratings yet
Introduction To Learning: Frederic Precioso 24/01/2019
179 pages
Loss Functions
No ratings yet
Loss Functions
37 pages
Rastogi 2020
No ratings yet
Rastogi 2020
6 pages
6.customizing Seaborn Plots PDF
No ratings yet
6.customizing Seaborn Plots PDF
17 pages
NEURAL NETWORKS and Deep Learning: Going Deep About Neural Network
No ratings yet
NEURAL NETWORKS and Deep Learning: Going Deep About Neural Network
4 pages
Data Science Interview Questions - Statistics: Mohit Kumar Dec 12, 2018 11 Min Read
100% (1)
Data Science Interview Questions - Statistics: Mohit Kumar Dec 12, 2018 11 Min Read
14 pages
Imp Question - Answers
No ratings yet
Imp Question - Answers
16 pages
A Visual Intro To Numpy and Data Representation
No ratings yet
A Visual Intro To Numpy and Data Representation
16 pages
Introduction of Data Science - Mahatma Gandhi Central University
No ratings yet
Introduction of Data Science - Mahatma Gandhi Central University
17 pages
Sentiment Analysis On Amazon Fine Food Reviews by Using Linear Machine Learning Models
No ratings yet
Sentiment Analysis On Amazon Fine Food Reviews by Using Linear Machine Learning Models
6 pages
LSTM
No ratings yet
LSTM
42 pages
Datavischeatsheet
No ratings yet
Datavischeatsheet
2 pages
Simple Libraries in Python
No ratings yet
Simple Libraries in Python
12 pages
Pandas Commands
No ratings yet
Pandas Commands
3 pages
Bidhu Final V36by48 2
No ratings yet
Bidhu Final V36by48 2
1 page
Adaline/Madaline:Applications
100% (1)
Adaline/Madaline:Applications
25 pages
LCM LoRA Technical Report
No ratings yet
LCM LoRA Technical Report
7 pages
Python For Data Science Cheat Sheet: Subset Slice
50% (2)
Python For Data Science Cheat Sheet: Subset Slice
1 page
A Practical Guide To Graph Neural Networks
No ratings yet
A Practical Guide To Graph Neural Networks
28 pages
Understand The Data Science From Scratch Know Everything in Detail!
No ratings yet
Understand The Data Science From Scratch Know Everything in Detail!
5 pages
Read & Download (PDF Kindle)
No ratings yet
Read & Download (PDF Kindle)
5 pages
Computer Education For Nepali School Students - QBASIC CLASS IX
No ratings yet
Computer Education For Nepali School Students - QBASIC CLASS IX
10 pages
Performance Analysis of LoRA Finetuning Llama-2
No ratings yet
Performance Analysis of LoRA Finetuning Llama-2
4 pages
PyTorch Workflow Fundamentals
No ratings yet
PyTorch Workflow Fundamentals
1 page
Scientific Python Workshop
100% (1)
Scientific Python Workshop
2 pages