
ML Visuals

By dair.ai

https://github.com/dair-ai/ml-visuals
Basic ML Visuals
[Figure: basic operation blocks — Softmax, Convolve, Sharpen]
[Figure: Transformer architecture —
encoder: Input Embedding + Positional Encoding → Multi-Head Attention → Add & Norm → Feed Forward → Add & Norm;
decoder: Output Embedding of the outputs (shifted right) + Positional Encoding → Masked Multi-Head Attention → Add & Norm → Multi-Head Attention over the encoder output → Add & Norm → Feed Forward → Add & Norm → Linear → Softmax]
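The deck only shows the block diagram; the core of each Multi-Head Attention block is scaled dot-product attention. A minimal numpy sketch following the standard formulation (the toy sizes and the name d_k are assumptions, not from the deck):

import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                  # query-key similarities
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)               # row-wise softmax
    return w @ V                                     # weighted sum of values

# Toy example: 3 tokens, d_k = 4
rng = np.random.default_rng(0)
Q, K, V = (rng.normal(size=(3, 4)) for _ in range(3))
print(scaled_dot_product_attention(Q, K, V).shape)   # (3, 4)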


[Figure: tokenization — the sentence “I love coding and writing” is split into tokens]
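A minimal sketch of the step the figure shows, splitting on whitespace (real tokenizers are usually subword-based; the whitespace rule is an illustrative assumption):

sentence = "I love coding and writing"
tokens = sentence.split()  # naive whitespace tokenization
print(tokens)              # ['I', 'love', 'coding', 'and', 'writing']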


[Figure: fully connected network — Input Layer X = A[0]; Hidden Layers A[1], A[2], A[3] with activations a[l]_1 … a[l]_n; Output Layer A[4] producing Ŷ]
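The notation A[0] → A[4] is a layer-by-layer forward pass. A minimal numpy sketch under assumed layer sizes (the weights, dimensions, and the use of ReLU everywhere are illustrative assumptions):

import numpy as np

def relu(z):
    return np.maximum(0, z)

def forward(X, params):
    """Forward pass: A[0] = X, A[l] = g(W[l] A[l-1] + b[l]), Ŷ = A[L]."""
    A = X  # A[0]
    for W, b in params:
        A = relu(W @ A + b)
    return A  # A[L] = Ŷ

rng = np.random.default_rng(0)
sizes = [3, 4, 4, 4, 1]  # input, three hidden layers, output
params = [(rng.normal(size=(m, n)), np.zeros((m, 1)))
          for n, m in zip(sizes[:-1], sizes[1:])]
X = rng.normal(size=(3, 1))  # one example with 3 features
print(forward(X, params))    # Ŷ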


[Figure: CONV operation — an N×N×3 input is convolved with M×M filters, biases b1 and b2 are added, ReLU is applied, and the two feature maps stack into an M×M×2 output (a[l-1] → a[l])]
Abstract backgrounds
Gradient Backgrounds
Community Contributions
[Figure: striding in CONV — the filter slides with stride S=1 vs. S=2]
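The stride S sets how far the filter moves per step; for an N×N input and an M×M filter the output is floor((N + 2p − M)/S) + 1 per side. A quick sketch (the example sizes are assumptions):

def conv_output_size(n, m, stride, padding=0):
    """floor((n + 2p - m) / s) + 1 for a square input and filter."""
    return (n + 2 * padding - m) // stride + 1

print(conv_output_size(7, 3, stride=1))  # 5
print(conv_output_size(7, 3, stride=2))  # 3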
[Figure: Inception module — an N×N×192 input feeds parallel branches: 1×1 same conv → N×N×64, 3×3 same conv → N×N×128, 5×5 same conv → N×N×32, and same-padded MaxPool (s=1) → N×N×32, concatenated along channels]
[Figure: network expansion between tasks t-1 and t — (a) Retraining w/o expansion, (b) No-Retraining w/ expansion, (c) Partial Retraining w/ expansion]
[Figure: how does a NN work (inspired by Coursera) — a Basic Neuron model: inputs Size, #bed, ZIP, Wealth feed intermediate units (family size, walkability, school quality) that predict PRICE ŷ]
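A minimal sketch of the basic neuron the figure depicts: a weighted sum of inputs passed through ReLU (the feature names and weight values are hypothetical):

import numpy as np

def neuron(x, w, b):
    """A basic neuron: weighted sum of inputs, then ReLU."""
    return max(0.0, np.dot(w, x) + b)

# Hypothetical housing features: [size, #bed, zip_quality, wealth]
x = np.array([120.0, 3.0, 0.8, 0.6])
w = np.array([0.5, 10.0, 20.0, 15.0])  # illustrative weights
print(neuron(x, w, b=-50.0))           # predicted price ŷ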

[Figure: linear regression vs. logistic regression — price ($) regressed on size, with ReLU(x) giving the non-negative kinked fit; logistic regression separates Ŷ = 0 from Ŷ = 1]
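A minimal sketch contrasting the two models in the figure (weights and inputs are illustrative assumptions):

import numpy as np

def linear_regression(x, w, b):
    """Unbounded real output, e.g. price ($) vs. size."""
    return w * x + b

def logistic_regression(x, w, b):
    """Sigmoid squashes the linear score into (0, 1); threshold at 0.5."""
    p = 1.0 / (1.0 + np.exp(-(w * x + b)))
    return int(p >= 0.5)  # Ŷ = 0 or Ŷ = 1

print(linear_regression(100.0, w=2.0, b=50.0))     # 250.0
print(logistic_regression(100.0, w=0.05, b=-4.0))  # 1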
[Figure: training a convolutional encoder–decoder — a 128×128×1 input I passes through CONV1–CONV7 (encoder, then decoder) and is reconstructed as a 128×128×1 output I1]
[Figure: why does deep learning work? — performance vs. amount of data: large NNs keep improving where medium and small NNs and SVM, LR, etc. plateau]

[Figure: one-hidden-layer neural network — inputs X = A[0], hidden activations a[1]_1 … a[1]_4 = A[1], output a[2] = A[2] = Ŷ]
[Figure: neural network templates — input nodes x[1], x[2], x[3] with hidden units a[1]_1, a[1]_2 and output a[2]]


[Figure: Train–Dev–Test vs. model fitting — Train/Valid/Test splits alongside underfitting, good fit, and overfitting on x1–x2 plots]


[Figure: DropOut — units x[1..3] randomly dropped on the way to a[L]]
[Figure: normalization — cost contours over w1, w2 before (elongated) and after (round) normalizing inputs x1, x2]
[Figure: early stopping — error vs. iterations, stopping where the Dev curve turns up while Train keeps falling]
[Figure: deep neural networks — inputs x1, x2 flowing through weights w[1], w[2], …, w[L-2], w[L-1], w[L]]
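The DropOut panel corresponds to inverted dropout: zero out units at random during training and rescale the survivors. A minimal numpy sketch (keep_prob = 0.8 is an assumed value):

import numpy as np

rng = np.random.default_rng(0)

def dropout(a, keep_prob=0.8):
    """Inverted dropout: drop units with prob 1-keep_prob, rescale the rest."""
    mask = rng.random(a.shape) < keep_prob  # 1 = keep, 0 = drop
    return a * mask / keep_prob             # rescale so E[a] is unchanged

print(dropout(np.ones((4, 1))))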

[Figure: understanding Precision & Recall — confusion-matrix quadrants TP, FP, FN, TN]
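From the confusion-matrix quadrants: precision = TP/(TP+FP) and recall = TP/(TP+FN). A quick sketch (the counts are hypothetical):

def precision_recall(tp, fp, fn):
    """Precision = TP/(TP+FP); Recall = TP/(TP+FN)."""
    return tp / (tp + fp), tp / (tp + fn)

# Hypothetical counts from a confusion matrix
print(precision_recall(tp=40, fp=10, fn=20))  # (0.8, 0.666...)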
[Figure: Batch vs. Mini-batch Gradient Descent and Batch Gradient Descent vs. SGD — optimization paths on w1–w2 cost contours: BGD descends smoothly, SGD oscillates]
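A minimal sketch of the spectrum the figure contrasts, on a linear least-squares model (the data, learning rate, and batch size are illustrative assumptions):

import numpy as np

def minibatch_gd(X, y, w, lr=0.1, batch_size=10, epochs=20):
    """Mini-batch gradient descent on a linear least-squares model.
    batch_size = len(X) recovers batch GD; batch_size = 1 is SGD."""
    rng = np.random.default_rng(0)
    for _ in range(epochs):
        idx = rng.permutation(len(X))          # reshuffle each epoch
        for start in range(0, len(X), batch_size):
            b = idx[start:start + batch_size]
            grad = X[b].T @ (X[b] @ w - y[b]) / len(b)
            w -= lr * grad                     # one (noisy) descent step
    return w

rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))
y = X @ np.array([2.0, -1.0])
print(minibatch_gd(X, y, w=np.zeros(2)))       # approaches [2, -1]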
[Figure: Softmax prediction with 2 outputs — inputs x[1], x[2], x[3] mapped to probabilities p[1], p[2]]
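Softmax maps the network's raw scores to probabilities that sum to 1. A minimal numpy sketch (the example scores are assumptions):

import numpy as np

def softmax(z):
    """Map raw scores to probabilities that sum to 1."""
    e = np.exp(z - np.max(z))  # shift for numerical stability
    return e / e.sum()

z = np.array([2.0, 1.0])       # scores for the 2 outputs
print(softmax(z))              # p[1], p[2] ≈ [0.73, 0.27]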
Miscellaneous
[Figure: U-Net-style architecture — 3×3 and 1×1 convolutions, 2×2 max pooling, 2×2 up-sampling, copied blocks and skip connections concatenating encoder and decoder features (16+32, 32+64, 64+128, 128+256 channels), with dropout 0.1/0.2/0.3]
[Figure: VGG-style ConvNets — Input → Layer 1 (Conv3-32 ×2) → Max-Pool → Layer 2 (Conv3-64 ×2) → Max-Pool → Layer 3 (Conv3-128) → Max-Pool → Layer 4 (FC-512) → Output; shown alongside a compact Conv → Max-Pool → FC layer template]
[Figure: Inception module — the previous layer feeds 1×1, 3×3, and 5×5 convolutions (3×3 and 5×5 preceded by 1×1 reductions) plus 3×3 max pooling followed by a 1×1 conv, all joined by filter concatenation]

[Figure: factorized convolutions — 1×3 convs (padding 1), a 1×5 conv (padding 2), and a 1×7 conv (padding 3) stacked and joined by filter concatenation]
[Figure: GoogLeNet with auxiliary classifiers — Input → Conv → Max-Pool → Conv → Max-Pool → stacked Inception modules with Max-Pool between stages → Avg-Pool → FC → Softmax; two auxiliary classifiers (Avg-Pool → Conv → FC → FC → Softmax) branch from intermediate Inception outputs]
[Figure: Inception module variants (a) and (b) — the previous layer feeds branches of 1×1 convs, 1×1 → 3×3 convs (further factorized into 1×3 and 3×1 convs), and pool → 1×1 conv, joined by filter concatenation]
[Figure: plain vs. residual block — stacked layers compute F(x) from input x; the plain block outputs y = F(x), while the residual block adds the identity shortcut to give y = F(x) + x]
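A minimal numpy sketch of the residual block's y = F(x) + x (the two-layer form of F and the ReLU placement are standard but assumed here; sizes are illustrative):

import numpy as np

def relu(z):
    return np.maximum(0, z)

def residual_block(x, W1, W2):
    """y = F(x) + x: two stacked layers plus the identity shortcut."""
    F = W2 @ relu(W1 @ x)  # F(x) from the stacked layers
    return relu(F + x)     # add the identity, then activate

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 1))
W1, W2 = rng.normal(size=(4, 4)), rng.normal(size=(4, 4))
print(residual_block(x, W1, W2).shape)  # (4, 1) — same shape as x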

[Figure: DenseNet — Input → Conv → Dense Block 1 → transition layer (Conv, Avg-Pool) → Dense Block 2 → transition layer → Dense Block 3 → Avg-Pool → FC → Softmax; R1, R2, R3 mark the densely connected units within blocks]
[Figure: NAS-style cells (a) and (b) — hidden states h(i-1) and h(i) pass through add nodes over 3×3/5×5/7×7 conv, avg, max, and identity ops, with filter concatenation producing h(i+1)]
[Figure: feature-map resolutions across network stages — 224×224 → 112×112 → 56×56 → 28×28 → 14×14]
[Figure: max pooling as image representation — the 4×4 input
  1 1 2 4
  5 6 7 8
  3 2 1 0
  1 2 3 4
pooled with a 2×2 kernel and a stride of 2 gives the 2×2 output
  6 8
  3 4
e.g. Max(1, 1, 5, 6) = 6]
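A minimal numpy sketch reproducing the figure's numbers: non-overlapping 2×2 blocks, max per block.

import numpy as np

X = np.array([[1, 1, 2, 4],
              [5, 6, 7, 8],
              [3, 2, 1, 0],
              [1, 2, 3, 4]])

# 2x2 kernel, stride 2: reshape into non-overlapping 2x2 blocks, take the max
Y = X.reshape(2, 2, 2, 2).max(axis=(1, 3))
print(Y)  # [[6 8]
          #  [3 4]]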
