0% found this document useful (0 votes)
11 views20 pages

AlexNet

Uploaded by

nericjoel2003
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views20 pages

AlexNet

Uploaded by

nericjoel2003
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 20

20IS712

Deep Learning
300 (3)
Dr. Lekshmi R. R.
Asst. Prof.
Department of Electrical & Electronics
Engineering
Amrita School of Engineering 1
AlexNet
Neural Network
• VGG – by Alex and team
• Solves many overfitting related problem
• Looks similar to LeNet
• There are 60 million parameters
• Architecture includes:
– 5 Convolution layers
– 3 Fully connected layer
• includes output layer
• More filers per layer
– Hence called as Deep Neural Network
• Includes dropout
Architecture
Convolution

• Convolution layers ReLu 𝑃𝑖𝑥𝑒𝑙, 𝑛 = 227,

• Input: 227x227x3 𝐹𝑖𝑙𝑡𝑒𝑟 𝑠𝑖𝑧𝑒, 𝑓 = 11


Stride 𝑠𝑖𝑧𝑒, 𝑠 = 4
• Filter: 96 P𝑎𝑑𝑑𝑖𝑛𝑔, 𝑝 = 0
– Size: 11x11 𝑛 + 2𝑝 − 𝑓 227 + 2 × 0 − 11
+1 +1
• Stride: 4 𝑠 4

= 55
Convolution

• Convolution layers ReLu 𝑃𝑖𝑥𝑒𝑙, 𝑛 = 227,

• Input: 227x227x3 𝐹𝑖𝑙𝑡𝑒𝑟 𝑠𝑖𝑧𝑒, 𝑓 = 11


Stride 𝑠𝑖𝑧𝑒, 𝑠 = 4
• Filter: 96 P𝑎𝑑𝑑𝑖𝑛𝑔, 𝑝 = 0
– Size: 11x11 𝑛 + 2𝑝 − 𝑓 227 + 2 × 0 − 11
+1 +1
• Stride: 4 𝑠 4

= 55
Max pooling

• Input: 56x56x96
𝑛 + 2𝑝 − 𝑓
• Filter size: 3x3 +1
𝑠
• Stride: 2
55 + 2 × 0 − 3
+1
2

= 27
Max pooling

• Input: 56x56x96
𝑛 + 2𝑝 − 𝑓
• Filter size: 3x3 +1
𝑠
• Stride: 2
55 + 2 × 0 − 3
+1
2

= 27
Convolution layer

• Convolution layers ReLu 𝑃𝑖𝑥𝑒𝑙, 𝑛 = 27,

• Input: 27x27x96 𝐹𝑖𝑙𝑡𝑒𝑟 𝑠𝑖𝑧𝑒, 𝑓 =5


Stride 𝑠𝑖𝑧𝑒, 𝑠 = 1
• Padding: 2 P𝑎𝑑𝑑𝑖𝑛𝑔, 𝑝 = 2
• Filter: 256 𝑛 + 2𝑝 − 𝑓 27 + 2 × 2 − 5
+1 +1
– Size: 5x5 𝑠 1

• Stride: 1 = 27
Max pooling

• Input:27x27x256
𝑛 + 2𝑝 − 𝑓
• Size: 3x3 𝑠
+1

• Stride: 2 27 + 2 × 0 − 3
+1
2
• Output:13x13x256
=13
Convolution layer

• Convolution layers ReLu 𝑃𝑖𝑥𝑒𝑙, 𝑛 = 27,

• Input: 13x13x256 𝐹𝑖𝑙𝑡𝑒𝑟 𝑠𝑖𝑧𝑒, 𝑓 =5


Stride 𝑠𝑖𝑧𝑒, 𝑠 = 1
• Padding: 1 P𝑎𝑑𝑑𝑖𝑛𝑔, 𝑝 = 2
• Filter: 384 𝑛 + 2𝑝 − 𝑓 13 + 2 × 1 − 3
+1 +1
– Size: 3x3 𝑠 1

• Stride: 1 = 13
Convolution layer

• Convolution layers ReLu 𝑃𝑖𝑥𝑒𝑙, 𝑛 = 27,

• Input: 13x13x384 𝐹𝑖𝑙𝑡𝑒𝑟 𝑠𝑖𝑧𝑒, 𝑓 =5


Stride 𝑠𝑖𝑧𝑒, 𝑠 = 1
• Padding: 1 P𝑎𝑑𝑑𝑖𝑛𝑔, 𝑝 = 2
• Filter: 384 𝑛 + 2𝑝 − 𝑓 13 + 2 × 1 − 3
+1 +1
– Size: 3x3 𝑠 1

• Stride: 1 = 13
Convolution layer

• Convolution layers ReLu 𝑃𝑖𝑥𝑒𝑙, 𝑛 = 27,

• Input: 13x13x384 𝐹𝑖𝑙𝑡𝑒𝑟 𝑠𝑖𝑧𝑒, 𝑓 =5


Stride 𝑠𝑖𝑧𝑒, 𝑠 = 1
• Padding: 1 P𝑎𝑑𝑑𝑖𝑛𝑔, 𝑝 = 2
• Filter: 256 𝑛 + 2𝑝 − 𝑓 13 + 2 × 1 − 3
+1 +1
– Size: 3x3 𝑠 1

• Stride: 1 = 13
Max pooling

• Input:13x13x256
𝑛 + 2𝑝 − 𝑓
• Size: 3x3 𝑠
+1

• Stride: 2 13 + 2 × 0 − 3
+1
2
• Output:6x6x256
=6
Fully connected

• Input:6x6x256 (flatten)
• Neurons: 4096
Fully connected

• Input:4096
• Neurons: 4096
Fully connected (Output)

• Input:4096
• Activation: Softmax
• Neurons: 1000
Activity
Image Operation
Output
227x227x Conv 11x11
3 4 stride 55x55x96
96 kernels

Input Operation Output


55x55x96 Max pool 3x3 27x27x96
2 stride

Input Operation Output


27x27x96 Conv 5x5 27x27x256
2 padding
256 kernels

Input Operation Output


27x27x256 Max pool 3x3 13x13x256
2 stride
Activity
Input Operation
Output
13x13x256 Conv 3x3
1 padding 13x13x384
384 kernels
Input Operation
13x13x384 Conv 3x3 Output
1 padding 13x13x384
384 kernels

Input Operation
Output
13x13x384 Conv 3x3
1 padding 13x13x256
256 kernels
Input Operation
Output
13x13x256 Max pool 3x3
2 stride 6x6x256
Thank you

You might also like