ALEXNET: THE BREAKTHROUGH CNN
ARCHITECTURE
DISCUSSIONS ABOUT ALEXNET
GANGAHARIDI NARAYAN KASHYAP ANIK PAUL RUPAM PAUL
AGENDA
HISTORICAL CONTEXT & MOTIVATION
ALEXNET ARCHITECTURE OVERVIEW
DETAILED LOOK AT CONVOLUTIONAL
LAYERS
POOLING, NORMALIZATION &
REGULARIZATION
TRAINING METHODOLOGY
IMPACT AND LEGACY OF ALEXNET
Historical Context & Motivation
Background:
Introduced in 2012, AlexNet won the ImageNet Large Scale Visual
Recognition Challenge (ILSVRC) with a top-5 error rate of 15.3%.
Developed by Alex Krizhevsky, Ilya Sutskever, and Geoffrey
Hinton from the University of Toronto.
Motivation:
Conventional computer-vision methods had reached a plateau in
accuracy and efficiency.
There was a clear need for models that could leverage GPUs,
large-scale datasets, and deep architectures.
Image classification problem:
1000 classes
650K neurons
62M parameters
Training: 1.2M images
Validation: 50K images
Test: 150K images
ARCHITECTURE
5 CONVOLUTIONAL LAYERS:
• EARLY LAYERS CAPTURE LOW-LEVEL FEATURES LIKE EDGES AND
TEXTURES.
• LATER LAYERS CAPTURE COMPLEX PATTERNS AND ABSTRACT
FEATURES.
3 FULLY CONNECTED LAYERS:
THESE LAYERS INTEGRATE THE FEATURES EXTRACTED BY THE
CONVOLUTIONAL LAYERS FOR THE FINAL CLASSIFICATION.
KEY INNOVATIONS:
• USE OF RELU ACTIVATIONS FOR FASTER TRAINING.
• LOCAL RESPONSE NORMALIZATION (LRN) TO IMPROVE
GENERALIZATION.
• OVERLAPPING MAX POOLING FOR SPATIAL DOWNSAMPLING.
• PARALLEL TRAINING ACROSS TWO GPUS.
Fig. AlexNet
block diagram
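The parameter figure quoted earlier (~62M) can be sanity-checked from these layer shapes. A plain-Python tally is sketched below; the fan-in values reflect the two-GPU split described in the original paper:

```python
# Back-of-the-envelope parameter count for AlexNet (layer shapes from the
# original paper; some fan-ins are halved because of the two-GPU split).
layers = [
    # (name, number of filters/units, fan-in per filter/unit)
    ("conv1", 96,   11 * 11 * 3),
    ("conv2", 256,  5 * 5 * 48),    # each GPU sees 48 of conv1's 96 maps
    ("conv3", 384,  3 * 3 * 256),   # cross-GPU connection
    ("conv4", 384,  3 * 3 * 192),
    ("conv5", 256,  3 * 3 * 192),
    ("fc6",   4096, 6 * 6 * 256),   # flattened conv5 output after pooling
    ("fc7",   4096, 4096),
    ("fc8",   1000, 4096),          # 1000 ImageNet classes
]

total = sum(n * fan_in + n for name, n, fan_in in layers)  # weights + biases
print(f"{total:,}")  # roughly 61 million, consistent with the ~62M figure
```

Almost all of the parameters sit in the fully connected layers (fc6 alone accounts for more than half), which is why dropout is applied there.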
TRAINING METHODOLOGY
Training Details:
Optimizer: Stochastic Gradient Descent (SGD)
Overcoming Hardware Limitations:
The network was split across two GPUs to manage its high
computational and memory requirements.
Epochs & Data:
Training on millions of images from the ImageNet
dataset.
Fig. 2: Two examples of the neural network
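The paper gives an explicit SGD update rule with momentum 0.9, weight decay 0.0005, and an initial learning rate of 0.01. A minimal NumPy sketch of one step, using toy weight and gradient values:

```python
import numpy as np

# One SGD step with momentum and weight decay, following the update rule
# in the AlexNet paper: v <- 0.9*v - wd*lr*w - lr*grad;  w <- w + v.
lr, momentum, weight_decay = 0.01, 0.9, 0.0005

w = np.array([0.5, -0.3])      # toy weights
grad = np.array([0.2, 0.1])    # toy mini-batch gradient
v = np.zeros_like(w)           # momentum buffer, persists across steps

v = momentum * v - weight_decay * lr * w - lr * grad
w = w + v
print(w)
```

In practice the learning rate was divided by 10 whenever the validation error stopped improving.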
POOLING, NORMALIZATION &
REGULARIZATION
POOLING MECHANISM:
OVERLAPPING MAX POOLING, WHICH HELPS DOWNSAMPLE THE
FEATURE MAPS WHILE PRESERVING CRITICAL FEATURES.
NORMALIZATION:
LOCAL RESPONSE NORMALIZATION (LRN) ADDS LATERAL INHIBITION,
SIMULATING A KIND OF COMPETITION BETWEEN NEURONS.
REGULARIZATION:
DROPOUT: A PROBABILITY-BASED TECHNIQUE TO DROP NEURONS
DURING TRAINING THAT HELPS REDUCE OVERFITTING
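Both ideas can be sketched in a few lines of NumPy: overlapping max pooling uses a 3×3 window with stride 2 (window larger than stride, as in AlexNet), and dropout zeroes neurons with probability 0.5. The rescaling shown is the "inverted dropout" variant common today; the original paper instead scaled activations at test time.

```python
import numpy as np

def max_pool(x, size=3, stride=2):
    """Overlapping max pooling (window larger than stride), as in AlexNet."""
    h, w = x.shape
    out_h = (h - size) // stride + 1
    out_w = (w - size) // stride + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = x[i*stride:i*stride+size, j*stride:j*stride+size].max()
    return out

x = np.arange(25, dtype=float).reshape(5, 5)
pooled = max_pool(x)
print(pooled.shape)  # (2, 2) -- adjacent 3x3 windows share a row/column

# Dropout at p = 0.5, the rate AlexNet used in its first two FC layers.
rng = np.random.default_rng(0)
acts = np.ones(8)
mask = rng.random(8) < 0.5          # keep each neuron with probability 0.5
dropped = acts * mask / 0.5         # inverted-dropout rescaling
```

Because the pooling windows overlap, neighboring outputs share inputs, which the authors observed made the network slightly harder to overfit.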
•The CNN is split across two GPUs, which communicate only at certain layers; some convolutional layers are
restricted to processing data only on their own GPU.
•Conv1 uses 96 filters of size 11×11×3 with stride 4, followed by ReLU, response normalization, and max pooling.
•Conv2 has 256 filters of size 5×5×48, also followed by ReLU, response normalization, and max pooling.
•Conv3–5 use 3×3 filters (384, 384, and 256 filters respectively); Conv3 connects across both GPUs, while Conv4 and
Conv5 are GPU-local; Conv5 is followed by max pooling.
•Three fully connected layers with 4096 neurons each follow, all using ReLU, and are fully connected across all
previous layer outputs.
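The spatial sizes implied by these hyperparameters can be checked with the standard convolution output-size formula. The input is taken as 227×227 below (the paper states 224×224, but 227 is the size that makes the stride-4 arithmetic come out exactly):

```python
def out_size(n, k, stride=1, pad=0):
    # Standard conv/pool output-size formula: floor((n + 2p - k) / s) + 1
    return (n + 2 * pad - k) // stride + 1

n = 227
n = out_size(n, 11, stride=4)   # conv1 -> 55
n = out_size(n, 3, stride=2)    # pool  -> 27
n = out_size(n, 5, pad=2)       # conv2 -> 27
n = out_size(n, 3, stride=2)    # pool  -> 13
n = out_size(n, 3, pad=1)       # conv3 -> 13
n = out_size(n, 3, pad=1)       # conv4 -> 13
n = out_size(n, 3, pad=1)       # conv5 -> 13
n = out_size(n, 3, stride=2)    # pool  -> 6
print(n, n * n * 256)           # 6x6x256 = 9216 inputs to the first FC layer
```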
THE LASTING LEGACY OF
ALEXNET IN AI
1. Foundation for Advanced Networks: AlexNet paved the way for
deeper CNN architectures like VGG and ResNet, becoming a cornerstone
in modern deep learning.
2. Broader Adoption of Deep Learning: Its success accelerated the use
of deep learning across various fields, including computer vision and NLP.
3. Ongoing Research Benchmark: AlexNet remains a key reference for
evaluating and developing new AI models and techniques.