Alexnet and Data Augmentation

AlexNet is a convolutional neural network architecture developed by Alex Krizhevsky and others, notable as the first architecture to use GPUs to speed up training. It comprises 5 convolutional layers and 3 max-pooling layers, and it employs techniques such as ReLU activation, dropout, and data augmentation to improve performance and reduce overfitting. The model was trained on GTX 580 GPUs, has over 60 million parameters, and takes input images of size 227x227x3.


AlexNet


1. The convolutional neural network (CNN) architecture known as AlexNet was created by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton, who served as Krizhevsky’s PhD advisor.
2. This was the first architecture that used GPUs to boost training performance.
3. AlexNet consists of 5 convolutional layers, 3 max-pooling layers, 2 normalization layers, 2 fully connected layers and 1 softmax layer.
4. Each convolutional layer consists of a convolution filter and a non-linear activation function, ReLU. The pooling layers perform max-pooling, and the input size is fixed because of the fully connected layers.
5. The input size is quoted in most places as 224x224x3, but due to the padding involved it works out to 227x227x3. On top of all this, AlexNet has over 60 million parameters. (A layer-by-layer sketch is given after this list.)
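As a rough illustration of points 3-5, here is a minimal sketch of the layer stack in PyTorch. This is not the authors' original code: the two local response normalization layers are omitted for brevity, and the single-stream (one-GPU) version of the channel sizes is used.

    import torch.nn as nn

    # Minimal sketch of the AlexNet layer stack (LRN layers omitted).
    alexnet = nn.Sequential(
        nn.Conv2d(3, 96, kernel_size=11, stride=4),     # conv1: 227x227x3 -> 55x55x96
        nn.ReLU(inplace=True),
        nn.MaxPool2d(kernel_size=3, stride=2),           # pool1: 55x55 -> 27x27
        nn.Conv2d(96, 256, kernel_size=5, padding=2),    # conv2
        nn.ReLU(inplace=True),
        nn.MaxPool2d(kernel_size=3, stride=2),           # pool2: 27x27 -> 13x13
        nn.Conv2d(256, 384, kernel_size=3, padding=1),   # conv3
        nn.ReLU(inplace=True),
        nn.Conv2d(384, 384, kernel_size=3, padding=1),   # conv4
        nn.ReLU(inplace=True),
        nn.Conv2d(384, 256, kernel_size=3, padding=1),   # conv5
        nn.ReLU(inplace=True),
        nn.MaxPool2d(kernel_size=3, stride=2),           # pool3: 13x13 -> 6x6
        nn.Flatten(),                                    # 256 * 6 * 6 = 9216 features
        nn.Dropout(p=0.5),
        nn.Linear(256 * 6 * 6, 4096),                    # fully connected 1
        nn.ReLU(inplace=True),
        nn.Dropout(p=0.5),
        nn.Linear(4096, 4096),                           # fully connected 2
        nn.ReLU(inplace=True),
        nn.Linear(4096, 1000),                           # softmax layer (logits; softmax applied in the loss)
    )

With a 227x227x3 input this stack produces a 1000-way output, and counting the convolutional and fully connected weights gives roughly the 60 million parameters quoted above.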

Key Features:

 ‘ReLU’ is used as the activation function rather than ‘tanh’

 Batch size of 128

 SGD with momentum is used as the learning algorithm (see the training sketch after this list)

 Data augmentation is carried out: flipping, jittering, cropping, colour normalization, etc.
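To connect the batch size and the optimizer above, here is a hedged sketch of one training step. The momentum (0.9), weight decay (0.0005) and initial learning rate (0.01) are the values reported in the AlexNet paper, and `train_loader` is a hypothetical data loader yielding batches of 128 images; neither is prescribed by these notes.

    import torch
    import torch.nn as nn

    criterion = nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(
        alexnet.parameters(),   # the model sketched in the previous block
        lr=0.01,                # initial learning rate reported in the paper
        momentum=0.9,           # SGD momentum
        weight_decay=5e-4,      # L2 weight decay
    )

    for images, labels in train_loader:   # hypothetical loader with batch_size=128
        optimizer.zero_grad()
        loss = criterion(alexnet(images), labels)
        loss.backward()                    # backpropagate
        optimizer.step()                   # SGD-with-momentum update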

AlexNet was trained on GTX 580 GPUs with only 3 GB of memory each, which couldn’t fit the entire network. So the network was split across 2 GPUs, with half of the neurons (feature maps) on each GPU.
Data Augmentation

Overfitting can be reduced by showing the Neural Net several variations of the same image. This also helps produce more data and forces the Neural Net to focus on the essential features.

Data augmentation artificially increases the size of the training set: a batch of "new" data is created from existing data by means of translation, flipping, and added noise.
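As a small sketch of these three operations, a training-time transform pipeline might look like the following. The use of torchvision here is an assumption of this example, not something the notes prescribe.

    import torch
    from torchvision import transforms

    # Sketch of an augmentation pipeline: translation, horizontal flip, light noise.
    augment = transforms.Compose([
        transforms.RandomAffine(degrees=0, translate=(0.1, 0.1)),      # small random translation
        transforms.RandomHorizontalFlip(p=0.5),                        # mirror half of the images
        transforms.ToTensor(),
        transforms.Lambda(lambda x: x + 0.01 * torch.randn_like(x)),   # light Gaussian noise
    ])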

 Augmentation by Mirroring

Consider that our training set contains a picture of a cat. A cat can also be recognised in its mirror image. This means that by simply flipping the image about the vertical axis, we can double the size of the training dataset.

Data Augmentation by Mirroring
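A minimal sketch of this doubling, assuming the training images are already stacked in a tensor of shape (N, C, H, W):

    import torch

    def mirror_double(images: torch.Tensor) -> torch.Tensor:
        """Return the original images plus their mirror images.

        Flipping along the last (width) dimension mirrors each image
        about the vertical axis, doubling the training set from N to 2N.
        """
        mirrored = torch.flip(images, dims=[-1])
        return torch.cat([images, mirrored], dim=0)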


 Augmentation by Random Cropping of Images

Randomly cropping the original image will also produce additional data that is simply the original data shifted.

For the network’s inputs, the creators of AlexNet selected random crops with dimensions of 227 by 227 from within the 256 by 256 image boundary. They multiplied the size of the data by 2048 using this technique.

Data Augmentation by Random Cropping
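A hedged sketch of such a random crop follows. (On the factor of 2048: the original paper quotes 224-pixel crops, for which there are (256 − 224)² = 1024 possible crop positions, doubled to 2048 by horizontal reflection.)

    import torch

    def random_crop(image: torch.Tensor, size: int = 227) -> torch.Tensor:
        """Randomly crop a (C, 256, 256) image tensor to (C, size, size)."""
        _, h, w = image.shape
        top = torch.randint(0, h - size + 1, (1,)).item()    # random vertical offset
        left = torch.randint(0, w - size + 1, (1,)).item()   # random horizontal offset
        return image[:, top:top + size, left:left + size]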

Dropout

During dropout, a neuron is removed from the neural network with a probability of 0.5. A dropped neuron does not make any contribution to either forward or backward propagation. In effect, each input is processed by a different neural network architecture. The learned weight parameters are therefore more robust and less prone to overfitting.
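A small sketch of this behaviour using PyTorch's dropout, which implements "inverted" dropout: surviving activations are scaled up by 1/(1 − p) during training so nothing needs rescaling at test time.

    import torch
    import torch.nn as nn

    drop = nn.Dropout(p=0.5)       # each activation is zeroed with probability 0.5
    x = torch.ones(1, 8)

    drop.train()
    print(drop(x))   # roughly half the entries become 0; survivors are scaled by 2.0

    drop.eval()
    print(drop(x))   # dropout is disabled at test time; the input passes through unchanged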

AlexNet Summary
