VGG Net

The document discusses Convolutional Neural Networks (CNNs) and their application in the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), which aims to classify images into 1,000 categories using a large dataset. It highlights the VGG architecture, developed by the Visual Geometry Group at Oxford University, which utilizes small 3x3 convolution kernels to improve accuracy while managing parameters. The VGG models, ranging from VGG11 to VGG19, emphasize the benefits of deeper networks and multiple convolution layers for enhanced image classification performance.


A Convolutional Neural Network (CNN, or ConvNet) is a special kind of multi-layer neural network designed to recognize visual patterns directly from pixel images with minimal preprocessing. The ImageNet project is a large visual database designed for use in visual object recognition research. The project runs an annual software contest, the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), in which programs compete to correctly classify and detect objects and scenes. Here I will talk about the CNN architecture of one of ILSVRC's top competitors, VGGNet.

What is ImageNet?

ImageNet is formally a project aimed at (manually) labeling and categorizing images into almost 22,000 separate object categories for the purpose of computer vision research.

However, when we hear the term “ImageNet” in the context of deep learning and
Convolutional Neural Networks, we are likely referring to the ImageNet Large Scale
Visual Recognition Challenge, or ILSVRC for short.
The goal of this image classification challenge is to train a model that can correctly
classify an input image into 1,000 separate object categories.

Models are trained on ~1.2 million training images with another 50,000 images for
validation and 100,000 images for testing.

These 1,000 image categories represent object classes that we encounter in our day-
to-day lives, such as species of dogs, cats, various household objects, vehicle types,
and much more. When it comes to image classification, the ImageNet challenge is
the de facto benchmark for computer vision classification algorithms — and the
leaderboard for this challenge has been dominated by Convolutional Neural
Networks and deep learning techniques since 2012.

VGGNet
Introduction-

The full name of VGG is the Visual Geometry Group, which belongs to the Department of Engineering Science at Oxford University. The group has released a series of convolutional network models beginning with "VGG", which can be applied to face recognition and image classification, ranging up to VGG16 and VGG19. The original purpose of VGG's research on the depth of convolutional networks was to understand how depth affects the accuracy of large-scale image classification and recognition. In VGG16 (the 16-layer configuration), in order to deepen the network while avoiding too many parameters, small 3x3 convolution kernels are used in all layers.

Network structure-

The input to VGG is a fixed-size 224x224 RGB image. The mean RGB value, computed over the training set, is subtracted from each pixel, and the resulting image is fed into the VGG convolutional network. Filters of size 3x3 (or, in one configuration, 1x1) are used, and the convolution stride is fixed at 1 pixel.

There are 3 fully connected layers in VGG, and the variants range from VGG11 to VGG19 according to the total number of convolutional plus fully connected layers. The smallest, VGG11, has 8 convolutional layers and 3 fully connected layers; the largest, VGG19, has 16 convolutional layers plus 3 fully connected layers. In addition, VGG does not place a pooling layer after every convolutional layer; there are 5 pooling layers in total, distributed after different convolutional layers. The figure below shows the VGG structure diagram.
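Before walking through the diagram, here is a rough, hypothetical sketch of the preprocessing described above (per-channel mean computed over the training set and subtracted from every image), assuming NumPy; the function and variable names are my own illustration, not from the original text:

import numpy as np

# `train_images` is assumed to be an array of shape (N, 224, 224, 3) holding
# the training set already resized/cropped to 224x224 RGB.
def compute_mean_rgb(train_images: np.ndarray) -> np.ndarray:
    # Average over all images and all pixel positions -> one value per channel.
    return train_images.mean(axis=(0, 1, 2))          # shape (3,)

def preprocess(image: np.ndarray, mean_rgb: np.ndarray) -> np.ndarray:
    # Subtract the training-set mean from each pixel before feeding the
    # image to the VGG convolutional network.
    return image.astype(np.float32) - mean_rgb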

VGG16 contains 16 weight layers and VGG19 contains 19. All VGG variants share exactly the same last three fully connected layers. The overall structure consists of 5 groups of convolutional layers, each followed by a max-pooling layer; the difference between the variants is that more and more cascaded convolutional layers are included in the five groups.
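A minimal sketch of this overall structure in PyTorch (my own illustration in the style of the torchvision implementation, not the authors' code; dropout and other training details are omitted):

import torch
import torch.nn as nn

# VGG16 convolutional configuration: 5 groups of 3x3 convolutions, each group
# ending in a 2x2 max-pooling layer. Numbers are output channels; 'M' marks a pool.
cfg_vgg16 = [64, 64, 'M', 128, 128, 'M', 256, 256, 256, 'M',
             512, 512, 512, 'M', 512, 512, 512, 'M']

def make_features(cfg, in_channels=3):
    layers = []
    for v in cfg:
        if v == 'M':
            layers.append(nn.MaxPool2d(kernel_size=2, stride=2))
        else:
            layers += [nn.Conv2d(in_channels, v, kernel_size=3, padding=1),
                       nn.ReLU(inplace=True)]
            in_channels = v
    return nn.Sequential(*layers)

features = make_features(cfg_vgg16)
classifier = nn.Sequential(            # the 3 fully connected layers shared by all variants
    nn.Linear(512 * 7 * 7, 4096), nn.ReLU(inplace=True),
    nn.Linear(4096, 4096), nn.ReLU(inplace=True),
    nn.Linear(4096, 1000),             # 1,000 ILSVRC object categories
)

x = torch.randn(1, 3, 224, 224)        # a dummy 224x224 RGB input
out = classifier(torch.flatten(features(x), 1))
print(out.shape)                       # torch.Size([1, 1000])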
Each convolutional layer in AlexNet contains only a single convolution and uses relatively large kernels (11x11 and 5x5 in the early layers). In VGGNet, each group of convolutional layers contains 2 to 4 convolution operations. The convolution kernel size is 3x3, the convolution stride is 1, the pooling kernel is 2x2, and the pooling stride is 2. The most obvious improvement of VGGNet is therefore to reduce the kernel size while increasing the number of convolutional layers.
Using multiple convolutional layers with smaller kernels instead of a single layer with a larger kernel reduces the number of parameters on the one hand, and, the authors argue, is equivalent to more non-linear mappings, which increases the network's expressive capacity.

Two consecutive 3x3 convolutions have the same receptive field as one 5x5 convolution, and three consecutive 3x3 convolutions have the same receptive field as one 7x7 convolution. The advantages of using three 3x3 convolutions instead of one 7x7 convolution are twofold: first, three ReLU layers instead of one make the decision function more discriminative; second, the number of parameters is reduced. For example, if the input and output both have C channels, 3 convolutional layers using 3x3 kernels require 3 x (3 x 3 x C x C) = 27 x C x C parameters, while 1 convolutional layer using a 7x7 kernel requires 7 x 7 x C x C = 49 x C x C. This can be seen as imposing a kind of regularization on the 7x7 convolution, forcing it to be decomposed through three 3x3 convolutions.
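To check this arithmetic, here is a small, hypothetical PyTorch snippet of my own (bias terms omitted so that the counts match the formula exactly):

import torch.nn as nn

C = 64  # example channel count; input and output both have C channels

# Three stacked 3x3 convolutions (with a ReLU after each) ...
stacked_3x3 = nn.Sequential(
    nn.Conv2d(C, C, 3, padding=1, bias=False), nn.ReLU(),
    nn.Conv2d(C, C, 3, padding=1, bias=False), nn.ReLU(),
    nn.Conv2d(C, C, 3, padding=1, bias=False), nn.ReLU(),
)
# ... versus a single 7x7 convolution with the same receptive field.
single_7x7 = nn.Conv2d(C, C, 7, padding=3, bias=False)

count = lambda m: sum(p.numel() for p in m.parameters())
print(count(stacked_3x3), 27 * C * C)   # 110592 110592
print(count(single_7x7), 49 * C * C)    # 200704 200704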

The 1x1 convolution layer is used mainly to increase the non-linearity of the decision function without affecting the receptive field of the convolutional layer. Although the 1x1 convolution operation itself is linear, the ReLU that follows it adds non-linearity.
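As an illustrative sketch (assuming PyTorch; the channel count 256 and spatial size 56x56 are arbitrary choices of mine), a 1x1 convolution followed by ReLU mixes channels at each position while leaving the spatial size, and hence the receptive field, unchanged:

import torch
import torch.nn as nn

block = nn.Sequential(
    nn.Conv2d(256, 256, kernel_size=1),   # 1x1 convolution, stride 1
    nn.ReLU(inplace=True),                # the non-linearity comes from the ReLU
)

x = torch.randn(1, 256, 56, 56)
print(block(x).shape)   # torch.Size([1, 256, 56, 56]) -- spatial size unchanged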
