Data Science Interview Preparation (#DAY 14)

The document provides information about various deep learning models:
- AlexNet was an influential early CNN that achieved breakthrough results on the ImageNet challenge in 2012. It had 5 convolutional layers and 3 fully connected layers.
- VGGNet improved on AlexNet by replacing larger filters with multiple 3x3 filters. It had 16-19 layers and popularized the use of very deep CNNs.
- ResNet introduced "skip connections" that allowed training of even deeper networks over 150 layers while maintaining lower complexity than VGGNet.

Uploaded by

ARPAN MAITY

DATA SCIENCE INTERVIEW PREPARATION
(30 Days of Interview Preparation)

# DAY 14

Q1. What is AlexNet?
Answer:
Alex Krizhevsky, Ilya Sutskever and Geoffrey Hinton created the neural network architecture
called ‘AlexNet’ and won the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) in 2012.
They trained their network to classify 1.2 million high-resolution images into 1000 different classes,
using 60 million parameters and 650,000 neurons. The training was done on two GPUs with a split-layer
scheme because a single GPU of that time did not have enough memory to hold the whole network.
AlexNet is a convolutional neural network that has had a large impact on the field
of machine learning, specifically in the application of deep learning to machine vision. The network
had an architecture very similar to LeNet by Yann LeCun et al. but was deeper, with more filters per
layer and with stacked convolutional layers. It consists of 11×11, 5×5 and 3×3 convolutions, max
pooling, dropout, data augmentation, ReLU activations and SGD with momentum. ReLU activations
are attached after every convolutional and fully connected layer. AlexNet was trained for about
six days simultaneously on two Nvidia GeForce GTX 580 GPUs, which is why the
network is split into two pipelines.
Architecture

AlexNet contains eight layers with weights: the first five are convolutional, and the remaining three are
fully connected. The output of the last fully connected layer is fed to a 1000-way softmax which
produces a distribution over the 1000 class labels. The network maximises the multinomial logistic
regression objective, which is equivalent to maximising the average across training cases of the
log-probability of the correct label under the prediction distribution. The kernels of the second, fourth and
fifth convolutional layers are connected only to those kernel maps in the previous layer which
reside on the same GPU. The kernels of the third convolutional layer are connected to all the kernel maps
in the second layer. The neurons in the fully connected layers are connected to all the neurons in the previous
layer.

In short, AlexNet contains five convolutional layers and three fully connected layers. ReLU is applied
after every convolutional and fully connected layer. Dropout is applied before the first and
second fully connected layers. The network has 62.3 million parameters and needs 1.1 billion
computation units in a forward pass. The convolutional layers, which account for only 6% of
all the parameters, consume 95% of the computation.
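
The parameter figures above can be checked with a short calculation. The following is a minimal sketch, assuming the layer shapes of the published AlexNet architecture (96, 256, 384, 384 and 256 filters; a 6x6x256 input to the first fully connected layer) and ignoring the two-GPU split, so every kernel sees all input channels. The helper names are illustrative and not from the original text.

```python
# Parameter count for an AlexNet-style network, ignoring the two-GPU split.
# Layer shapes are assumptions taken from the published architecture.

def conv_params(kernel, in_ch, out_ch):
    """Weights plus one bias per output channel for a square convolution."""
    return kernel * kernel * in_ch * out_ch + out_ch

def fc_params(in_features, out_features):
    """Weights plus one bias per output unit for a fully connected layer."""
    return in_features * out_features + out_features

CONV_LAYERS = [  # (kernel size, input channels, output channels)
    (11, 3, 96), (5, 96, 256), (3, 256, 384), (3, 384, 384), (3, 384, 256),
]
FC_LAYERS = [  # (input features, output features)
    (6 * 6 * 256, 4096), (4096, 4096), (4096, 1000),
]

conv_total = sum(conv_params(*layer) for layer in CONV_LAYERS)
fc_total = sum(fc_params(*layer) for layer in FC_LAYERS)
total = conv_total + fc_total
conv_share = conv_total / total

print(f"conv: {conv_total:,}  fc: {fc_total:,}  total: {total:,}")
print(f"conv share of parameters: {conv_share:.1%}")
```

Under these assumptions the total comes to about 62.4 million parameters, with the convolutional layers holding roughly 6% of them, which is consistent with the figures quoted above.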

Q2. What is VGGNet?

Answer:
VGG16, the best-known VGGNet variant, consists of 16 weight layers (13 convolutional and 3 fully
connected) and is very appealing because of its very uniform architecture. Like AlexNet, it stacks
convolutional layers, but it uses only 3x3 convolutions with lots of filters. It was trained on 4 GPUs
for 2–3 weeks. It has been a widely preferred choice in the community for extracting features from
images. The weight configuration of VGGNet is publicly available and has been used in many
other applications and challenges as a baseline feature extractor. However, VGGNet consists of 138
million parameters, which can be a bit challenging to handle.
There are multiple variants of VGGNet (VGG16, VGG19, etc.) which differ only in the total number
of layers in the network. The structural details of the VGG16 network are described under Q3.

The idea behind having fixed-size kernels is that all the variable-size convolutional kernels used
in AlexNet (11x11, 5x5, 3x3) can be replicated by using multiple 3x3 kernels as
building blocks. The replication is in terms of the receptive field covered by the kernels.

Let’s consider an example. Say we have an input layer of size 5x5x1. Implementing a conv
layer with a kernel size of 5x5 and stride 1 results in an output feature map of 1x1. The same
output feature map can be obtained by implementing two 3x3 conv layers with a stride of 1:
the first produces a 3x3 feature map and the second reduces it to 1x1.

Now, let’s look at the number of variables that need to be trained. For a 5x5 conv layer filter, the
number of variables is 25. On the other hand, two conv layers of kernel size 3x3 have a total of
3x3x2=18 variables (a reduction of 28%).
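
The 5x5 versus two 3x3 comparison above can be reproduced in a few lines. This is a minimal sketch, assuming single-channel filters without bias terms as in the example; the function names are illustrative.

```python
def conv_output_size(size, kernel, stride=1, padding=0):
    """Spatial size of a convolution output along one dimension."""
    return (size + 2 * padding - kernel) // stride + 1

def stacked_receptive_field(kernels):
    """Receptive field of stride-1 convolutions applied one after another."""
    field = 1
    for k in kernels:
        field += k - 1
    return field

# One 5x5 convolution on a 5x5 input.
one_5x5 = conv_output_size(5, kernel=5)

# Two 3x3 convolutions on the same input: 5 -> 3 -> 1.
two_3x3 = conv_output_size(conv_output_size(5, kernel=3), kernel=3)

weights_5x5 = 5 * 5
weights_3x3_pair = 2 * 3 * 3
reduction = 1 - weights_3x3_pair / weights_5x5

print(one_5x5, two_3x3, stacked_receptive_field([3, 3]))
print(f"weights: {weights_5x5} vs {weights_3x3_pair} ({reduction:.0%} fewer)")
```

By the same rule, three stacked 3x3 layers cover a 7x7 receptive field, which is how small kernels can stand in for larger ones.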

Q3. What is VGG16?
Answer:
VGG16: It is a convolutional neural network model proposed by K. Simonyan and A. Zisserman
from the University of Oxford in the paper “Very Deep Convolutional Networks for Large-Scale
Image Recognition”. The model achieves 92.7% top-5 test accuracy on ImageNet, a dataset
of over 14 million images; the ILSVRC subset used for the challenge covers 1000 classes. It was one
of the best-known models submitted to ILSVRC-2014. It improves on AlexNet by replacing the large
kernel-sized filters (11 and 5 in the first and second convolutional layers, respectively) with multiple
3×3 kernel-sized filters one after another. VGG16 was trained for weeks using NVIDIA Titan Black GPUs.

The Architecture
The architecture depicted below is VGG16.

The input to the conv1 layer is a fixed-size 224 x 224 RGB image. The image is passed through
a stack of convolutional (conv.) layers, where filters are used with a very small receptive field:
3×3 (which is the smallest size that captures the notion of left/right, up/down, centre). In one of the
configurations, it also utilises 1×1 convolution filters, which can be seen as a linear
transformation of the input channels. The convolution stride is fixed to 1 pixel; the spatial padding
of the conv. layer input is such that the spatial resolution is preserved after convolution, i.e. the
padding is 1 pixel for 3×3 conv. layers. Spatial pooling is carried out by five max-pooling layers,
which follow some of the conv. layers. Max-pooling is performed over a 2×2 pixel window, with
stride 2.
Three Fully-Connected (FC) layers follow the stack of convolutional layers (which has a different
depth in different architectures): the first two have 4096 channels each, and the third performs 1000-way
ILSVRC classification and thus contains 1000 channels. The final layer is the softmax layer. The
configuration of the fully connected layers is the same in all the networks.
All hidden layers are equipped with the rectification (ReLU) non-linearity. It is also noted that none of
the networks (except for one) contain Local Response Normalisation (LRN): such normalisation
does not improve the performance on the ILSVRC dataset, but leads to increased memory
consumption and computation time.
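
To make the layer description concrete, the sketch below traces the feature-map size through the stack and adds up the parameters. It assumes the channel widths of the paper's 16-layer configuration (64, 128, 256, 512, 512), which are not listed in the text above, and counts one bias per filter or unit; the names are illustrative.

```python
# Numbers are conv output channels; "M" marks a 2x2 max-pooling layer.
# Channel widths are an assumption taken from the 16-layer configuration.
VGG16_CONFIG = [64, 64, "M", 128, 128, "M", 256, 256, 256, "M",
                512, 512, 512, "M", 512, 512, 512, "M"]
FC_SIZES = [4096, 4096, 1000]

def trace_vgg16(input_size=224, input_channels=3):
    """Return (final spatial size, final channels, total parameters)."""
    size, channels, params = input_size, input_channels, 0
    for item in VGG16_CONFIG:
        if item == "M":
            size //= 2  # 2x2 window with stride 2 halves the feature map
        else:
            # 3x3 conv, stride 1, padding 1: spatial size is unchanged
            params += 3 * 3 * channels * item + item
            channels = item
    features = size * size * channels  # flattened input to the first FC layer
    for width in FC_SIZES:
        params += features * width + width
        features = width
    return size, channels, params

final_size, final_channels, total_params = trace_vgg16()
print(final_size, final_channels, f"{total_params:,}")
```

Five pooling steps reduce 224 to 7, so the first fully connected layer receives 7 x 7 x 512 values; under these assumptions the total is roughly 138 million parameters, most of them in the fully connected layers.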

Q4. What is ResNet?

Answer:
At ILSVRC 2015, the so-called Residual Neural Network (ResNet) by Kaiming He et al.
introduced a novel architecture with “skip connections” and heavy use of batch normalisation.
Such skip connections are similar in spirit to the gated units or gated recurrent units that were
recently applied with success in RNNs. Thanks to this technique, they were able
to train a network with 152 layers while still having lower complexity than VGGNet. It achieves a
top-5 error rate of 3.57%, which beats human-level performance on this dataset.
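
The idea of a skip connection can be illustrated with a toy block. This is a simplified sketch, not the ResNet implementation: it assumes small dense layers in place of convolutions and omits batch normalisation, so that only the "add the input back" step is shown. All names are illustrative.

```python
def relu(vector):
    """Element-wise rectification."""
    return [max(0.0, v) for v in vector]

def linear(weights, vector):
    """Multiply a square weight matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in weights]

def residual_block(x, w1, w2):
    """Compute relu(F(x) + x), where F(x) = w2 * relu(w1 * x)."""
    residual = linear(w2, relu(linear(w1, x)))
    return relu([r + xi for r, xi in zip(residual, x)])

x = [1.0, -2.0, 3.0]
zeros = [[0.0] * 3 for _ in range(3)]

# With all-zero weights F(x) is zero, so the block passes relu(x) through.
identity_out = residual_block(x, zeros, zeros)
print(identity_out)
```

Because the input is added back, a block whose weights contribute nothing still passes the signal on, which is one common explanation for why very deep stacks of such blocks remain trainable.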

Q5. What is HAAR CASCADE?

Answer:
Haar Cascade: It is a machine learning object detection algorithm used to identify objects in
an image or video, based on the concept of features proposed by Paul Viola and Michael
Jones in their paper "Rapid Object Detection using a Boosted Cascade of Simple Features" in 2001.
It is a machine learning-based approach where a cascade function is trained from a lot of positive
and negative images. It is then used to detect objects in other images.
The algorithm has four stages:
• Haar Feature Selection
• Creating Integral Images
• Adaboost Training
• Cascading Classifiers
It is well known for being able to detect faces and body parts in an image but can be trained to
identify almost any object.
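
The first two stages listed above can be sketched in plain Python. The example below is an illustration under simple assumptions (a tiny greyscale image stored as a list of rows, and a single two-rectangle feature); it does not cover AdaBoost training or the cascade, and the function names are hypothetical.

```python
def integral_image(image):
    """Return a (rows+1) x (cols+1) table; entry [r][c] sums image[:r][:c]."""
    rows, cols = len(image), len(image[0])
    table = [[0] * (cols + 1) for _ in range(rows + 1)]
    for r in range(rows):
        for c in range(cols):
            table[r + 1][c + 1] = (image[r][c] + table[r][c + 1]
                                   + table[r + 1][c] - table[r][c])
    return table

def rect_sum(table, top, left, height, width):
    """Sum of the pixels in a rectangle using four table lookups."""
    bottom, right = top + height, left + width
    return (table[bottom][right] - table[top][right]
            - table[bottom][left] + table[top][left])

def two_rect_feature(table, top, left, height, width):
    """Left half minus right half of a window (width must be even)."""
    half = width // 2
    return (rect_sum(table, top, left, height, half)
            - rect_sum(table, top, left + half, height, half))

# Dark left half, bright right half: a vertical edge.
image = [
    [1, 1, 9, 9],
    [1, 1, 9, 9],
    [1, 1, 9, 9],
]
table = integral_image(image)
edge_response = two_rect_feature(table, 0, 0, 3, 4)
print(edge_response)
```

Once the table is built, any rectangle sum costs four lookups regardless of its size, which is what makes evaluating many Haar features per window affordable.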

Q6. What is Transfer Learning?

Answer:
Transfer learning: It is a machine learning method where a model developed for one task is
reused as the starting point for a model on a second task.
Transfer learning differs from traditional machine learning in that it uses pre-trained
models, already trained for another task, to jump-start the development process on a new task
or problem.
The benefit of transfer learning is that it can shorten the time it takes to develop and
train a model by reusing pieces or modules of already developed models. This
speeds up the model training process and accelerates results.

Q7. What is Faster R-CNN?

Answer:
Faster R-CNN: It has two networks: a region proposal network (RPN) for generating region
proposals and a network that uses these proposals to detect objects. The main difference from
Fast R-CNN is that the latter uses selective search to generate the region proposals. The time
cost of generating region proposals is much smaller with the RPN than with selective search, because
the RPN shares most of its computation with the object detection network. In brief, the RPN ranks region
boxes (called anchors) and proposes the ones most likely to contain objects.
Anchors
Anchors play a very important role in Faster R-CNN. An anchor is a box. In the default
configuration of Faster R-CNN, there are nine anchors at each position of an image: three scales
combined with three aspect ratios. For example, 9 anchors can be placed at the position (320, 320)
of an image with size (600, 800).
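
The nine anchors at one position can be generated as follows. This is a minimal sketch, assuming the commonly cited defaults of three scales (128, 256, 512 pixels) and three aspect ratios (1:2, 1:1, 2:1); the function name and the (x1, y1, x2, y2) box layout are illustrative choices, not taken from the text.

```python
import math

def anchors_at(cx, cy, scales=(128, 256, 512), ratios=(0.5, 1.0, 2.0)):
    """Return (x1, y1, x2, y2) boxes centred on (cx, cy).

    Each scale s gives boxes of area s * s; ratio is height / width.
    """
    boxes = []
    for s in scales:
        for r in ratios:
            w = s / math.sqrt(r)
            h = s * math.sqrt(r)
            boxes.append((cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2))
    return boxes

anchors = anchors_at(320, 320)
for box in anchors:
    print(tuple(round(v, 1) for v in box))
```

Some of the larger anchors extend beyond a 600 x 800 image; how such boxes are clipped or ignored is an implementation choice that this sketch leaves out.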

Region Proposal Network:
The output of the region proposal network is a bunch of boxes/proposals that will eventually be
examined by a classifier and a regressor to check for the occurrence of objects. To be more
precise, the RPN predicts the probability of an anchor being background or foreground, and refines
the anchor.

Q8. What is RCNN?

Answer:
To bypass the problem of selecting a huge number of regions, Ross Girshick et al. proposed a
method that uses selective search to extract just 2000 regions from the image, which they
called region proposals. Therefore, instead of trying to classify a huge number of
regions, you can work with just 2000 regions.

Problems with R-CNN:

• It still takes a huge amount of time to train the network, as 2000 region proposals per image
have to be classified.
• It cannot be implemented in real time, as it takes around 47 seconds for each test image.
• The selective search algorithm is a fixed algorithm. Therefore, no learning happens
at that stage. This can lead to the generation of bad candidate region proposals.

Q9. What is GoogLeNet/Inception?
Answer:
The winner of the ILSVRC 2014 competition was GoogLeNet from Google. It achieved a top-5 error
rate of 6.67%. This was very close to human-level performance, which the organisers of the challenge
were now forced to evaluate. As it turns out, this was rather hard to do and required some human
training to beat GoogLeNet's accuracy. After a few days of training, the human expert (Andrej
Karpathy) was able to achieve a top-5 error rate of 5.1%(single model) and 3.6%(ensemble). The
network used a CNN inspired by LeNet but implemented a novel element dubbed the
inception module. This module is based on several very small convolutions in order to reduce the
number of parameters drastically. The network also used batch normalisation, image distortions and
RMSprop. The architecture consisted of a 22-layer-deep CNN but reduced the number of parameters
from 60 million (AlexNet) to 4 million.
It contains 1×1 convolutions in the middle of the network, and global average pooling is used at the end
of the network instead of fully connected layers. These two techniques come from another
paper, “Network In Network” (NIN). Another technique, the inception module, is to apply different
sizes/types of convolutions to the same input and to stack all the outputs.
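
One way to see how small convolutions cut the parameter count is to compare a 5x5 convolution applied directly with the same convolution preceded by a 1x1 reduction. The channel counts below (192 in, 16 after reduction, 32 out) are illustrative assumptions, and biases are ignored; this is a sketch of the idea rather than the network's full accounting.

```python
def conv_weights(kernel, in_ch, out_ch):
    """Number of weights in a square convolution, ignoring biases."""
    return kernel * kernel * in_ch * out_ch

IN_CH, REDUCED_CH, OUT_CH = 192, 16, 32  # illustrative channel counts

# 5x5 convolution applied directly to all input channels.
direct = conv_weights(5, IN_CH, OUT_CH)

# 1x1 convolution first shrinks the channels, then the 5x5 runs on fewer maps.
bottleneck = conv_weights(1, IN_CH, REDUCED_CH) + conv_weights(5, REDUCED_CH, OUT_CH)

saving = 1 - bottleneck / direct
print(direct, bottleneck, f"{saving:.1%} fewer weights")
```

With these numbers the 1x1 reduction removes roughly nine tenths of the weights for that branch, at the cost of one extra cheap layer.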

Q10. What is LeNet-5?
Answer:
LeNet-5, a pioneering 7-level convolutional network by LeCun et al. in 1998 that classifies digits,
was applied by several banks to recognise hand-written numbers on checks (cheques) digitised in
32x32 pixel greyscale input images. The ability to process higher-resolution images requires larger
and more convolutional layers, so the availability of computing resources constrains this technique.

LeNet-5 is a very simple network. It has only seven layers, among which there are three convolutional
layers (C1, C3 and C5), two sub-sampling (pooling) layers (S2 and S4), and one fully connected layer
(F6), followed by the output layer. Convolutional layers use 5 by 5 convolutions with stride 1.
Sub-sampling layers are 2 by 2 average pooling layers. Tanh (sigmoid-shaped) activations are used
throughout the network. Several interesting architectural choices were made in LeNet-5 that are not
very common in the modern era of deep learning.
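
The layer sequence above can be traced to see how a 32x32 input shrinks to a single position by C5. This is a minimal sketch, assuming no padding and the feature-map counts usually quoted for LeNet-5 (6, 16 and 120), which are not stated in the text above; the names are illustrative.

```python
LENET5 = [  # (layer name, kind, output channels); channel counts are assumed
    ("C1", "conv", 6),
    ("S2", "pool", 6),
    ("C3", "conv", 16),
    ("S4", "pool", 16),
    ("C5", "conv", 120),
]

def trace_lenet5(size=32):
    """Return (name, height, width, channels) after each layer."""
    shapes = []
    for name, kind, channels in LENET5:
        if kind == "conv":
            size = size - 5 + 1  # 5x5 kernel, stride 1, no padding
        else:
            size = size // 2     # 2x2 pooling with stride 2
        shapes.append((name, size, size, channels))
    return shapes

shapes = trace_lenet5()
for shape in shapes:
    print(shape)
```

The spatial size goes 28, 14, 10, 5 and finally 1, so C5 behaves like a fully connected layer over the S4 output before F6 and the output layer.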
