CS462 Assignment2
CS462 Assignment2
1
Misr University for Science and Technology
College of Information Technology
In ............. , adjustment of the weights of the network is performed after the epoch
a. Stochastic mode c. Machine Learning mode
b. Batch mode d. Adaptive Learning mode
In ............. , adjustment of the weights of the network is performed on sample-by-sample basis
a. Stochastic mode c. Machine Learning mode
b. Batch mode d. Adaptive Learning mode
CNN is the acronyms of
a. Concurrent Neural Networks c. Convolutional Neural Networks
b. Convectional Neural Networks d. All of the above
Which stage in the pattern recognition system the Convolution layers play its role?
a. Data Acqquisition c. Feature Extraction
b. Preprocessing d. Classification
The RBF is a multilayer network?
a. true
b. false
In the SVM, support vectors are chosen data samples that:
a. Have non-zero alpha value c. Maximize the margin between classes
b. Have zero alpha value d. (a) and (c)
Both MLP and CNN can be used for Image classification however MLP takes vector as
input and CNN takes tensor as input so CNN can understand spatial relation(relation
between nearby pixels of image)between pixels of images better thus for complicated
images CNN will perform better than MLP
J. Discuss the different components of the CNN and list some of the famous CNN
architectures.
Convolution is a mathematical operation on an image and kernel to detect a
particular feature from the input image and get the result having information
about that feature
Pooling is the process of reducing the size of the data while keeping the most
important features
Flattening is converting the data into a 1-dimensional array for inputting it to the
next layer.
K. Discuss the advantages and disadvantages of the CNN.
(Adv) Very accurate at image recognition and classification.
(Adv) Weight sharing
(Adv) CNNs make use of the same knowledge across all image locations.
(DisAdv) A lot of training data is needed for the CNN to be effective.
(DisAdv) CNNs tend to be much slower because of operations like maxpool.
L. Discuss the kernel trick and state the main condition that should be satisfied in
the kernel.
The “trick” is that kernel methods represent the data only through a set of
pairwise similarity comparisons between the original data observations x (with
the original coordinates in the lower dimensional space), instead of explicitly
applying the transformations ϕ(x)ز
M. State and Discuss Cover’s Theorem, and prove its correctness with a numeric
example.
A dichotomy {C1, C2} is said to be φ-separable if there exist a m1-dimensional
vector w such that we may write