ConvNet1
ConvNet1
COMP 3340
Convolutional Neural Network
for Image Classification
• Representing an image in a computer involves storing it as a grid of pixels, where each pixel represents the
smallest unit of the image.
• RGB (Red, Green, Blue) color model, where each color is represented by a combination of these three primary
colors.
• Channel: each value ranges from 0 to 255, (255, 0, 0) red, while (0, 255, 0) green, and (0, 0, 255) blue. A
combination of these, like (255, 255, 0), yellow.
• Variants:
• Low-shot learning
• Continual learning
• Applications:
• Face recognition
• Robotics
COMP3340 Applied Deep Learning [1] Reference
20
2.3 Boarder Impact
[-1,+1]
24
PIL stands for Python Imaging Library, and it's the original library that
enabled Python to deal with images.
25
26
27
2.5.1 Preliminaries for Image Classification
31
…
32
33
34
…
Accuracy: 35.39%
35
2.5.1.1 K-Nearest Neighbor
Wx+b
COMP3340 Applied Deep Learning [1] http://cs231n.stanford.edu/slides/2021/lecture_2.pdf
44
2.5.2.4 Neural Network
1*0.55+2*0.54+0.45=2.08
W11
b1 = 0.45
1 W12
W21
W22
b2 = 0.89
b3 = 0.96
80
2.5.2.2 Linear Classifier and Softmax
gradient descent
where