Lecture 8.4
Lecture 8.4
Neural Networks
Instructor: Dr. Liang Zhang
TAs: Jiacheng Zhang, Ruoyao Wang
College of Information Science
University of Arizona
Quiz at Tophat (Join: 436056)
Use the web app or mobile app to answer:
With your group, come to consensus on the correct answer, and discuss
With your group, come to consensus on the correct answer, and discuss
Having two different units model exactly the same function is probably not useful
+ better symmetry-breaking
W, _, _ = SVD(random matrix)
● linear unit:
ImageNet provides over 14 million images, tagged (but not segmented) for more
than 20 thousand word senses: 1633 images tagged with canine, 1380 images
tagged with motorcycle, etc. How might you use this data to pre-train your model?
How would you handle the architectural differences between the pre-training task
(predicting tags) and the real task (predicting segmentations)?
Solution: Pre-train with ImageNet & Fine-tune for
Segmentation
Step 1: Pre-train on ImageNet
● Replace final fully connected layers (classification) with pixel-wise segmentation layers (fully
convolutional).
● Add upsampling or deconvolution layers to generate segmentation masks.
● Use encoder-decoder architectures like U-Net for better segmentation performance.