Deep Learning Important Studies
Image Classification:
Network in Network [Paper] [Note] [Torch Code]
Lin, Min, Qiang Chen, and Shuicheng Yan. "Network in network." arXiv preprint
arXiv:1312.4400 (2013).
Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for
large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014).
He, Kaiming, et al. "Deep residual learning for image recognition." Proceedings of
the IEEE Conference on Computer Vision and Pattern Recognition. 2016.
Popular Modules
Dropout [Paper] [Note]
Srivastava, Nitish, et al. "Dropout: a simple way to prevent neural networks from
overfitting." Journal of Machine Learning Research 15.1 (2014): 1929-1958.
Spatial pyramid pooling in deep convolutional networks for visual recognition
[Paper](http://arxiv.org/abs/1406.4729) [Note] [Code]
Object Detection:
Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun, Faster R-CNN: Towards Real-
Time Object Detection with Region Proposal Networks, arXiv:1506.01497.
Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi, You Only Look Once:
Unified, Real-Time Object Detection, arXiv:1506.02640.
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent
Neural Networks [Paper] [Note]
Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross Girshick. arXiv:1512.04143, 2015.
G-CNN: an Iterative Grid Based Object Detector [Paper]
Image Captioning:
Exploring Nearest Neighbor Approaches for Image Captioning [Paper]
Vinyals, Oriol, et al. "Show and tell: A neural image caption generator."
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
2015.
Image Generation:
Pixel Recurrent Neural Networks [Paper] [Note]
Scribbler: Controlling Deep Image Synthesis with Sketch and Color [Paper] [Note]
Patsorn Sangkloy, Jingwan Lu, et al. Scribbler: Controlling Deep Image Synthesis
with Sketch and Color. arXiv preprint arXiv:1612.00835, 2016.
Levent Karacan, Zeynep Akata, Aykut Erdem, Erkut Erdem. Learning to Generate
Images of Outdoor Scenes from Attributes and Semantic Layouts. arXiv
preprint arXiv:1612.00215, 2016.
Berthelot, David, Tom Schumm, and Luke Metz. "BEGAN: Boundary Equilibrium
Generative Adversarial Networks." arXiv preprint arXiv:1703.10717 (2017).
Activation Maximization
Synthesizing the preferred inputs for neurons in neural networks via deep generator
networks [Paper] [Note]
Style Transfer
A neural algorithm of artistic style [Paper] [Note]
Gatys L A, Ecker A S, Bethge M. A neural algorithm of artistic style. arXiv
preprint arXiv:1508.06576, 2015.
Perceptual losses for real-time style transfer and super-resolution [Paper] [Note]
Johnson J, Alahi A, Fei-Fei L. Perceptual losses for real-time style transfer and
super-resolution. arXiv preprint arXiv:1603.08155, 2016.
Super Resolution
Texture Enhancement via High-Resolution Style Transfer for Single-Image Super-
Resolution [Paper] [Note]
Il Jun Ahn, Woo Hyun Nam. Texture Enhancement via High-Resolution Style
Transfer for Single-Image Super-Resolution. arXiv preprint arXiv:1612.00085,
2016.
Others
Fully convolutional networks for semantic segmentation [Paper] [Note]
Open Courses
Online Books