This is a place to save the deep learning references that I believe are valuable and helpful.
- How to read and understand a scientific paper: a guide for non-scientists
Paper | Authors | Application | Comment |
---|---|---|---|
Efficient BackProp | Yann LeCun | - | π |
Practical recommendations for gradient-based training of deep architectures | Yoshua Bengio | - | π |
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift | Sergey Ioffe, Christian Szegedy | - | π |
Understanding the difficulty of training deep feedforward neural networks | Xavier Glorot, Yoshua Bengio | - | π |
Visualizing Data using t-SNE | Laurens van der Maaten, Geoffrey Hinton | - | π |
Accelerating t-SNE using Tree-Based Algorithms | Laurens van der Maaten | - | π |
- Tradeoff batch size vs. number of iterations to train a neural network
- Deep Learning Book π
- Neural Networks and Deep Learning π
- Tuning the learning rate in Gradient Descent π
- Softmax Regression π (see the sketch after this list)
- Visualizing MNIST: An Exploration of Dimensionality Reduction π
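Since softmax regression appears in the list above, here is a minimal NumPy sketch of the model together with its cross-entropy gradient; the toy data, shapes, and the learning rate of 0.5 are arbitrary illustrations, not values taken from any of the references.

```python
import numpy as np

def softmax(z):
    # Subtract the row-wise max for numerical stability before exponentiating.
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(probs, labels):
    # Mean negative log-likelihood of the true class.
    n = probs.shape[0]
    return -np.log(probs[np.arange(n), labels] + 1e-12).mean()

# Toy data: 4 samples, 3 features, 3 classes (hypothetical shapes).
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 3))
y = np.array([0, 2, 1, 0])
W = np.zeros((3, 3))
b = np.zeros(3)

for step in range(100):
    probs = softmax(X @ W + b)
    # Gradient of cross-entropy w.r.t. the logits is (probs - one_hot(y)) / n.
    grad = probs.copy()
    grad[np.arange(len(y)), y] -= 1.0
    grad /= len(y)
    W -= 0.5 * X.T @ grad   # learning rate 0.5, chosen arbitrarily
    b -= 0.5 * grad.sum(axis=0)

print(cross_entropy(softmax(X @ W + b), y))
```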
Paper | Authors | Application | Comment |
---|---|---|---|
Image Style Transfer Using Convolutional Neural Networks | Leon A. Gatys, Alexander S. Ecker, Matthias Bethge | Style Transfer | |
Depth Map Prediction from a Single Image using a Multi-Scale Deep Network | David Eigen, Christian Puhrsch, Rob Fergus | - | |
Dynamic Routing Between Capsules | Sara Sabour, Nicholas Frosst, Geoffrey E. Hinton | - | |
Densely Connected Convolutional Networks | Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger | - | My own implementation of DenseNet as a Python module can be found here. |
Gradient Based Learning Applied to Document Recognition | Yann LeCun, Léon Bottou, Yoshua Bengio, Patrick Haffner | - | |
How transferable are features in deep neural networks? | Jason Yosinski, Jeff Clune, Yoshua Bengio, Hod Lipson | - | π |
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification | Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun | - | π |
Image Segmentation Using Deep Learning: A Survey | Shervin Minaee, Yuri Boykov, Fatih Porikli, Antonio Plaza, Nasser Kehtarnavaz, Demetri Terzopoulos | - | |
Deep Residual Learning for Image Recognition | Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun | - | π |
The Importance of Skip Connections in Biomedical Image Segmentation | Michal Drozdzal, Eugene Vorontsov, Gabriel Chartrand, Samuel Kadoury, Chris Pal | - | π |
Fully Convolutional Networks for Semantic Segmentation | Evan Shelhamer, Jonathan Long, Trevor Darrell | - | π |
U-Net: Convolutional Networks for Biomedical Image Segmentation | Olaf Ronneberger, Philipp Fischer, and Thomas Brox | Semantic Segmentation | ππ |
DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs | Liang-Chieh Chen, George Papandreou, Iasonas Kokkinos, Kevin Murphy, and Alan L. Yuille | Semantic Segmentation | π Impl. |
Gated-SCNN: Gated Shape CNNs for Semantic Segmentation | Towaki Takikawa, David Acuna, Varun Jampani, Sanja Fidler | Semantic Segmentation | π Impl. |
FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation | Huikai Wu, Junge Zhang, Kaiqi Huang, Kongming Liang, Yizhou Yu | Semantic Segmentation | π Impl. |
On Power Jaccard Losses for Semantic Segmentation | David Duque-Arias, Santiago Velasco-Forero, Jean-Emmanuel Deschaud, Francois Goulette, Andres Serna, Etienne Decenciere and Beatriz Marcotegui | Semantic Segmentation | π Loss functions for segmentation tasks (see the sketch after this table) |
Locating Objects Without Bounding Boxes | Javier Ribera, David Güera, Yuhao Chen, Edward J. Delp | Object Location (loss function) | π Implementation |
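As mentioned in the power-Jaccard row above, the choice of loss matters for segmentation. Below is a minimal PyTorch sketch of a soft Jaccard (IoU) loss with an optional exponent p on the denominator terms, in the spirit of the power-Jaccard idea; the eps value, shapes, and exact formulation are my assumptions rather than the paper's code.

```python
import torch

def soft_jaccard_loss(pred, target, p=1.0, eps=1e-7):
    """Soft Jaccard (IoU) loss for binary segmentation.

    pred:   probabilities in [0, 1], shape (N, H, W)
    target: binary ground truth, same shape
    p:      exponent on the denominator terms; p = 1 gives the classic
            soft Jaccard loss, p > 1 follows the power-Jaccard idea.
    """
    inter = (pred * target).sum(dim=(1, 2))
    union = pred.pow(p).sum(dim=(1, 2)) + target.pow(p).sum(dim=(1, 2)) - inter
    return (1.0 - inter / (union + eps)).mean()

# Hypothetical usage with sigmoid outputs from a segmentation net:
logits = torch.randn(2, 64, 64)
target = (torch.rand(2, 64, 64) > 0.5).float()
loss = soft_jaccard_loss(torch.sigmoid(logits), target, p=2.0)
```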
- Review of Deep Learning Algorithms for Object Detection
- Depth Map Prediction from a Single Image using a Multi-Scale Deep Network
- Capsule Networks
- Deconvolution and Checkerboard Artifacts
- Convolutional Neural Networks (CNNs / ConvNets)
- An Overview of ResNet and its Variants π
- Losses used for Image Segmentation Problems
Paper | Authors | Application | Comment |
---|---|---|---|
Visualizing and Understanding Recurrent Networks | Andrej Karpathy, Justin Johnson, Li Fei-Fei | - | π |
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling | Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, Yoshua Bengio | - | |
An Empirical Exploration of Recurrent Network Architectures | Rafal Jozefowicz, Wojciech Zaremba, Ilya Sutskever | - | |
LSTM: A Search Space Odyssey | Klaus Greff, Rupesh K. Srivastava, Jan Koutník, Bas R. Steunebrink, Jürgen Schmidhuber | - | |
Massive Exploration of Neural Machine Translation Architectures | Denny Britz, Anna Goldie, Minh-Thang Luong, Quoc Le | - | |
WaveNet: A Generative Model for Raw Audio | Aäron van den Oord, Sander Dieleman, Heiga Zen, Karen Simonyan, Oriol Vinyals, Alex Graves, Nal Kalchbrenner, Andrew Senior, Koray Kavukcuoglu | Deep generative model of raw audio waveforms | |
How to Generate a Good Word Embedding? | Siwei Lai, Kang Liu, Liheng Xu, Jun Zhao | - | |
Systematic evaluation of CNN advances on the ImageNet | Dmytro Mishkin, Nikolay Sergievskiy, Jiri Matas | - | π |
Efficient Estimation of Word Representations in Vector Space | Tomas Mikolov, Kai Chen, Greg Corrado, Jeffrey Dean | - | π |
Distributed Representations of Words and Phrases and their Compositionality | Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg Corrado, Jeffrey Dean | - | π |
Neural Machine Translation by Jointly Learning to Align and Translate | Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio | - | |
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation | Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio | - | |
Effective Approaches to Attention-based Neural Machine Translation | Minh-Thang Luong, Hieu Pham, Christopher D. Manning | - | |
Training Tips for the Transformer Model | Martin Popel, Ondřej Bojar | - | Scaled dot-product attention is sketched below. |
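To complement the attention and Transformer entries above, a minimal sketch of scaled dot-product attention; the shapes, the mask convention, and the helper name are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, seq_len, d_k); weights sum to 1 over the key axis.
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / d_k ** 0.5
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = F.softmax(scores, dim=-1)
    return weights @ v, weights

# Hypothetical shapes: batch of 2, sequence length 5, dimension 8 (self-attention).
q = torch.randn(2, 5, 8)
out, attn = scaled_dot_product_attention(q, q, q)
```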
Application | Cell | Layers | Size | Vocabulary | Embedding | Learning Rate | Paper |
---|---|---|---|---|---|---|---|
Speech Recognition (large vocabulary) | LSTM | 5, 7 | 600, 1000 | 82K, 500K | -- | -- | Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition |
Speech Recognition | LSTM | 1, 3, 5 | 250 | -- | -- | 0.001 | Speech Recognition with Deep Recurrent Neural Networks |
Machine Translation (seq2seq) | LSTM | 4 | 1000 | Source: 160K, Target: 80K | 1,000 | -- | Sequence to Sequence Learning with Neural Networks |
Image Captioning | LSTM | -- | 512 | -- | 512 | (fixed) | Show and Tell: A Neural Image Caption Generator |
Image Generation | LSTM | -- | 256, 400, 800 | -- | -- | -- | DRAW: A Recurrent Neural Network For Image Generation |
Question Answering | LSTM | 2 | 500 | -- | 300 | -- | A Long Short-Term Memory Model for Answer Sentence Selection in Question Answering |
Text Summarization | GRU | -- | 200 | Source: 119K, Target: 68K | 100 | 0.001 | Sequence-to-Sequence RNNs for Text Summarization |
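To make the hyperparameters above concrete, here is a sketch of an encoder configured like the machine-translation row of the table (4 LSTM layers of size 1000, a 160K source vocabulary, and 1000-dimensional embeddings); the module structure is a plausible PyTorch rendering, not the original paper's code.

```python
import torch
import torch.nn as nn

class Encoder(nn.Module):
    """Encoder mirroring the machine-translation row of the table above:
    4 LSTM layers of size 1000, a 160K source vocabulary, and
    1000-dimensional embeddings (Sutskever et al., 2014)."""
    def __init__(self, vocab=160_000, emb=1000, hidden=1000, layers=4):
        super().__init__()
        self.embed = nn.Embedding(vocab, emb)
        self.lstm = nn.LSTM(emb, hidden, num_layers=layers, batch_first=True)

    def forward(self, tokens):
        # tokens: (batch, seq_len) integer ids
        x = self.embed(tokens)
        outputs, (h, c) = self.lstm(x)
        return outputs, (h, c)

enc = Encoder()
out, state = enc(torch.randint(0, 160_000, (2, 12)))
```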
- The Unreasonable Effectiveness of Recurrent Neural Networks π
- Preprocessing text before using an RNN π
- Sentiment Analysis - A very good tutorial! π
- Understanding LSTM Networks
- WaveNet: A generative model for raw audio
- Word2Vec Tutorial - The Skip-Gram Model π (see the sketch after this list)
- Applying word2vec to Recommenders and Advertising
- Natural Language Processing Key Terms, Explained
- Attention and Augmented Recurrent Neural Networks
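The skip-gram sketch referenced in the Word2Vec bullet above: skip-gram with negative sampling in PyTorch. The vocabulary size, embedding dimension, and batch contents are made up; only the objective follows Mikolov et al.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SkipGram(nn.Module):
    """Skip-gram with negative sampling (word2vec). A minimal sketch;
    vocabulary size and dimensions are hypothetical."""
    def __init__(self, vocab=10_000, dim=128):
        super().__init__()
        self.center = nn.Embedding(vocab, dim)   # input (center-word) vectors
        self.context = nn.Embedding(vocab, dim)  # output (context-word) vectors

    def forward(self, center_ids, context_ids, negative_ids):
        c = self.center(center_ids)                       # (B, D)
        pos = (c * self.context(context_ids)).sum(-1)     # (B,)
        neg = torch.bmm(self.context(negative_ids),       # (B, K, D)
                        c.unsqueeze(-1)).squeeze(-1)      # (B, K)
        # Maximize log sigma(pos) and log sigma(-neg): the standard SGNS objective.
        return -(F.logsigmoid(pos).mean() + F.logsigmoid(-neg).mean())

model = SkipGram()
loss = model(torch.tensor([1, 2]), torch.tensor([3, 4]),
             torch.randint(0, 10_000, (2, 5)))
```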
Paper | Authors | Application | Comment |
---|---|---|---|
Generative Adversarial Nets | Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio | - | π (training-step sketch after this table) |
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks | Alec Radford, Luke Metz, Soumith Chintala | - | π |
Improved Techniques for Training GANs | Tim Salimans, Ian Goodfellow, Wojciech Zaremba, Vicki Cheung, Alec Radford, Xi Chen | - | π |
Fine-Grained Car Detection for Visual Census Estimation | Timnit Gebru, Jonathan Krause, Yilun Wang, Duyun Chen, Jia Deng, Li Fei-Fei | - | |
CycleGAN Face-off | Xiaohan Jin, Ye Qi, Shangxuan Wu | - | π |
Image-to-Image Translation with Conditional Adversarial Networks | Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros | - | π |
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro | - | |
Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks | Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros | - | |
Augmented CycleGAN: Learning Many-to-Many Mappings from Unpaired Data | Amjad Almahairi, Sai Rajeswar, Alessandro Sordoni, Philip Bachman, Aaron Courville | - | π |
StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, Jaegul Choo | - | π |
Least Squares Generative Adversarial Networks | Xudong Mao, Qing Li, Haoran Xie, Raymond Y.K. Lau, Zhen Wang, Stephen Paul Smolley | - | π |
Sampling Generative Networks | Tom White | - | π π |
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network | Wenzhe Shi, Jose Caballero, Ferenc Huszár, Johannes Totz, Andrew P. Aitken, Rob Bishop, Daniel Rueckert, Zehan Wang | - | π |
Instance Normalization: The Missing Ingredient for Fast Stylization | Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky | Replacement for BatchNorm | |
Taming Transformers for High-Resolution Image Synthesis | Patrick Esser, Robin Rombach, Björn Ommer | - | |
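As flagged in the first row of the table, here is a minimal sketch of one GAN training step using the common non-saturating generator loss; the MLP architectures, batch size, and learning rates are arbitrary choices, not the paper's setup.

```python
import torch
import torch.nn as nn

# Minimal GAN training step (Goodfellow et al., 2014). Sizes are arbitrary:
# a 64-dim latent mapped to 784-dim "images" (e.g., flattened 28x28).
G = nn.Sequential(nn.Linear(64, 256), nn.ReLU(), nn.Linear(256, 784), nn.Tanh())
D = nn.Sequential(nn.Linear(784, 256), nn.LeakyReLU(0.2), nn.Linear(256, 1))
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

real = torch.randn(32, 784)  # stand-in for a real data batch

# Discriminator step: push real toward 1, fake toward 0.
fake = G(torch.randn(32, 64)).detach()
loss_d = bce(D(real), torch.ones(32, 1)) + bce(D(fake), torch.zeros(32, 1))
opt_d.zero_grad(); loss_d.backward(); opt_d.step()

# Generator step: make D classify fakes as real (non-saturating loss).
fake = G(torch.randn(32, 64))
loss_g = bce(D(fake), torch.ones(32, 1))
opt_g.zero_grad(); loss_g.backward(); opt_g.step()
```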
- Improved GAN (Semi-supervised GAN)
- Semi-Supervised Learning π
- Attacking Machine Learning with Adversarial Examples
- iGAN: Interactive Image Generation via Generative Adversarial Networks
- Image to Image Demo
- StarGAN - Official PyTorch Implementation π
- CycleGAN and pix2pix in PyTorch
- ganhacks ππ
- Taming Transformers for High-Resolution Image Synthesis
Paper | Authors | Application | Comment |
---|---|---|---|
Feedback Control For Cassie With Deep Reinforcement Learning | Zhaoming Xie, Glen Berseth, Patrick Clary, Jonathan Hurst, Michiel van de Panne | - | π |
Convergence of Optimistic and Incremental Q-Learning | Eyal Even-Dar, Yishay Mansour | Q-Table initialization | π |
Issues in Using Function Approximation for Reinforcement Learning | Sebastian Thrun, Anton Schwartz | - | π |
Deep Reinforcement Learning with Double Q-learning | Hado van Hasselt, Arthur Guez, David Silver | - | π (target computation sketched after this table) |
Prioritized Experience Replay | Tom Schaul, John Quan, Ioannis Antonoglou, David Silver | - | π |
Dueling Network Architectures for Deep Reinforcement Learning | Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas | - | π |
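The Double Q-learning target flagged in the table above, as a minimal PyTorch sketch: the online network selects the next action and the target network evaluates it. The toy linear networks and shapes are assumptions.

```python
import torch

def double_q_targets(q_online, q_target, rewards, next_states, dones, gamma=0.99):
    """Double Q-learning target (van Hasselt et al.): the online network
    chooses the next action, the target network evaluates it."""
    with torch.no_grad():
        next_actions = q_online(next_states).argmax(dim=1, keepdim=True)
        next_values = q_target(next_states).gather(1, next_actions).squeeze(1)
        return rewards + gamma * (1.0 - dones) * next_values

# Hypothetical networks over 4-dim states and 2 actions:
q_online = torch.nn.Linear(4, 2)
q_target = torch.nn.Linear(4, 2)
targets = double_q_targets(q_online, q_target,
                           rewards=torch.zeros(8),
                           next_states=torch.randn(8, 4),
                           dones=torch.zeros(8))
```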
- Deep Traffic
- DeepLearningFlappyBird
- SnakeAI
- Markov Chain Monte Carlo Without all the Bullshit
- World scale inverse reinforcement learning in Google Maps
Paper | Authors | Application | Comment |
---|---|---|---|
SGDR: Stochastic Gradient Descent with Warm Restarts | Ilya Loshchilov & Frank Hutter | - | π (schedule sketched below) |
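The schedule from the SGDR row is sketched below using PyTorch's built-in CosineAnnealingWarmRestarts; the values of T_0 and T_mult are examples, not recommendations from the paper.

```python
import torch

# Cosine annealing with warm restarts (SGDR). T_0 is the length of the
# first cycle in epochs; T_mult doubles each subsequent cycle here.
model = torch.nn.Linear(10, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
sched = torch.optim.lr_scheduler.CosineAnnealingWarmRestarts(opt, T_0=10, T_mult=2)

for epoch in range(70):
    # ... one epoch of training with opt ...
    sched.step()  # anneals the lr toward 0, then restarts it at 0.1
```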
- Optimizers Explained - Adam, Momentum and Stochastic Gradient Descent π
- Tuning the learning rate in Gradient Descent π
- Loss functions
- deepmind research papers
- Reading Barcodes on Hooves: How Deep Learning Is Helping Save Endangered Zebras
- Tesla autopilot
- Attacking Machine Learning with Adversarial Examples
- AI, Deep Learning, and Machine Learning: A Primer
- Deep Learning State of the Art (2020) | MIT Deep Learning Series π
- Better Deep Learning - Train Faster, Reduce Overfitting, and Make Better Predictions
- How to attack a machine learning model?
- Reading game frames in Python with OpenCV - Python Plays GTA V
- CLIP: Connecting Text and Images π
- Introduction to VQGAN+CLIP π π