A. Krizhevsky, I. Sutskever, and G. E. Hinton, ImageNet classification with deep convolutional neural networks, Advances in Neural Information Processing Systems (NIPS), 2012.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

T. Durand, T. Mordan, N. Thome, and M. Cord, WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
URL : https://hal.archives-ouvertes.fr/hal-01515640

J. Dai, Y. Li, K. He, and J. Sun, R-FCN: Object detection via region-based fully convolutional networks, Advances in Neural Information Processing Systems (NIPS), 2016.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, You only look once: Unified, real-time object detection, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.

T. Mordan, N. Thome, M. Cord, and G. Henaff, Deformable Part-based Fully Convolutional Network for Object Detection, British Machine Vision Conference (BMVC), 2017.
URL : https://hal.archives-ouvertes.fr/hal-01637070

L. C. Chen, G. Papandreou, I. Kokkinos, K. Murphy, and A. L. Yuille, DeepLab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs, IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018.

M. Engilberge, L. Chevallier, P. Pérez, and M. Cord, Finding beans in burgers: Deep semantic-visual embedding with localization, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
URL : https://hal.archives-ouvertes.fr/hal-02171857

M. Carvalho, R. Cadène, D. Picard, L. Soulier, N. Thome et al., Crossmodal retrieval in the cooking context: Learning semantic text-image embeddings, Special Interest Group on Information Retrieval, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01931470

H. Ben-Younes, R. Cadène, N. Thome, and M. Cord, MUTAN: Multimodal Tucker fusion for visual question answering, IEEE International Conference on Computer Vision (ICCV), 2017.
URL : https://hal.archives-ouvertes.fr/hal-02073637

A. Krogh and J. A. Hertz, A simple weight decay can improve generalization, Advances in Neural Information Processing Systems (NIPS), 1992.

S. Ioffe and C. Szegedy, Batch normalization: Accelerating deep network training by reducing internal covariate shift, International Conference on Machine Learning (ICML), 2015.

N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Dropout: A simple way to prevent neural networks from overfitting, Journal of Machine Learning Research, 2014.

M. Blot, T. Robert, N. Thome, and M. Cord, SHADE: Information-based regularization for deep learning, IEEE International Conference on Image Processing (ICIP), 2018.
URL : https://hal.archives-ouvertes.fr/hal-01994740

Y. Bengio, P. Lamblin, D. Popovici, and H. Larochelle, Greedy layer-wise training of deep networks, Advances in Neural Information Processing Systems (NIPS), 2007.

G. E. Hinton and R. R. Salakhutdinov, Reducing the dimensionality of data with neural networks, Science, 2006.

J. Zhao, M. Mathieu, R. Goroshin, and Y. LeCun, Stacked What-Where Autoencoders, International Conference on Learning Representations Workshop, 2016.

Y. Zhang, K. Lee, and H. Lee, Augmenting supervised neural networks with unsupervised objectives for large-scale image classification, International Conference on Machine Learning (ICML), 2016.

A. Rasmus, M. Berglund, M. Honkala, H. Valpola, and T. Raiko, Semi-supervised learning with ladder networks, Advances in Neural Information Processing Systems (NIPS), 2015.

S. Mallat, Group invariant scattering, Communications on Pure and Applied Mathematics (CPAM), 2012.

J. Bruna and S. Mallat, Invariant scattering convolution networks, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2013.

A. Bietti and J. Mairal, Group Invariance, Stability to Deformations, and Complexity of Deep Convolutional Representations, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01536004

M. Sajjadi, M. Javanmardi, and T. Tasdizen, Regularization with stochastic transformations and perturbations for deep semi-supervised learning, Advances in Neural Information Processing Systems (NIPS), 2016.

S. Laine and T. Aila, Temporal ensembling for semi-supervised learning, International Conference on Learning Representations (ICLR), 2017.

A. Tarvainen and H. Valpola, Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results, Advances in Neural Information Processing Systems (NIPS), 2017.

X. Zhu, Semi-supervised learning literature survey, Computer Sciences, 2005.

M. Ranzato and M. Szummer, Semi-supervised learning of compact document representations with deep networks, International Conference on Machine Learning (ICML), 2008.

M. Ranzato, F. J. Huang, Y. L. Boureau, and Y. LeCun, Unsupervised learning of invariant feature hierarchies with applications to object recognition, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2007.

P. Vincent, H. Larochelle, Y. Bengio, and P. A. Manzagol, Extracting and composing robust features with denoising autoencoders, International Conference on Machine Learning (ICML), 2008.

M. Ranzato, C. Poultney, S. Chopra, and Y. LeCun, Efficient learning of sparse representations with an energy-based model, Advances in Neural Information Processing Systems (NIPS), 2007.

H. Larochelle and Y. Bengio, Classification using discriminative restricted Boltzmann machines, International Conference on Machine Learning (ICML), 2008.

D. P. Kingma, S. Mohamed, D. J. Rezende, and M. Welling, Semi-supervised learning with deep generative models, Advances in Neural Information Processing Systems (NIPS), 2014.

D. Erhan, Y. Bengio, A. Courville, P. A. Manzagol, P. Vincent et al., Why does unsupervised pre-training help deep learning, Journal of Machine Learning Research, 2010.

H. Goh, N. Thome, M. Cord, and J. H. Lim, Top-down regularization of deep belief networks, Advances in Neural Information Processing Systems (NIPS), 2013.
URL : https://hal.archives-ouvertes.fr/hal-00947569

I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley et al., Generative adversarial nets, Advances in Neural Information Processing Systems (NIPS), 2014.

A. Makhzani, J. Shlens, N. Jaitly, and I. Goodfellow, Adversarial autoencoders, International Conference on Learning Representations (ICLR), 2016.

T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning, 2009.

C. Thériault, N. Thome, and M. Cord, Dynamic Scene Classification: Learning Motion Descriptors with Slow Features Analysis, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013.

I. J. Goodfellow, J. Shlens, and C. Szegedy, Explaining and harnessing adversarial examples, International Conference on Learning Representations (ICLR), 2015.

T. Miyato, S. I. Maeda, M. Koyama, K. Nakae, and S. Ishii, Distributional smoothing with virtual adversarial training, International Conference on Learning Representations (ICLR), 2016.

Z. Wojna, V. Ferrari, S. Guadarrama, N. Silberman, L. C. Chen et al., The devil is in the decoder, British Machine Vision Conference (BMVC), 2017.

V. Dumoulin and F. Visin, A guide to convolution arithmetic for deep learning, 2016.

A. N. Gomez, M. Ren, R. Urtasun, and R. B. Grosse, The reversible residual network: Backpropagation without storing activations, Advances in Neural Information Processing Systems (NIPS), 2017.

J. H. Jacobsen, A. Smeulders, and E. Oyallon, i-RevNet: Deep invertible networks, International Conference on Learning Representations (ICLR), 2018.

W. Sweldens, The lifting scheme: A new philosophy in biorthogonal wavelet constructions, Wavelet Applications in Signal and Image Processing III, 1995.

S. G. Mallat and G. Peyré, A wavelet tour of signal processing: the sparse way, 2009.

M. D. Zeiler and R. Fergus, Visualizing and understanding convolutional networks, European Conference on Computer Vision (ECCV), 2014.

Y. Netzer, T. Wang, A. Coates, A. Bissacco, B. Wu et al., Reading digits in natural images with unsupervised feature learning, NIPS workshop on deep learning and unsupervised feature learning, 2011.

A. Krizhevsky and G. Hinton, Learning multiple layers of features from tiny images, 2009.

A. Coates, A. Ng, and H. Lee, An analysis of single-layer networks in unsupervised feature learning, International Conference on Artificial Intelligence and Statistics (AISTATS), 2011.

X. Gastaldi, Shake-shake regularization of 3-branch residual networks, International Conference on Learning Representations Workshop, 2017.

J. T. Springenberg, Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks, International Conference on Learning Representations (ICLR), 2016.

T. Salimans, I. Goodfellow, W. Zaremba, V. Cheung, A. Radford et al., Improved techniques for training GANs, Advances in Neural Information Processing Systems (NIPS), 2016.