Human-level control through deep reinforcement learning V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ... Nature 518 (7540), 529-533, 2015 | 20359 | 2015 |
Playing Atari with Deep Reinforcement Learning V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ... arXiv preprint arXiv:1312.5602, 2013 | 9581 | 2013 |
Asynchronous methods for deep reinforcement learning V Mnih, AP Badia, M Mirza, A Graves, T Lillicrap, T Harley, D Silver, ... International Conference on Machine Learning, 1928-1937, 2016 | 7258 | 2016 |
Recurrent Models of Visual Attention V Mnih, N Heess, A Graves, K Kavukcuoglu Advances in Neural Information Processing Systems, 2204-2212, 2014 | 3264 | 2014 |
Multiple Object Recognition with Visual Attention J Ba, V Mnih, K Kavukcuoglu arXiv preprint arXiv:1412.7755, 2014 | 1077 | 2014 |
Reinforcement learning with unsupervised auxiliary tasks M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016 | 1053 | 2016 |
IMPALA: Scalable distributed Deep-RL with importance weighted actor-learner architectures L Espeholt, H Soyer, R Munos, K Simonyan, V Mnih, T Ward, Y Doron, ... arXiv preprint arXiv:1802.01561, 2018 | 998 | 2018 |
Sample Efficient Actor-Critic with Experience Replay Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ... arXiv preprint arXiv:1611.01224, 2016 | 722 | 2016 |
Learning to detect roads in high-resolution aerial images V Mnih, GE Hinton European Conference on Computer Vision, 210-223, 2010 | 610 | 2010 |
Machine Learning for Aerial Image Labeling V Mnih University of Toronto, 2013 | 573 | 2013 |
Policy Distillation AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ... arXiv preprint arXiv:1511.06295, 2015 | 489 | 2015 |
Massively Parallel Methods for Deep Reinforcement Learning A Nair, P Srinivasan, S Blackwell, C Alcicek, R Fearon, A De Maria, ... arXiv preprint arXiv:1507.04296, 2015 | 474 | 2015 |
Noisy networks for exploration M Fortunato, MG Azar, B Piot, J Menick, I Osband, A Graves, V Mnih, ... arXiv preprint arXiv:1706.10295, 2017 | 451 | 2017 |
Learning to Label Aerial Images from Noisy Data V Mnih, GE Hinton Proceedings of the 29th International Conference on Machine Learning (ICML …, 2012 | 425 | 2012 |
Learning by Playing-Solving Sparse Reward Tasks from Scratch M Riedmiller, R Hafner, T Lampe, M Neunert, J Degrave, T Van de Wiele, ... arXiv preprint arXiv:1802.10567, 2018 | 313 | 2018 |
On deep generative models with applications to recognition MA Ranzato, J Susskind, V Mnih, G Hinton Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on …, 2011 | 263 | 2011 |
Empirical bernstein stopping V Mnih, C Szepesvári, JY Audibert Proceedings of the 25th international conference on Machine learning, 672-679, 2008 | 236 | 2008 |
Using Fast Weights to Attend to the Recent Past J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu Advances In Neural Information Processing Systems, 4331-4339, 2016 | 176 | 2016 |
Conditional Restricted Boltzmann Machines for Structured Output Prediction V Mnih, H Larochelle, GE Hinton The 27th Conference on Uncertainty in Artificial Intelligence, 2011 | 154 | 2011 |
Strategic Attentive Writer for Learning Macro-Actions AS Vezhnevets, V Mnih, J Agapiou, S Osindero, A Graves, O Vinyals, ... arXiv preprint arXiv:1606.04695, 2016 | 151 | 2016 |