Volodymyr Mnih

Cited by

	Total	Since 2019
Citations	71381	58376
h-index	33	30
i10-index	38	35

Citations per year

Year	Citations
2014	217
2015	580
2016	1683
2017	3275
2018	6195
2019	8815
2020	10302
2021	11621
2022	12158
2023	12214
2024	3264

Co-authors

Koray Kavukcuoglu	DeepMind	Verified email at kavukcuoglu.org
David Silver	DeepMind, UCL	Verified email at google.com
Alex Graves	University of Toronto	Verified email at cs.toronto.edu
Geoffrey Hinton	Emeritus Prof. Computer Science, University of Toronto	Verified email at cs.toronto.edu
Steven Hansen	Senior Research Scientist, DeepMind	Verified email at google.com
David Warde-Farley	Staff Research Scientist at Google DeepMind	Verified email at google.com
Marc'Aurelio Ranzato	DeepMind	Verified email at google.com
Brendan O'Donoghue	Stanford University, Google DeepMind	Verified email at alumni.stanford.edu
Tejas Kulkarni	DeepMind, Massachusetts Institute of Technology	Verified email at mit.edu
Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Hugo Larochelle	Google DeepMind & Mila	Verified email at google.com
Richard Zemel	Professor of Computer Science, University of Toronto	Verified email at cs.toronto.edu
Jonathan Taylor	Senior Scientist at perceptiveIO, Inc	Verified email at cs.toronto.edu


Volodymyr Mnih

DeepMind

Verified email at cs.toronto.edu

Machine Learning


Title	Cited by	Year
Human-level control through deep reinforcement learning. V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ... Nature 518 (7540), 529-533, 2015	29568	2015
Playing Atari with Deep Reinforcement Learning. V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ... arXiv preprint arXiv:1312.5602, 2013	14427	2013
Asynchronous methods for deep reinforcement learning. V Mnih, AP Badia, M Mirza, A Graves, T Lillicrap, T Harley, D Silver, ... International Conference on Machine Learning, 1928-1937, 2016	10808	2016
Recurrent Models of Visual Attention. V Mnih, N Heess, A Graves, K Kavukcuoglu. Advances in Neural Information Processing Systems, 2204-2212, 2014	4551	2014
IMPALA: Scalable distributed Deep-RL with importance weighted actor-learner architectures. L Espeholt, H Soyer, R Munos, K Simonyan, V Mnih, T Ward, Y Doron, ... arXiv preprint arXiv:1802.01561, 2018	1526	2018
Reinforcement learning with unsupervised auxiliary tasks. M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016	1350	2016
Multiple Object Recognition with Visual Attention. J Ba, V Mnih, K Kavukcuoglu. arXiv preprint arXiv:1412.7755, 2014	1263	2014
Sample Efficient Actor-Critic with Experience Replay. Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ... arXiv preprint arXiv:1611.01224, 2016	958	2016
Machine Learning for Aerial Image Labeling. V Mnih. University of Toronto, 2013	879	2013
Policy Distillation. AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ... arXiv preprint arXiv:1511.06295, 2015	744	2015
Learning to detect roads in high-resolution aerial images. V Mnih, GE Hinton. European Conference on Computer Vision, 210-223, 2010	743	2010
Massively Parallel Methods for Deep Reinforcement Learning. A Nair, P Srinivasan, S Blackwell, C Alcicek, R Fearon, A De Maria, ... arXiv preprint arXiv:1507.04296, 2015	612	2015
Learning to Label Aerial Images from Noisy Data. V Mnih, GE Hinton. Proceedings of the 29th International Conference on Machine Learning (ICML …, 2012	501	2012
Learning by Playing - Solving Sparse Reward Tasks from Scratch. M Riedmiller, R Hafner, T Lampe, M Neunert, J Degrave, T Van de Wiele, ... arXiv preprint arXiv:1802.10567, 2018	458	2018
On deep generative models with applications to recognition. MA Ranzato, J Susskind, V Mnih, G Hinton. Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on …, 2011	297	2011
Empirical Bernstein stopping. V Mnih, C Szepesvári, JY Audibert. Proceedings of the 25th International Conference on Machine Learning, 672-679, 2008	265	2008
Using Fast Weights to Attend to the Recent Past. J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu. Advances in Neural Information Processing Systems, 4331-4339, 2016	257	2016
Methods and Apparatus for Reinforcement Learning. V Mnih, K Kavukcuoglu. US Patent 20,150,100,530, 2015	218	2015
The Uncertainty Bellman Equation and Exploration. B O'Donoghue, I Osband, R Munos, V Mnih. arXiv preprint arXiv:1709.05380, 2017	208	2017
Unsupervised learning of object keypoints for perception and control. TD Kulkarni, A Gupta, C Ionescu, S Borgeaud, M Reynolds, A Zisserman, ... Advances in Neural Information Processing Systems 32, 10724-10734, 2019	200	2019


Articles 1–20
