Mark Rowland

Citado por

	Total	Desde 2019
Citas	4273	4089
Índice h	29	29
Índice i10	40	39

1200

600

300

900

20162017201820192020202120222023202419 41 106 255 418 561 796 1121 931

Acceso público

Ver todo

19 artículos

0 artículos

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Rémi MunosGoogle DeepMindDirección de correo verificada de inria.fr
Will DabneyDeepMindDirección de correo verificada de google.com
Marc G. BellemareReliant AI, prev. Google Brain, DeepMindDirección de correo verificada de reliant.ai
Yunhao TangResearch Scientist, DeepMindDirección de correo verificada de columbia.edu
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindDirección de correo verificada de meta.com
Karl TuylsFounder at H company, ex-Google DeepMind, Prof at University of LiverpoolDirección de correo verificada de hcompany.ai
julien perolatDeepMindDirección de correo verificada de google.com
Adrian WellerDirector of Research, Machine Learning, University of CambridgeDirección de correo verificada de eng.cam.ac.uk
Richard E TurnerProfessor, University of CambridgeDirección de correo verificada de cam.ac.uk
Shayegan OmidshafieiField AI (previously: Google DeepMind, Google Research, MIT)Dirección de correo verificada de google.com
Jiri HronResearch Scientist, Google DeepMindDirección de correo verificada de google.com
Alexander G. D. G. MatthewsDeepMindDirección de correo verificada de google.com
Mohammad Gheshlaghi AzarCohereDirección de correo verificada de google.com
Krzysztof ChoromanskiGoogle Brain Robotics New York & Columbia UniversityDirección de correo verificada de columbia.edu
Clare LyleGoogle DeepMindDirección de correo verificada de deepmind.com
Daniele CalandrielloResearch Scientist, DeepMindDirección de correo verificada de google.com
Zhaohan Daniel GuoDeepMindDirección de correo verificada de google.com
Georgios PiliourasGoogle DeepMind, Singapore University of Technology and DesignDirección de correo verificada de google.com
Bilal PiotGoogle DeepmindDirección de correo verificada de google.com
Daniel HennesResearch Scientist, Google DeepMindDirección de correo verificada de google.com

Seguir

Mark Rowland

Research Scientist, Google DeepMind

Dirección de correo verificada de google.com - Página principal

Reinforcement Learning Statistics Probability


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Distributional reinforcement learning with quantile regression W Dabney, M Rowland, M Bellemare, R Munos Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	808	2018
Gaussian process behaviour in wide deep neural networks AGG Matthews, M Rowland, J Hron, RE Turner, Z Ghahramani arXiv preprint arXiv:1804.11271, 2018	554*	2018
Revisiting fundamentals of experience replay W Fedus, P Ramachandran, R Agarwal, Y Bengio, H Larochelle, ... International conference on machine learning, 3061-3071, 2020	282	2020
Black-box -divergence Minimization JM Hernández-Lobato, Y Li, M Rowland, D Hernández-Lobato, T Bui, ... arXiv preprint arXiv:1511.03243, 2015	264	2015
Mastering the game of Stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022	172	2022
A general theoretical paradigm to understand learning from human preferences MG Azar, ZD Guo, B Piot, R Munos, M Rowland, M Valko, D Calandriello International Conference on Artificial Intelligence and Statistics, 4447-4455, 2024	159	2024
Structured evolution with compact architectures for scalable policy optimization K Choromanski, M Rowland, V Sindhwani, R Turner, A Weller International Conference on Machine Learning, 970-978, 2018	146	2018
α-Rank: Multi-Agent Evaluation by Evolution S Omidshafiei, C Papadimitriou, G Piliouras, K Tuyls, M Rowland, ... Scientific reports 9 (1), 9937, 2019	132	2019
An analysis of categorical distributional reinforcement learning M Rowland, M Bellemare, W Dabney, R Munos, YW Teh International Conference on Artificial Intelligence and Statistics, 29-37, 2018	129	2018
Distributional reinforcement learning MG Bellemare, W Dabney, M Rowland MIT Press, 2023	124	2023
A generalized training approach for multiagent learning P Muller, S Omidshafiei, M Rowland, K Tuyls, J Perolat, S Liu, D Hennes, ... arXiv preprint arXiv:1909.12823, 2019	106	2019
Game Plan: What AI can do for Football, and What Football can do for AI K Tuyls, S Omidshafiei, P Muller, Z Wang, J Connor, D Hennes, I Graham, ... Journal of Artificial Intelligence Research 71, 41-88, 2021	95	2021
Statistics and samples in distributional reinforcement learning M Rowland, R Dadashi, S Kumar, R Munos, MG Bellemare, W Dabney International Conference on Machine Learning, 5528-5536, 2019	95	2019
The unreasonable effectiveness of structured random orthogonal embeddings KM Choromanski, M Rowland, A Weller Advances in neural information processing systems 30, 2017	92	2017
Meta-learning of sequential strategies PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ... arXiv preprint arXiv:1905.03030, 2019	90	2019
Understanding and preventing capacity loss in reinforcement learning C Lyle, M Rowland, W Dabney arXiv preprint arXiv:2204.09560, 2022	82	2022
From poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ... International Conference on Machine Learning, 8525-8535, 2021	82	2021
On the effect of auxiliary tasks on representation dynamics C Lyle, M Rowland, G Ostrovski, W Dabney International Conference on Artificial Intelligence and Statistics, 1-9, 2021	72	2021
The value-improvement path: Towards better representations for reinforcement learning W Dabney, A Barreto, M Rowland, R Dadashi, J Quan, MG Bellemare, ... Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7160-7168, 2021	68	2021
MICo: Improved representations via sampling-based state similarity for Markov decision processes PS Castro, T Kastner, P Panangaden, M Rowland Advances in Neural Information Processing Systems 34, 30113-30126, 2021	57	2021

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores