Seguir
Edward Hughes
Edward Hughes
Staff Research Engineer, DeepMind
Dirección de correo verificada de google.com - Página principal
Título
Citado por
Citado por
Año
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
476*2019
The hanabi challenge: A new frontier for ai research
N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ...
Artificial Intelligence 280, 103216, 2020
3642020
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
2372019
Inequity aversion improves cooperation in intertemporal social dilemmas
E Hughes, JZ Leibo, M Phillips, K Tuyls, E Dueñez-Guzman, ...
Advances in neural information processing systems 31, 2018
2252018
Learning to follow language instructions with adversarial reward induction
D Bahdanau, F Hill, J Leike, E Hughes, P Kohli, E Grefenstette
arXiv preprint arXiv:1806.01946, 6-9, 2018
188*2018
Open problems in cooperative AI
A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ...
arXiv preprint arXiv:2012.08630, 2020
1762020
Bayesian action decoder for deep multi-agent reinforcement learning
J Foerster, F Song, E Hughes, N Burch, I Dunning, S Whiteson, ...
International Conference on Machine Learning, 1942-1951, 2019
1632019
Causal reasoning from meta-reinforcement learning
I Dasgupta, J Wang, S Chiappa, J Mitrovic, P Ortega, D Raposo, ...
arXiv preprint arXiv:1901.08162, 2019
1182019
Collaborating with humans without human data
DJ Strouse, K McKee, M Botvinick, E Hughes, R Everett
Advances in Neural Information Processing Systems 34, 14502-14515, 2021
1132021
Autocurricula and the emergence of innovation from social interaction: A manifesto for multi-agent intelligence research
JZ Leibo, E Hughes, M Lanctot, T Graepel
arXiv preprint arXiv:1903.00742, 2019
1112019
A generalized training approach for multiagent learning
P Muller, S Omidshafiei, M Rowland, K Tuyls, J Perolat, S Liu, D Hennes, ...
arXiv preprint arXiv:1909.12823, 2019
952019
Evolving intrinsic motivations for altruistic behavior
JX Wang, E Hughes, C Fernando, WM Czarnecki, EA Duéñez-Guzmán, ...
arXiv preprint arXiv:1811.05931, 2018
842018
Social diversity and social preferences in mixed-motive reinforcement learning
KR McKee, I Gemp, B McWilliams, EA Duéñez-Guzmán, E Hughes, ...
arXiv preprint arXiv:2002.02325, 2020
772020
Learning reciprocity in complex sequential social dilemmas
T Eccles, E Hughes, J Kramár, S Wheelwright, JZ Leibo
AAMAS, 1934-1936, 2019
53*2019
Human-Timescale Adaptation in an Open-Ended Task Space
J Bauer, K Baumli, F Behbahani, A Bhoopchand, N Bradley-Schmieg, ...
51*2023
Learning to incentivize other learning agents
J Yang, A Li, M Farajtabar, P Sunehag, E Hughes, H Zha
Advances in Neural Information Processing Systems 33, 15208-15219, 2020
512020
Malthusian reinforcement learning
JZ Leibo, J Perolat, E Hughes, S Wheelwright, AH Marblestone, ...
arXiv preprint arXiv:1812.07019, 2018
452018
Bounds and dynamics for empirical game theoretic analysis
K Tuyls, J Perolat, M Lanctot, E Hughes, R Everett, JZ Leibo, ...
Autonomous Agents and Multi-Agent Systems 34, 1-30, 2020
342020
Negotiating team formation using deep reinforcement learning
Y Bachrach, R Everett, E Hughes, A Lazaridou, JZ Leibo, M Lanctot, ...
Artificial Intelligence 288, 103356, 2020
322020
The connected prescription for form factors in twistor space
A Brandhuber, E Hughes, R Panerai, B Spence, G Travaglini
Journal of High Energy Physics 2016 (11), 1-17, 2016
322016
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20