Seguir
Sergio Gómez Colmenarejo
Sergio Gómez Colmenarejo
Research Engineer, DeepMind
Dirección de correo verificada de google.com
Título
Citado por
Citado por
Año
Learning to learn by gradient descent by gradient descent
M Andrychowicz, M Denil, S Gomez, MW Hoffman, D Pfau, T Schaul, ...
Advances in neural information processing systems 29, 2016
15882016
Hybrid computing using a neural network with dynamic external memory
A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...
Nature 538 (7626), 471-476, 2016
14512016
Policy distillation
AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ...
arXiv preprint arXiv:1511.06295, 2015
4732015
Learning to learn without gradient descent by gradient descent
Y Chen, MW Hoffman, SG Colmenarejo, M Denil, TP Lillicrap, M Botvinick, ...
International Conference on Machine Learning, 748-756, 2017
221*2017
Learned optimizers that scale and generalize
O Wichrowska, N Maheswaranathan, MW Hoffman, SG Colmenarejo, ...
International Conference on Machine Learning, 3751-3760, 2017
1942017
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...
International conference on machine learning, 2912-2921, 2017
1782017
Acme: A research framework for distributed reinforcement learning
M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ...
arXiv preprint arXiv:2006.00979, 2020
1072020
Rl unplugged: A suite of benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems 33, 7248-7259, 2020
74*2020
Scaling data-driven robotics with reward sketching and batch reinforcement learning
S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ...
arXiv preprint arXiv:1909.12200, 2019
72*2019
Programmable agents
M Denil, SG Colmenarejo, S Cabi, D Saxton, N de Freitas
arXiv preprint arXiv:1706.06383, 2017
592017
Learning awareness models
B Amos, L Dinh, S Cabi, T Rothörl, SG Colmenarejo, A Muldal, T Erez, ...
arXiv preprint arXiv:1804.06318, 2018
392018
The intentional unintentional agent: Learning to solve many continuous control tasks simultaneously
S Cabi, SG Colmenarejo, MW Hoffman, M Denil, Z Wang, N Freitas
Conference on Robot Learning, 207-216, 2017
332017
Task-relevant adversarial imitation learning
K Zolna, S Reed, A Novikov, SG Colmenarejo, D Budden, S Cabi, M Denil, ...
arXiv preprint arXiv:1910.01077, 2019
272019
TF-Replicator: Distributed machine learning for researchers
P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ...
arXiv preprint arXiv:1902.00465, 2019
222019
One-shot high-fidelity imitation: Training large-scale deep nets with rl
TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ...
arXiv preprint arXiv:1810.05017, 2018
162018
A generalist agent
S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ...
arXiv preprint arXiv:2205.06175, 2022
142022
Regularized behavior value estimation
C Gulcehre, SG Colmenarejo, Z Wang, J Sygnowski, T Paine, K Zolna, ...
arXiv preprint arXiv:2103.09575, 2021
102021
Visual imitation with a minimal adversary
S Reed, Y Aytar, Z Wang, T Paine, A van den Oord, T Pfaff, S Gomez, ...
22018
Approximate hubel-wiesel modules and the data structures of neural computation
JZ Leibo, J Cornebise, S Gómez, D Hassabis
arXiv preprint arXiv:1512.08457, 2015
22015
StarCraft II Unplugged: Large Scale Offline Reinforcement Learning
M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ...
Deep RL Workshop NeurIPS 2021, 2021
12021
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20