Seguir
Sebastian Flennerhag
Sebastian Flennerhag
Research Scientist at DeepMind
Dirección de correo verificada de google.com - Página principal
Título
Citado por
Citado por
Año
Meta-learning with warped gradient descent
S Flennerhag, AA Rusu, R Pascanu, F Visin, H Yin, R Hadsell
arXiv preprint arXiv:1909.00025, 2019
1432019
Transferring Knowledge across Learning Processes
S Flennerhag, PG Moreno, ND Lawrence, A Damianou
Seventh International Conference on Learning Representations, 2019
432019
Augmenting correlation structures in spatial data using deep generative models
K Klemmer, A Koshiyama, S Flennerhag
arXiv preprint arXiv:1905.09796, 2019
182019
Breaking the activation function bottleneck through adaptive parameterization
S Flennerhag, H Yin, J Keane, M Elliot
Advances in Neural Information Processing Systems 31, 2018
112018
Discovering diverse nearly optimal policies withsuccessor features
T Zahavy, B O'Donoghue, A Barreto, V Mnih, S Flennerhag, S Singh
arXiv preprint arXiv:2106.00669, 2021
92021
Bootstrapped meta-learning
S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh
arXiv preprint arXiv:2109.04504, 2021
82021
Quantnet: Transferring learning across systematic trading strategies
A Koshiyama, S Flennerhag, SB Blumberg, N Firoozye, P Treleaven
arXiv preprint arXiv:2004.03445, 2020
72020
Temporal difference uncertainties as a signal for exploration
S Flennerhag, JX Wang, P Sprechmann, F Visin, A Galashov, ...
arXiv preprint arXiv:2010.02255, 2020
62020
Introducing symmetries to black box meta reinforcement learning
L Kirsch, S Flennerhag, H van Hasselt, A Friesen, J Oh, Y Chen
Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7202-7210, 2022
52022
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality
T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ...
arXiv preprint arXiv:2205.13521, 2022
2022
Meta-Gradients in Non-Stationary Environments
J Luketina, S Flennerhag, Y Schroecker, D Abel, T Zahavy, S Singh
ICLR Workshop on Agent Learning in Open-Endedness, 2022
2022
Towards Scalable Meta-Learning
S Flennerhag
PQDT-Global, 2021
2021
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–12