Harm van Seijen

Citado por

	Total	Desde 2019
Citas	1683	1368
Índice h	17	16
Índice i10	22	19

340

170

255

2011201220132014201520162017201820192020202120222023202411 7 15 16 25 29 63 141 156 196 253 313 338 111

Acceso público

Ver todo

5 artículos

0 artículos

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Richard S. SuttonKeen, Amii, and University of AlbertaDirección de correo verificada de richsutton.com
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoDirección de correo verificada de cs.ox.ac.uk
Marco WieringInstitute of Artificial Intelligence and Cognitive Engineering, University of GroningenDirección de correo verificada de rug.nl
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLDirección de correo verificada de google.com
Patrick M. PilarskiUniversity of Alberta, Amii (Alberta Machine Intelligence Institute)Dirección de correo verificada de ualberta.ca
A. Rupam MahmoodUniversity of Alberta, AmiiDirección de correo verificada de ualberta.ca
Marlos C. MachadoUniversity of AlbertaDirección de correo verificada de ualberta.ca

Seguir

Harm van Seijen

Sony AI

Dirección de correo verificada de sony.com

reinforcement learning machine learning representation learning


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Reducing network agnostophobia AR Dhamija, M Günther, T Boult Advances in Neural Information Processing Systems 31, 2018	341	2018
A theoretical and empirical analysis of Expected Sarsa H Van Seijen, H Van Hasselt, S Whiteson, M Wiering 2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009	268	2009
Hybrid reward architecture for reinforcement learning H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang Advances in Neural Information Processing Systems 30, 2017	256	2017
True online TD (lambda) H Seijen, R Sutton International Conference on Machine Learning, 692-700, 2014	126	2014
True online temporal-difference learning H Van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton Journal of Machine Learning Research 17 (145), 1-40, 2016	110	2016
Systematic generalisation with group invariant predictions F Ahmed, Y Bengio, H Van Seijen, A Courville International Conference on Learning Representations, 2020	97	2020
A Deeper Look at Planning as Learning from Replay H van Seijen, RS Sutton International Conference on Machine Learning, 2015	76	2015
Planning by prioritized sweeping with small backups H Van Seijen, R Sutton International Conference on Machine Learning, 361-369, 2013	58*	2013
Modular lifelong reinforcement learning via neural composition JA Mendez, H van Seijen, E Eaton arXiv preprint arXiv:2207.00429, 2022	38	2022
Hybrid reward architecture for reinforcement learning HH Van Seijen, SMF Booshehri, RMH Laroche, JS Romoff US Patent 10,977,551, 2021	38	2021
Using a logarithmic mapping to enable lower discount factors in reinforcement learning H Van Seijen, M Fatemi, A Tavakoli Advances in Neural Information Processing Systems 32, 2019	29	2019
Exploiting Best-Match Equations for Efficient Reinforcement Learning. H van Seijen, S Whiteson, H van Hasselt, M Wiering Journal of Machine Learning Research 12 (6), 2011	27	2011
On value function representation of long horizon problems L Lehnert, R Laroche, H van Seijen Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	25	2018
Multi-advisor reinforcement learning R Laroche, M Fatemi, J Romoff, H van Seijen arXiv preprint arXiv:1704.00756, 2017	25	2017
Effective multi-step temporal-difference learning for non-linear function approximation H van Seijen arXiv preprint arXiv:1608.05151, 2016	23	2016
Dead-ends and secure exploration in reinforcement learning M Fatemi, S Sharma, H Van Seijen, SE Kahou International Conference on Machine Learning, 1873-1881, 2019	20	2019
Efficient abstraction selection in reinforcement learning H van Seijen, S Whiteson, L Kester Computational Intelligence 30 (4), 657-699, 2014	18	2014
Learning invariances for policy generalization R Tachet, P Bachman, H van Seijen arXiv preprint arXiv:1809.02591, 2018	17	2018
Separation of concerns in reinforcement learning H van Seijen, M Fatemi, J Romoff, R Laroche arXiv preprint arXiv:1612.05159, 2016	13	2016
Towards evaluating adaptivity of model-based reinforcement learning methods Y Wan, A Rahimi-Kalahroudi, J Rajendran, I Momennejad, S Chandar, ... International Conference on Machine Learning, 22536-22561, 2022	11	2022

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores