Romain Laroche

Citado por

	Total	Desde 2019
Citas	1916	1461
Índice h	22	18
Índice i10	46	31

360

180

270

20102011201220132014201520162017201820192020202120222023202412 16 3 7 35 34 105 109 129 169 219 237 285 343 207

Acceso público

Ver todo

9 artículos

0 artículos

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Remi Tachet des CombesDirección de correo verificada de alpacaml.com
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Dirección de correo verificada de univ-lille.fr
Harm van SeijenSony AIDirección de correo verificada de sony.com
Layla El AsriResearch Lead at Borealis AIDirección de correo verificada de borealisai.com
Raphaël FéraudOrange LabsDirección de correo verificada de orange.com
Steve YoungProfessor of Information EngineeringDirección de correo verificada de eng.cam.ac.uk
Oliver LemonProfessor of Artificial Intelligence, Heriot-Watt University, Edinburgh, Director of Interaction LabDirección de correo verificada de hw.ac.uk
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Dirección de correo verificada de univ-lorraine.fr
Bilal PiotGoogle DeepmindDirección de correo verificada de google.com
julien perolatDeepMindDirección de correo verificada de google.com
Julia VelkovskaVanderbilt UniversityDirección de correo verificada de vanderbilt.edu

Seguir

Romain Laroche

Microsoft Research

Dirección de correo verificada de polytechnique.org - Página principal

Reinforcement Learning Dialogue Systems


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Hybrid reward architecture for reinforcement learning H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang Advances in Neural Information Processing Systems 30, 2017	290	2017
Safe policy improvement with baseline bootstrapping R Laroche, P Trichelair, RT Des Combes International conference on machine learning, 3652-3661, 2019	226	2019
Learning dynamic belief graphs to generalize on text-based games A Adhikari, X Yuan, MA Côté, M Zelinka, MA Rondeau, R Laroche, ... Advances in Neural Information Processing Systems 33, 3045-3057, 2020	104	2020
Contextual bandit for active learning: Active thompson sampling D Bouneffouf, R Laroche, T Urvoy, R Féraud, R Allesiardo Neural Information Processing: 21st International Conference, ICONIP 2014 …, 2014	99	2014
Transfer reinforcement learning with shared dynamics R Laroche, M Barlier Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	62	2017
When does return-conditioned supervised learning work for offline reinforcement learning? D Brandfonbrener, A Bietti, J Buckman, R Laroche, J Bruna Advances in Neural Information Processing Systems 35, 1542-1553, 2022	60	2022
Counting to explore and generalize in text-based games X Yuan, MA Côté, A Sordoni, R Laroche, RT Combes, M Hausknecht, ... arXiv preprint arXiv:1806.11525, 2018	60	2018
Score-based inverse reinforcement learning L El Asri, B Piot, M Geist, R Laroche, O Pietquin International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2016	49	2016
Hybrid reward architecture for reinforcement learning HH Van Seijen, SMF Booshehri, RMH Laroche, JS Romoff US Patent 10,977,551, 2021	43	2021
Reinforcement learning algorithm selection R Laroche, R Feraud ICLR, 2018	41	2018
Safe policy improvement with soft baseline bootstrapping K Nadjahi, R Laroche, R Tachet des Combes Machine Learning and Knowledge Discovery in Databases: European Conference …, 2020	37	2020
Transfer Learning for User Adaptation in Spoken Dialogue Systems. A Genevay, R Laroche AAMAS, 975-983, 2016	32	2016
Human-machine dialogue as a stochastic game M Barlier, J Perolat, R Laroche, O Pietquin 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015	32	2015
NASTIA: Negotiating Appointment Setting Interface. L El Asri, R Lemonnier, R Laroche, O Pietquin, H Khouzaimi LREC, 266-271, 2014	30	2014
Reward function learning for dialogue management L El Asri, R Laroche, O Pietquin STAIRS 2012, 95-106, 2012	30	2012
Reward shaping for statistical optimisation of dialogue management L El Asri, R Laroche, O Pietquin Statistical Language and Speech Processing: First International Conference …, 2013	29	2013
Decentralized exploration in multi-armed bandits R Féraud, R Alami, R Laroche International Conference on Machine Learning, 1901-1909, 2019	28	2019
Multi-advisor reinforcement learning R Laroche, M Fatemi, J Romoff, H van Seijen arXiv preprint arXiv:1704.00756, 2017	28	2017
Safe policy improvement with an estimated baseline policy TD Simão, R Laroche, RT Combes International Foundation for Autonomous Agents and Multi-Agent Systems, 2019	27	2019
On value function representation of long horizon problems L Lehnert, R Laroche, H van Seijen Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	27	2018

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores