Michal Valko

Cited by

	Total	Since 2019
Citations	11621	10774
h-index	43	39
i10-index	102	95

Citations per year

Year	Citations
2011	36
2012	25
2013	63
2014	61
2015	107
2016	141
2017	167
2018	199
2019	321
2020	604
2021	1367
2022	2659
2023	3427
2024	2380

Public access

53 articles available
0 articles not available

Based on funding mandates

Coauthors

Rémi Munos - Google DeepMind - Verified email at inria.fr
Mohammad Gheshlaghi Azar - Cohere - Verified email at google.com
Bilal Piot - Google Deepmind - Verified email at google.com
Daniele Calandriello - Research Scientist, DeepMind - Verified email at google.com
Corentin Tallec - DeepMind - Verified email at google.com
Jean-bastien Grill - Verified email at google.com
Zhaohan Daniel Guo - DeepMind - Verified email at google.com
Alessandro Lazaric - Research Scientist, Facebook Artificial Intelligence Research - Verified email at inria.fr
Florent Altché - Research Engineer, DeepMind - Verified email at google.com
Pierre Ménard - OvGU Magdeburg - Verified email at inria.fr
Florian STRUB - Cohere - Verified email at cohere.com
Pierre Richemond - Google DeepMind - Verified email at deepmind.com
Emilie Kaufmann - CNRS & Univ. Lille (CRIStAL) - Verified email at inria.fr
Yunhao Tang - Research Scientist, DeepMind - Verified email at columbia.edu
Omar Darwiche Domingues - Cohere - Verified email at cohere.com
Branislav Kveton - Amazon - Verified email at amazon.com
Milos Hauskrecht - Professor of Computer Science, University of Pittsburgh - Verified email at pitt.edu
Mark Rowland - Research Scientist, Google DeepMind - Verified email at google.com
Matteo Pirotta - Research Scientist, Meta (FAIR) - Verified email at fb.com
Shantanu Thakoor - Research Engineer at DeepMind - Verified email at google.com


Michal Valko

Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind

Verified email at meta.com

fine-tuning LLMs, RL with human feedback, deep reinforcement learning


Title	Authors	Venue	Cited by	Year
Bootstrap your own latent: A new approach to self-supervised learning	JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...	Neural Information Processing Systems, 2020	6108	2020
Large-scale representation learning on graphs via bootstrapping	S Thakoor, C Tallec, MG Azar, R Munos, P Veličković, M Valko	International Conference on Learning Representations, 2022	393*	2022
Finite-time analysis of kernelised contextual bandits	M Valko, N Korda, R Munos, I Flaounas, N Cristianini	Uncertainty in Artificial Intelligence, 2013	275	2013
Outlier detection for patient monitoring and alerting	M Hauskrecht, I Batal, M Valko, S Visweswaran, GF Cooper, G Clermont	Journal of Biomedical Informatics, 2013	175	2013
A general theoretical paradigm to understand learning from human preferences	MG Azar, M Rowland, B Piot, D Guo, D Calandriello, M Valko, R Munos	International Conference on Artificial Intelligence and Statistics, 2024	159	2024
Online influence maximization under independent cascade model with semi-bandit feedback	Z Wen, B Kveton, M Valko, S Vaswani	Neural Information Processing Systems, 2017	148*	2017
Stochastic simultaneous optimistic optimization	M Valko, A Carpentier, R Munos	International Conference on Machine Learning, 2013	139	2013
Spectral bandits for smooth graph functions	M Valko, R Munos, B Kveton, T Kocák	International Conference on Machine Learning, 2014	132	2014
Broaden your views for self-supervised video learning	A Recasens, P Luc, JB Alayrac, L Wang, F Strub, C Tallec, M Malinowski, ...	International Conference on Computer Vision, 2021	129	2021
Efficient learning by implicit exploration in bandit problems with side observations	T Kocák, G Neu, M Valko, R Munos	Neural Information Processing Systems, 2014	128	2014
Episodic reinforcement learning in finite MDPs: Minimax lower bounds revisited	O Darwiche Domingues, P Ménard, E Kaufmann, M Valko	Algorithmic Learning Theory, 2021	117	2021
Black-box optimization of noisy functions with unknown smoothness	JB Grill, M Valko, R Munos	Neural Information Processing Systems, 2015	110	2015
Simple regret for infinitely many armed bandits	A Carpentier, M Valko	International Conference on Machine Learning, 2015	102	2015
Game Plan: What AI can do for Football, and What Football can do for AI	K Tuyls, S Omidshafiei, P Muller, Z Wang, J Connor, D Hennes, I Graham, ...	Journal of Artificial Intelligence Research 71, 41-88, 2021	95	2021
BYOL works even without batch statistics	PH Richemond, JB Grill, F Altché, C Tallec, F Strub, A Brock, S Smith, ...	NeurIPS 2020 Workshop: Self-Supervised Learning - Theory and Practice, 2020	94	2020
Adaptive reward-free exploration	E Kaufmann, P Ménard, OD Domingues, A Jonsson, E Leurent, M Valko	Algorithmic Learning Theory, 2021	89	2021
Gamification of pure exploration for linear bandits	R Degenne, P Ménard, X Shang, M Valko	International Conference on Machine Learning, 2020	88	2020
Gaussian process optimization with adaptive sketching: Scalable and no regret	D Calandriello, L Carratino, A Lazaric, M Valko, L Rosasco	Conference on Learning Theory, 2019	83	2019
Fast active learning for pure exploration in reinforcement learning	P Ménard, OD Domingues, A Jonsson, E Kaufmann, E Leurent, M Valko	International Conference on Machine Learning, 2021	78	2021
Monte-Carlo tree search as regularized policy optimization	JB Grill, F Altché, Y Tang, T Hubert, M Valko, I Antonoglou, R Munos	International Conference on Machine Learning, 2020	73	2020


Articles 1–20
