Shantanu Thakoor
Research Engineer at DeepMind
Verified email at google.com
Title
Cited by
Year
The Marabou framework for verification and analysis of deep neural networks
G Katz, DA Huang, D Ibeling, K Julian, C Lazarus, R Lim, P Shah, ...
Computer Aided Verification: 31st International Conference, CAV 2019, New …, 2019
Cited by: 563 | Year: 2019
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 481 | Year: 2023
Large-scale representation learning on graphs via bootstrapping
S Thakoor, C Tallec, MG Azar, M Azabou, EL Dyer, R Munos, P Veličković, ...
arXiv preprint arXiv:2102.06514, 2021
Cited by: 336* | Year: 2021
Adversarial examples for natural language classification problems
V Kuleshov, S Thakoor, T Lau, S Ermon
Cited by: 93 | Year: 2018
Counterfactual credit assignment in model-free reinforcement learning
T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ...
arXiv preprint arXiv:2011.09464, 2020
Cited by: 64 | Year: 2020
BYOL-Explore: Exploration by bootstrapped prediction
Z Guo, S Thakoor, M Pîslar, B Avila Pires, F Altché, C Tallec, A Saade, ...
Advances in neural information processing systems 35, 31855-31870, 2022
Cited by: 53 | Year: 2022
Geometric entropic exploration
ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ...
arXiv preprint arXiv:2101.02055, 2021
Cited by: 38 | Year: 2021
Large-scale graph representation learning with very deep GNNs and self-supervision
R Addanki, PW Battaglia, D Budden, A Deac, J Godwin, T Keck, WLS Li, ...
arXiv preprint arXiv:2107.09422, 2021
Cited by: 27 | Year: 2021
Understanding self-predictive learning for reinforcement learning
Y Tang, ZD Guo, PH Richemond, BA Pires, Y Chandak, R Munos, ...
International Conference on Machine Learning, 33632-33656, 2023
Cited by: 21 | Year: 2023
Learning to play Othello without human knowledge
S Thakoor, S Nair, M Jhunjhunwala
Stanford University CS238 Final Project Report 204, 257, 2017
Cited by: 18 | Year: 2017
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 11 | Year: 2024
Half-Hop: A graph upsampling approach for slowing down message passing
M Azabou, V Ganesh, S Thakoor, CH Lin, L Sathidevi, R Liu, M Valko, ...
International Conference on Machine Learning, 1341-1360, 2023
Cited by: 6 | Year: 2023
Generalised policy improvement with geometric policy composition
S Thakoor, M Rowland, D Borsa, W Dabney, R Munos, A Barreto
International Conference on Machine Learning, 21272-21307, 2022
Cited by: 6 | Year: 2022
BLaDE: Robust exploration via diffusion models
B Piot, ZD Guo, S Thakoor, MG Azar
Deep Reinforcement Learning Workshop NeurIPS 2022, 2022
Cited by: 3 | Year: 2022
Learning behavior representations through multi-timescale bootstrapping
M Azabou, M Mendelson, M Sorokin, S Thakoor, N Ahad, C Urzay, ...
arXiv preprint arXiv:2206.07041, 2022
Cited by: 3 | Year: 2022
Synthesis of programs from multimodal datasets
S Thakoor, S Shah, G Ramakrishnan, A Sanyal
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 3 | Year: 2018
Relax, it doesn’t matter how you get there: A new self-supervised approach for multi-timescale behavior analysis
M Azabou, M Mendelson, N Ahad, M Sorokin, S Thakoor, C Urzay, E Dyer
Advances in Neural Information Processing Systems 36, 2024
Cited by: 1 | Year: 2024
Representations and exploration for deep reinforcement learning using singular value decomposition
Y Chandak, S Thakoor, ZD Guo, Y Tang, R Munos, W Dabney, DL Borsa
International Conference on Machine Learning, 4009-4034, 2023
Cited by: 1 | Year: 2023
Quantifying and Understanding Adversarial Examples in Discrete Input Spaces
V Kuleshov, E Nikishin, S Thakoor, T Lau, S Ermon
arXiv preprint arXiv:2112.06276, 2021
Year: 2021
Large-scale node classification with bootstrapping
R Addanki, PW Battaglia, D Budden, A Deac, M DeepMind, J Godwin, ...
Articles 1–20