Seguir
George Tucker
George Tucker
Google Brain
Dirección de correo verificada de google.com - Página principal
Título
Citado por
Citado por
Año
Soft actor-critic algorithms and applications
T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ...
arXiv preprint arXiv:1812.05905, 2018
23922018
Offline reinforcement learning: Tutorial, review, and perspectives on open problems
S Levine, A Kumar, G Tucker, J Fu
arXiv preprint arXiv:2005.01643, 2020
1782*2020
Efficient Bayesian mixed-model analysis increases association power in large cohorts
PR Loh, G Tucker, BK Bulik-Sullivan, BJ Vilhjálmsson, HK Finucane, ...
Nature genetics 47 (3), 284-290, 2015
15072015
Conservative q-learning for offline reinforcement learning
A Kumar, A Zhou, G Tucker, S Levine
NeurIPS 2020, 2020
14712020
Regularizing neural networks by penalizing confident output distributions
G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton
ICLR 2017 Workshop, 2017
11952017
Stabilizing off-policy q-learning via bootstrapping error reduction
A Kumar, J Fu, G Tucker, S Levine
NeurIPS 2019, 2019
9252019
D4rl: Datasets for deep data-driven reinforcement learning
J Fu, A Kumar, O Nachum, G Tucker, S Levine
arXiv preprint arXiv:2004.07219, 2020
9082020
Model-based reinforcement learning for atari
L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ...
ICLR 2020 Spotlight, 2020
8782020
On variational bounds of mutual information
B Poole, S Ozair, A Van Den Oord, A Alemi, G Tucker
International Conference on Machine Learning, 5171-5180, 2019
8062019
Behavior regularized offline reinforcement learning
Y Wu, G Tucker, O Nachum
arXiv preprint arXiv:1911.11361, 2019
6432019
Widespread macromolecular interaction perturbations in human genetic disorders
N Sahni, S Yi, M Taipale, JIF Bass, J Coulombe-Huntington, F Yang, ...
Cell 161 (3), 647-660, 2015
5632015
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
5032023
Learning to walk via deep reinforcement learning
T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine
RSS 2019, 2019
4962019
A quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways
M Taipale, G Tucker, J Peng, I Krykbaeva, ZY Lin, B Larsen, H Choi, ...
Cell 158 (2), 434-448, 2014
4292014
Deep bayesian bandits showdown: An empirical comparison of bayesian deep networks for thompson sampling
C Riquelme, G Tucker, J Snoek
ICLR 2018, 2018
408*2018
Sample-efficient reinforcement learning with stochastic ensemble value expansion
J Buckman, D Hafner, G Tucker, E Brevdo, H Lee
NeurIPS 2018 Oral, 2018
3622018
Rebar: Low-variance, unbiased gradient estimates for discrete latent variable models
G Tucker, A Mnih, CJ Maddison, D Lawson, J Sohl-Dickstein
NIPS 2017 Oral, 2017
3372017
Don't blame the elbo! a linear vae perspective on posterior collapse
J Lucas, G Tucker, RB Grosse, M Norouzi
Advances in Neural Information Processing Systems 32, 2019
304*2019
Soft Co-Clustering of Data
FW Elliott, R Rohwer, SC Jones, GJ Tucker, CJ Kain, CN Weidert
US Patent App. 12/133,902, 2009
2942009
Methods and devices for ignoring similar audio being received by a system
AD Rosen, MJ Rodehorst, GJ Tucker, ALM Challenner
US Patent 9,728,188, 2017
2442017
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20