George Tucker
Google Brain
Verified email at google.com
Title
Cited by
Year
Soft actor-critic algorithms and applications
T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ...
arXiv preprint arXiv:1812.05905, 2018
Cited by: 2373; Year: 2018
Offline reinforcement learning: Tutorial, review, and perspectives on open problems
S Levine, A Kumar, G Tucker, J Fu
arXiv preprint arXiv:2005.01643, 2020
Cited by: 1756*; Year: 2020
Efficient Bayesian mixed-model analysis increases association power in large cohorts
PR Loh, G Tucker, BK Bulik-Sullivan, BJ Vilhjálmsson, HK Finucane, ...
Nature genetics 47 (3), 284-290, 2015
Cited by: 1501; Year: 2015
Conservative Q-learning for offline reinforcement learning
A Kumar, A Zhou, G Tucker, S Levine
NeurIPS 2020, 2020
Cited by: 1443; Year: 2020
Regularizing neural networks by penalizing confident output distributions
G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton
ICLR 2017 Workshop, 2017
Cited by: 1187; Year: 2017
Stabilizing off-policy Q-learning via bootstrapping error reduction
A Kumar, J Fu, G Tucker, S Levine
NeurIPS 2019, 2019
Cited by: 909; Year: 2019
D4RL: Datasets for deep data-driven reinforcement learning
J Fu, A Kumar, O Nachum, G Tucker, S Levine
arXiv preprint arXiv:2004.07219, 2020
Cited by: 888; Year: 2020
Model-based reinforcement learning for Atari
L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ...
ICLR 2020 Spotlight, 2020
Cited by: 876; Year: 2020
On variational bounds of mutual information
B Poole, S Ozair, A Van Den Oord, A Alemi, G Tucker
International Conference on Machine Learning, 5171-5180, 2019
Cited by: 801; Year: 2019
Behavior regularized offline reinforcement learning
Y Wu, G Tucker, O Nachum
arXiv preprint arXiv:1911.11361, 2019
Cited by: 629; Year: 2019
Widespread macromolecular interaction perturbations in human genetic disorders
N Sahni, S Yi, M Taipale, JIF Bass, J Coulombe-Huntington, F Yang, ...
Cell 161 (3), 647-660, 2015
Cited by: 562; Year: 2015
Learning to walk via deep reinforcement learning
T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine
RSS 2019, 2019
Cited by: 495; Year: 2019
A quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways
M Taipale, G Tucker, J Peng, I Krykbaeva, ZY Lin, B Larsen, H Choi, ...
Cell 158 (2), 434-448, 2014
Cited by: 429; Year: 2014
Gemini: A family of highly capable multimodal models
Gemini Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 427; Year: 2023
Deep Bayesian bandits showdown: An empirical comparison of Bayesian deep networks for Thompson sampling
C Riquelme, G Tucker, J Snoek
ICLR 2018, 2018
Cited by: 408*; Year: 2018
Sample-efficient reinforcement learning with stochastic ensemble value expansion
J Buckman, D Hafner, G Tucker, E Brevdo, H Lee
NeurIPS 2018 Oral, 2018
Cited by: 360; Year: 2018
REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models
G Tucker, A Mnih, CJ Maddison, D Lawson, J Sohl-Dickstein
NIPS 2017 Oral, 2017
Cited by: 336; Year: 2017
Don't blame the ELBO! A linear VAE perspective on posterior collapse
J Lucas, G Tucker, RB Grosse, M Norouzi
Advances in Neural Information Processing Systems 32, 2019
Cited by: 301*; Year: 2019
Soft Co-Clustering of Data
FW Elliott, R Rohwer, SC Jones, GJ Tucker, CJ Kain, CN Weidert
US Patent App. 12/133,902, 2009
Cited by: 294; Year: 2009
Methods and devices for ignoring similar audio being received by a system
AD Rosen, MJ Rodehorst, GJ Tucker, ALM Challenner
US Patent 9,728,188, 2017
Cited by: 243; Year: 2017
Articles 1–20