George Tucker

Citado por

	Total	Desde 2019
Citas	18539	17118
Índice h	37	35
Índice i10	56	54

5000

2500

1250

3750

2015201620172018201920202021202220232024120 222 323 614 1070 1889 3138 3968 4948 2094

Acceso público

Ver todo

16 artículos

0 artículos

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Sergey LevineUC Berkeley, Physical IntelligenceDirección de correo verificada de eecs.berkeley.edu
Aviral KumarGoogle DeepMindDirección de correo verificada de berkeley.edu
Ofir NachumOpenAIDirección de correo verificada de openai.com
Justin FuUC BerkeleyDirección de correo verificada de berkeley.edu
Bonnie BergerMITDirección de correo verificada de mit.edu
Dieterich LawsonStanford UniversityDirección de correo verificada de stanford.edu
Aurick ZhouWaymoDirección de correo verificada de berkeley.edu
Mohammad NorouziIdeogramDirección de correo verificada de ideogram.ai
Po-Ru LohBrigham and Women's Hospital / Harvard Medical SchoolDirección de correo verificada de broadinstitute.org
Tuomas HaarnojaDeepMindDirección de correo verificada de google.com
Sehoon HaGeorgia Institute of TechnologyDirección de correo verificada de gatech.edu
Chris J. MaddisonUniversity of TorontoDirección de correo verificada de cs.toronto.edu
Łukasz KaiserOpenAI & CNRSDirección de correo verificada de openai.com
Andriy MnihResearch Scientist at Google DeepMindDirección de correo verificada de cs.toronto.edu
Chelsea FinnStanford University, GoogleDirección de correo verificada de cs.stanford.edu
Jie TanGoogle DeepMindDirección de correo verificada de google.com
Henryk MichalewskiGoogleDirección de correo verificada de google.com
Dumitru ErhanDirector of Research @ Google DeepMindDirección de correo verificada de google.com
Jascha Sohl-DicksteinAnthropicDirección de correo verificada de anthropic.com
Afroz MohiuddinGoogle IncDirección de correo verificada de google.com

Seguir

George Tucker

Google Brain

Dirección de correo verificada de google.com - Página principal

Reinforcement Learning


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
Soft actor-critic algorithms and applications T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ... arXiv preprint arXiv:1812.05905, 2018	2373	2018
Offline reinforcement learning: Tutorial, review, and perspectives on open problems S Levine, A Kumar, G Tucker, J Fu arXiv preprint arXiv:2005.01643, 2020	1756*	2020
Efficient Bayesian mixed-model analysis increases association power in large cohorts PR Loh, G Tucker, BK Bulik-Sullivan, BJ Vilhjálmsson, HK Finucane, ... Nature genetics 47 (3), 284-290, 2015	1501	2015
Conservative q-learning for offline reinforcement learning A Kumar, A Zhou, G Tucker, S Levine NeurIPS 2020, 2020	1443	2020
Regularizing neural networks by penalizing confident output distributions G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton ICLR 2017 Workshop, 2017	1187	2017
Stabilizing off-policy q-learning via bootstrapping error reduction A Kumar, J Fu, G Tucker, S Levine NeurIPS 2019, 2019	909	2019
D4rl: Datasets for deep data-driven reinforcement learning J Fu, A Kumar, O Nachum, G Tucker, S Levine arXiv preprint arXiv:2004.07219, 2020	888	2020
Model-based reinforcement learning for atari L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ... ICLR 2020 Spotlight, 2020	876	2020
On variational bounds of mutual information B Poole, S Ozair, A Van Den Oord, A Alemi, G Tucker International Conference on Machine Learning, 5171-5180, 2019	801	2019
Behavior regularized offline reinforcement learning Y Wu, G Tucker, O Nachum arXiv preprint arXiv:1911.11361, 2019	629	2019
Widespread macromolecular interaction perturbations in human genetic disorders N Sahni, S Yi, M Taipale, JIF Bass, J Coulombe-Huntington, F Yang, ... Cell 161 (3), 647-660, 2015	562	2015
Learning to walk via deep reinforcement learning T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine RSS 2019, 2019	495	2019
A quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways M Taipale, G Tucker, J Peng, I Krykbaeva, ZY Lin, B Larsen, H Choi, ... Cell 158 (2), 434-448, 2014	429	2014
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	427	2023
Deep bayesian bandits showdown: An empirical comparison of bayesian deep networks for thompson sampling C Riquelme, G Tucker, J Snoek ICLR 2018, 2018	408*	2018
Sample-efficient reinforcement learning with stochastic ensemble value expansion J Buckman, D Hafner, G Tucker, E Brevdo, H Lee NeurIPS 2018 Oral, 2018	360	2018
Rebar: Low-variance, unbiased gradient estimates for discrete latent variable models G Tucker, A Mnih, CJ Maddison, D Lawson, J Sohl-Dickstein NIPS 2017 Oral, 2017	336	2017
Don't blame the elbo! a linear vae perspective on posterior collapse J Lucas, G Tucker, RB Grosse, M Norouzi Advances in Neural Information Processing Systems 32, 2019	301*	2019
Soft Co-Clustering of Data FW Elliott, R Rohwer, SC Jones, GJ Tucker, CJ Kain, CN Weidert US Patent App. 12/133,902, 2009	294	2009
Methods and devices for ignoring similar audio being received by a system AD Rosen, MJ Rodehorst, GJ Tucker, ALM Challenner US Patent 9,728,188, 2017	243	2017

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores