George Tucker

Cited by

	All	Since 2019
Citations	18809	17387
h-index	38	37
i10-index	57	55

Year	Citations
2015	120
2016	224
2017	321
2018	614
2019	1070
2020	1888
2021	3141
2022	3969
2023	4954
2024	2352

Public access

16 articles available
0 articles not available

Based on funding mandates

Co-authors

Sergey Levine - UC Berkeley, Physical Intelligence - Verified email at eecs.berkeley.edu
Aviral Kumar - Google DeepMind - Verified email at berkeley.edu
Ofir Nachum - OpenAI - Verified email at openai.com
Justin Fu - UC Berkeley - Verified email at berkeley.edu
Bonnie Berger - MIT - Verified email at mit.edu
Dieterich Lawson - Stanford University - Verified email at stanford.edu
Aurick Zhou - Waymo - Verified email at berkeley.edu
Mohammad Norouzi - Ideogram - Verified email at ideogram.ai
Po-Ru Loh - Brigham and Women's Hospital / Harvard Medical School - Verified email at broadinstitute.org
Tuomas Haarnoja - DeepMind - Verified email at google.com
Sehoon Ha - Georgia Institute of Technology - Verified email at gatech.edu
Chris J. Maddison - University of Toronto - Verified email at cs.toronto.edu
Łukasz Kaiser - OpenAI & CNRS - Verified email at openai.com
Andriy Mnih - Research Scientist at Google DeepMind - Verified email at cs.toronto.edu
Chelsea Finn - Stanford University, Google - Verified email at cs.stanford.edu
Jie Tan - Google DeepMind - Verified email at google.com
Henryk Michalewski - Google - Verified email at google.com
Dumitru Erhan - Director of Research @ Google DeepMind - Verified email at google.com
Jascha Sohl-Dickstein - Anthropic - Verified email at anthropic.com
Afroz Mohiuddin - Google Inc - Verified email at google.com

George Tucker

Google Brain

Verified email at google.com - Homepage

Reinforcement Learning


Title	Authors	Venue	Cited by	Year
Soft actor-critic algorithms and applications	T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ...	arXiv preprint arXiv:1812.05905, 2018	2392	2018
Offline reinforcement learning: Tutorial, review, and perspectives on open problems	S Levine, A Kumar, G Tucker, J Fu	arXiv preprint arXiv:2005.01643, 2020	1782*	2020
Efficient Bayesian mixed-model analysis increases association power in large cohorts	PR Loh, G Tucker, BK Bulik-Sullivan, BJ Vilhjálmsson, HK Finucane, ...	Nature Genetics 47 (3), 284-290, 2015	1507	2015
Conservative Q-learning for offline reinforcement learning	A Kumar, A Zhou, G Tucker, S Levine	NeurIPS 2020, 2020	1471	2020
Regularizing neural networks by penalizing confident output distributions	G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton	ICLR 2017 Workshop, 2017	1195	2017
Stabilizing off-policy Q-learning via bootstrapping error reduction	A Kumar, J Fu, G Tucker, S Levine	NeurIPS 2019, 2019	925	2019
D4RL: Datasets for deep data-driven reinforcement learning	J Fu, A Kumar, O Nachum, G Tucker, S Levine	arXiv preprint arXiv:2004.07219, 2020	908	2020
Model-based reinforcement learning for Atari	L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ...	ICLR 2020 Spotlight, 2020	878	2020
On variational bounds of mutual information	B Poole, S Ozair, A Van Den Oord, A Alemi, G Tucker	International Conference on Machine Learning, 5171-5180, 2019	806	2019
Behavior regularized offline reinforcement learning	Y Wu, G Tucker, O Nachum	arXiv preprint arXiv:1911.11361, 2019	643	2019
Widespread macromolecular interaction perturbations in human genetic disorders	N Sahni, S Yi, M Taipale, JIF Bass, J Coulombe-Huntington, F Yang, ...	Cell 161 (3), 647-660, 2015	563	2015
Gemini: A family of highly capable multimodal models	Gemini Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...	arXiv preprint arXiv:2312.11805, 2023	503	2023
Learning to walk via deep reinforcement learning	T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine	RSS 2019, 2019	496	2019
A quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways	M Taipale, G Tucker, J Peng, I Krykbaeva, ZY Lin, B Larsen, H Choi, ...	Cell 158 (2), 434-448, 2014	429	2014
Deep Bayesian bandits showdown: An empirical comparison of Bayesian deep networks for Thompson sampling	C Riquelme, G Tucker, J Snoek	ICLR 2018, 2018	408*	2018
Sample-efficient reinforcement learning with stochastic ensemble value expansion	J Buckman, D Hafner, G Tucker, E Brevdo, H Lee	NeurIPS 2018 Oral, 2018	362	2018
REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models	G Tucker, A Mnih, CJ Maddison, D Lawson, J Sohl-Dickstein	NIPS 2017 Oral, 2017	337	2017
Don't blame the ELBO! A linear VAE perspective on posterior collapse	J Lucas, G Tucker, RB Grosse, M Norouzi	Advances in Neural Information Processing Systems 32, 2019	304*	2019
Soft Co-Clustering of Data	FW Elliott, R Rohwer, SC Jones, GJ Tucker, CJ Kain, CN Weidert	US Patent App. 12/133,902, 2009	294	2009
Methods and devices for ignoring similar audio being received by a system	AD Rosen, MJ Rodehorst, GJ Tucker, ALM Challenner	US Patent 9,728,188, 2017	244	2017
