Follow
Vikranth Dwaracherla
Vikranth Dwaracherla
Other namesVikranth Reddy Dwaracherla
DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Epistemic neural networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Advances in Neural Information Processing Systems 36, 2795-2823, 2023
1262023
Reinforcement learning, bit by bit
X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen
Foundations and Trends® in Machine Learning 16 (6), 733-865, 2023
832023
Hypermodels for exploration
V Dwaracherla, X Lu, M Ibrahimi, I Osband, Z Wen, B Van Roy
arXiv preprint arXiv:2006.07464, 2020
552020
Motion-based object segmentation based on dense rgb-d scene flow
L Shao, P Shah, V Dwaracherla, J Bohg
IEEE Robotics and Automation Letters 3 (4), 3797-3804, 2018
492018
Approximate thompson sampling via epistemic neural networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Uncertainty in Artificial Intelligence, 1586-1595, 2023
252023
Motion planning for point-to-point navigation of spherical robot using position feedback
V Dwaracherla, S Thakar, L Vachhani, A Gupta, A Yadav, S Modi
IEEE/ASME Transactions on Mechatronics 24 (5), 2416-2426, 2019
242019
The neural testbed: Evaluating joint predictions
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ...
Advances in Neural Information Processing Systems 35, 12554-12565, 2022
212022
Efficient exploration for llms
V Dwaracherla, SM Asghari, B Hao, B Van Roy
arXiv preprint arXiv:2402.00396, 2024
182024
Ensembles for uncertainty estimation: Benefits of prior functions and bootstrapping
V Dwaracherla, Z Wen, I Osband, X Lu, SM Asghari, B Van Roy
arXiv preprint arXiv:2206.03633, 2022
162022
Probabilistic approach for visual homing of a mobile robot in the presence of dynamic obstacles
A Sabnis, GK Arunkumar, V Dwaracherla, L Vachhani
IEEE Transactions on Industrial Electronics 63 (9), 5523-5533, 2016
162016
From predictions to decisions: The importance of joint predictive distributions
Z Wen, I Osband, C Qin, X Lu, M Ibrahimi, V Dwaracherla, M Asghari, ...
arXiv preprint arXiv:2107.09224, 2021
152021
Gradient estimation with simultaneous perturbation and compressive sensing
VS Borkar, VR Dwaracherla, N Sahasrabudhe
Journal of Machine Learning Research 18 (161), 1-27, 2018
132018
Evaluating predictive distributions: Does Bayesian deep learning work?
I Osband, Z Wen, SM Asghari, X Lu, M Ibrahimi, V Dwaracherla, ...
112021
Evaluating high-order predictive distributions in deep learning
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, B Van Roy
Uncertainty in Artificial Intelligence, 1552-1560, 2022
82022
Langevin dqn
V Dwaracherla, B Van Roy
arXiv preprint arXiv:2002.07282, 2020
82020
Parameterized indexed value function for efficient exploration in reinforcement learning
T Tan, Z Xiong, VR Dwaracherla
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5948-5955, 2020
72020
Reinforcement Learning, Bit by Bit. 2021
X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen
URL https://arxiv. org/abs/2103.04047, 0
5
Posterior sampling networks
VR Dwaracherla, B Van Roy, M Ibrahimi
Reinforcement Learning and Decision Making Conference, 366-370, 2019
42019
Discrete time position feedback based steering control for autonomous homing of a mobile robot
VR Dwaracherla, S Thakar, GKA Kumar, L Vachhani
2016 12th IEEE International Conference on Control and Automation (ICCA …, 2016
42016
Exploration using hyper-models
B Van Roy, X Lu, VR Dwaracherla, Z Wen, M Ibrahimi, IDM Osband
US Patent App. 17/639,504, 2022
12022
The system can't perform the operation now. Try again later.
Articles 1–20