Safe exploration in continuous action spaces G Dalal, K Dvijotham, M Vecerik, T Hester, C Paduraru, Y Tassa arXiv preprint arXiv:1801.08757, 2018 | 519 | 2018 |
Finite Sample Analyses for TD (0) with Function Approximation G Dalal, B Szörényi, G Thoppe, S Mannor Association for the Advancement of Artificial Intelligence (AAAI) 2018, 2018 | 191 | 2018 |
Finite sample analysis of two-timescale stochastic approximation with applications to reinforcement learning G Dalal, B Szorenyi, G Thoppe, S Mannor 31st Annual Conference on Learning Theory (COLT) 75, 1-35, 2018 | 126 | 2018 |
A tale of two-timescale reinforcement learning with the tightest finite-time bound G Dalal, B Szorenyi, G Thoppe Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 3701-3708, 2020 | 52 | 2020 |
Beyond the one step greedy approach in reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Proceedings of The 35th International Conference on Machine Learning (ICML 2018), 2018 | 46 | 2018 |
Supervised Learning for Optimal Power Flow as a Real-Time Proxy R Canyasse, G Dalal, S Mannor IEEE PES Innovative Smart Grid Technologies (ISGT 2017) 8, 2017 | 45 | 2017 |
Anomaly Detection in Large Databases Using Behavioral Patterning H Mazzawi, G Dalal, D Rozenblat, L Ein-Dor, M Ninio, O Lavi 2017 IEEE 33rd International Conference on Data Engineering (ICDE 2017), 2017 | 45 | 2017 |
Multiple-step greedy policies in approximate and online reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Advances in Neural Information Processing Systems (NIPS 2018), 5238-5247, 2018 | 41 | 2018 |
How to combine tree-search methods in reinforcement learning Y Efroni, G Dalal, B Scherrer, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2019) 33 …, 2019 | 39 | 2019 |
Hierarchical Decision Making In Electricity Grid Management G Dalal, E Gilboa, S Mannor Proceedings of The 33rd International Conference on Machine Learning (ICML …, 2016 | 37 | 2016 |
Reinforcement learning for datacenter congestion control C Tessler, Y Shpigelman, G Dalal, A Mandelbaum, D Haritan Kazakov, ... ACM SIGMETRICS Performance Evaluation Review 49 (2), 43-46, 2022 | 36 | 2022 |
Chance-constrained outage scheduling using a machine learning proxy G Dalal, E Gilboa, S Mannor, L Wehenkel IEEE Transactions on Power Systems 34 (4), 2019 | 36 | 2019 |
Acting in Delayed Environments with Non-Stationary Markov Policies E Derman, G Dalal, S Mannor International Conference on Learning Representations (ICLR), 2021 | 31 | 2021 |
Unit commitment using nearest neighbor as a short-term proxy G Dalal, E Gilboa, S Mannor, L Wehenkel 20th Power Systems Computation Conference (PSCC'18), 2018 | 31 | 2018 |
Reinforcement learning for the unit commitment problem G Dalal, S Mannor 2015 IEEE Eindhoven PowerTech, 1-6, 2015 | 31 | 2015 |
The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems A Inci, E Bolotin, Y Fu, G Dalal, S Mannor, D Nellans, D Marculescu EMC2 (The Sixth Workshop on Energy Efficient Machine Learning and Cognitive …, 2020 | 18 | 2020 |
On covariate shift of latent confounders in imitation and reinforcement learning G Tennenholtz, A Hallak, G Dalal, S Mannor, G Chechik, U Shalit arXiv preprint arXiv:2110.06539, 2021 | 16 | 2021 |
Concentration Bounds for Two Timescale Stochastic Approximation with Applications to Reinforcement Learning G Dalal, B Szorenyi, G Thoppe, S Mannor arXiv preprint arXiv:1703.05376, 2017 | 13 | 2017 |
Improve agents without retraining: Parallel tree search with off-policy correction G Dalal, A Hallak, S Dalton, S Mannor, G Chechik Advances in Neural Information Processing Systems 34, 5518-5530, 2021 | 8 | 2021 |
Finite sample analysis for TD (0) with linear function approximation G Dalal, B Szörényi, G Thoppe, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2018), 2018 | 8 | 2018 |