Generalizable episodic memory for deep reinforcement learning H Hu, J Ye, G Zhu, Z Ren, C Zhang Thirty-eighth International Conference on Machine Learning (ICML 2021), 2021 | 39 | 2021 |
Metacure: Meta reinforcement learning with empowerment-driven exploration J Zhang, J Wang, H Hu, T Chen, Y Chen, C Fan, C Zhang Thirty-eighth International Conference on Machine Learning (ICML 2021 …, 2021 | 38* | 2021 |
Offline Reinforcement Learning with Value-based Episodic Memory X Ma*, Y Yang*, H Hu*, Q Liu, J Yang, C Zhang, Q Zhao, B Liang Tenth International Conference on Learning Representations (ICLR 2022), 2021 | 34 | 2021 |
Maximize to explore: One objective function fusing estimation, planning, and exploration Z Liu, M Lu, W Xiong, H Zhong, H Hu, S Zhang, S Zheng, Z Yang, Z Wang Advances in Neural Information Processing Systems 36, 2024 | 20* | 2024 |
On the Estimation Bias in Double Q-Learning Z Ren, G Zhu, H Hu, B Han, J Chen, C Zhang Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS …, 2021 | 14 | 2021 |
On the Role of Discount Factor in Offline Reinforcement Learning H Hu, Y Yang, Q Zhao, C Zhang Thirty-ninth International Conference on Machine Learning (ICML 2022), 2022 | 12 | 2022 |
What is essential for unseen goal generalization of offline goal-conditioned RL? R Yang, L Yong, X Ma, H Hu, C Zhang, T Zhang International Conference on Machine Learning, 39543-39571, 2023 | 11 | 2023 |
Reason for future, act for now: A principled framework for autonomous llm agents with provable sample efficiency Z Liu, H Hu, S Zhang, H Guo, S Ke, B Liu, Z Wang arXiv preprint arXiv:2309.17382, 2023 | 10* | 2023 |
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery Y Yang*, H Hu*, W Li*, S Li, J Yang, Q Zhao, C Zhang Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023, 2022 | 8 | 2022 |
The provable benefits of unsupervised data sharing for offline reinforcement learning H Hu, Y Yang, Q Zhao, C Zhang arXiv preprint arXiv:2302.13493, 2023 | 7 | 2023 |
Unsupervised behavior extraction via random intent priors H Hu, Y Yang, J Ye, Z Mai, C Zhang Advances in Neural Information Processing Systems 36, 2024 | 2 | 2024 |
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets Y Mao, C Wu, X Chen, H Hu, J Jiang, T Zhou, T Lv, C Fan, Z Hu, Y Wu, ... The Twelfth International Conference on Learning Representations, 2023 | | 2023 |
Bayesian Offline-to-Online Reinforcement Learning: A Realist Approach H Hu, Y Yang, J Ye, Z Mai, Y Hu, T Lv, C Fan, Q Zhao, C Zhang | | 2023 |
Query-Efficient Offline Preference-Based Reinforcement Learning via In-Dataset Exploration H Hu, Y Yang, J Zhang, S Wang, B Liu, Y Gao, C Zhang | | 2023 |