Hidden community detection in social networks K He, Y Li, S Soundarajan, JE Hopcroft Information Sciences 425, 92-106, 2018 | 112 | 2018 |
HyperDQN: A randomized exploration method for deep reinforcement learning Z Li, Y Li, Y Zhang, T Zhang, ZQ Luo International Conference on Learning Representations, 2021 | 16 | 2021 |
Divergence-augmented policy optimization Q Wang, Y Li, J Xiong, T Zhang Advances in Neural Information Processing Systems 32, 2019 | 12 | 2019 |
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent Y Li, J Xu, L Han, ZQ Luo Forty-first International Conference on Machine Learning, 2024 | 6* | 2024 |
Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation Y Li, J Xu, ZQ Luo arXiv preprint arXiv:2407.13195, 2024 | | 2024 |
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation Y Li, Z Luo International Conference on Artificial Intelligence and Statistics, 559-567, 2024 | | 2024 |
Radar Anti-jamming Strategy Learning via Domain-knowledge Enhanced Online Convex Optimization L Liu, W Pu, Y Li, B Jiu, ZQ Luo arXiv preprint arXiv:2402.16274, 2024 | | 2024 |
Probability Tools for Sequential Random Projection Y Li arXiv preprint arXiv:2402.14026, 2024 | | 2024 |
Simple, unified analysis of Johnson-Lindenstrauss with applications Y Li arXiv preprint arXiv:2402.10232, 2024 | | 2024 |
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games Y Li, L Liu, W Pi, H Liang, ZQ Luo arXiv preprint arXiv:2402.09456, 2024 | | 2024 |
Scalable and Provable Exploration via HyperAgent for Foundation Model Decision-making Y Li, J Xu, ZQ Luo Automated Reinforcement Learning: Exploring Meta-Learning, AutoML, and LLMs, 0 | | |