Xiao Hu
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
Fault diagnosis using novel AdaBoost based discriminant locality preserving projection with resamples
YL He, Y Zhao, X Hu, XN Yan, QX Zhu, Y Xu
Engineering Applications of Artificial Intelligence 91, 103631, 2020
Cited by: 56; Year: 2020
Mind the gap: Offline policy optimization for imperfect rewards
J Li*, X Hu*, H Xu, J Liu, X Zhan, QS Jia, YQ Zhang
International Conference on Learning Representations (ICLR), 2023
Cited by: 15; Year: 2023
PROTO: Iterative Policy Regularized Offline-to-Online Reinforcement Learning
J Li, X Hu, H Xu, J Liu, X Zhan, YQ Zhang
arXiv preprint arXiv:2305.15669, 2023
Cited by: 8; Year: 2023
Query-Policy Misalignment in Preference-Based Reinforcement Learning
X Hu, J Li, X Zhan, QS Jia, YQ Zhang
International Conference on Learning Representations (ICLR), 2024, 2023
Cited by: 3; Year: 2023
Novel L2-Discriminant Locality Preserving Projection Integrated with Adaboost and Its Application to Fault Diagnosis
X Hu, Y Zhao, Y Xu, YL He, QX Zhu
2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS …, 2020
Cited by: 2; Year: 2020
Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning
S Huang, Q Gallouédec, F Felten, A Raffin, RFJ Dossa, Y Zhao, ...
arXiv preprint arXiv:2402.03046, 2024
Cited by: 1; Year: 2024
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
J Li, J Zheng, Y Zheng, L Mao, X Hu, S Cheng, H Niu, J Liu, Y Liu, J Liu, ...
arXiv preprint arXiv:2402.18137, 2024
Year: 2024
Vehicle Extreme Control based on Offline Reinforcement Learning
S Zhao, J Li, X Hu, J Zhang, C He
2022 China Automation Congress (CAC), 4539-4543, 2022
Year: 2022
Reinforcement learning methods for green and reliable operation of data centers
贾庆山, 唐静娴, 吴俊杰, 胡潇, 林依挺, 夏恒
智能科学与技术学报 2 (4), 341-347