Zongzhang Zhang

Citado por

	Total	Desde 2019
Citas	1237	1129
Índice h	17	16
Índice i10	30	27

300

150

225

2013201420152016201720182019202020212022202320247 9 8 23 6 46 86 155 183 258 300 147

Acceso público

Ver todo

41 artículos

1 artículo

disponibles

no disponibles

Basado en requisitos de financiación

Coautores

Yan Zheng (郑岩)Tianjin UniversityDirección de correo verificada de tju.edu.cn
Yingfeng Chen(陈赢峰)Fuxi AI Lab in NeteaseDirección de correo verificada de mail.ustc.edu.cn
Tianpei YangUniversity of AlbertaDirección de correo verificada de ualberta.ca
Wulong LiuHuawei Noah's Ark LabDirección de correo verificada de huawei.com
Mykel J. KochenderferAssociate Professor, Stanford UniversityDirección de correo verificada de stanford.edu
David HsuProfessor of Computer Science, National University of SingaporeDirección de correo verificada de comp.nus.edu.sg
Wee Sun LeeProfessor, Department of Computer Science, National University of SingaporeDirección de correo verificada de comp.nus.edu.sg
Aijun BaiGoogle ResearchDirección de correo verificada de google.com
Yuzheng ZhuangSenior Researcher @ Huawei Noah's Ark LabDirección de correo verificada de huawei.com
Feng WuAssociate Professor, University of Science and Technology of ChinaDirección de correo verificada de ustc.edu.cn
Michael LittmanBrown UniversityDirección de correo verificada de brown.edu
Zhan Wei LimNational University of SingaporeDirección de correo verificada de comp.nus.edu.sg
Jianye HaoTianjin University

Seguir

Zongzhang Zhang

Nanjing University

Dirección de correo verificada de nju.edu.cn - Página principal

Artificial Intelligence Reinforcement Learning Probabilistic Planning Imitation Learning


Título Ordenar por citas Ordenar por año Ordenar por título	Citado por Citado por	Año
A survey on deep reinforcement learning Q Liu, JW Zhai, ZZ Zhang, S Zhong, Q Zhou, P Zhang, J Xu Chinese Journal of Computers 41 (1), 1-27, 2018	182	2018
深度强化学习综述刘全，翟建伟，章宗长，钟珊，周倩，章鹏，徐进计算机学报 41 (1), 1-27, 2018	103	2018
Weighted double Q-learning Z Zhang, Z Pan, MJ Kochenderfer IJCAI-2017, 3455-3461, 2017	99	2017
A deep Bayesian policy reuse approach against non-stationary agents Y Zheng, Z Meng, J Hao, Z Zhang, T Yang, C Fan NeurIPS-2018, 954-964, 2018	82	2018
Hierarchical deep multiagent reinforcement learning with temporal abstraction H Tang, J Hao, T Lv, Y Chen, Z Zhang, H Jia, C Ren, Y Zheng, Z Meng, ... arXiv preprint arXiv:1809.09332, 2018	74	2018
A survey on deep reinforcement learning L Quan, Z Jianwei, Z Zongchang, Z Shan, Z Qian Chinese Journal of Computers 41 (01), 1-27, 2018	53	2018
Weighted double deep multiagent reinforcement learning in stochastic cooperative environments Y Zheng, Z Meng, J Hao, Z Zhang PRICAI-2018, 421-429, 2018	45	2018
Multi-Agent Incentive Communication via Decentralized Teammate Modeling L Yuan, J Wang, F Zhang, C Wang, Z Zhang, Y Yu, C Zhang AAAI-2022, 9466-9474, 2022	35	2022
Deep Q-learning with prioritized sampling J Zhai, Q Liu, Z Zhang, S Zhong, H Zhu, P Zhang, C Sun ICONIP-2016, 13-22, 2016	33	2016
Efficient deep reinforcement learning via adaptive policy transfer T Yang, J Hao, Z Meng, Z Zhang, Y Hu, Y Chen, C Fan, W Wang, W Liu, ... IJCAI-2020, 3094-3100, 2020	31	2020
Triple-GAIL: A multi-modal imitation learning framework with generative adversarial Nets C Fei, B Wang, Y Zhuang, Z Zhang, J Hao, H Zhang, X Ji, W Liu IJCAI-2020, 2929-2935, 2020	28	2020
Thompson sampling based Monte-Carlo planning in POMDPs A Bai, F Wu, Z Zhang, X Chen ICAPS-2014, 28-36, 2014	25	2014
Covering number for efficient heuristic-based POMDP planning Z Zhang, D Hsu, WS Lee ICML-2014, 28-36, 2014	25	2014
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy FM Luo, S Jiang, Y Yu, Z Zhang, YF Zhang AAAI-2022, 7637-7646, 2022	22	2022
Covering number as a complexity measure for POMDP planning and learning Z Zhang, M Littman, X Chen AAAI-2012, 1853-1859, 2012	21	2012
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data F Zhang, C Jia, YC Li, L Yuan, Y Yu, Z Zhang ICLR-2023, 2023	19	2023
Multi-agent Dynamic Algorithm Configuration K Xue, J Xu, L Yuan, M Li, C Qian, Z Zhang, Y Yu NeurIPS-2022, 20147-20161, 2022	19	2022
Efficient Multi-agent Communication via Self-supervised Information Aggregation C Guan, F Chen, L Yuan, C Wang, H Yin, Z Zhang, Y Yu NeurIPS-2022, 1020-1033, 2022	17	2022
Efficient policy detecting and reusing for non-stationarity in markov games Y Zheng, J Hao, Z Zhang, Z Meng, T Yang, Y Li, C Fan Autonomous Agents and Multi-Agent Systems 35, 1-29, 2021	16	2021
Adaptive Online Packing-guided Search for POMDPs C Wu, G Yang, Z Zhang, Y Yu, D Li, W Liu NeurIPS-2021, 28419-28430, 2021	14	2021

El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.

Artículos 1–20

Citas por año

Citas duplicadas

Citas combinadas

Añadir coautoresCoautores

Seguir

Citado por

Coautores