Dynamic TCP initial windows and congestion control schemes through reinforcement learning X Nie, Y Zhao, Z Li, G Chen, K Sui, J Zhang, Z Ye, D Pei IEEE Journal on Selected Areas in Communications 37 (6), 1231-1247, 2019 | 123 | 2019 |
Practical root cause localization for microservice systems via trace analysis Z Li, J Chen, R Jiao, N Zhao, Z Wang, S Zhang, Y Wu, L Jiang, L Yan, ... 2021 IEEE/ACM 29th International Symposium on Quality of Service (IWQOS), 1-10, 2021 | 83 | 2021 |
{Jump-Starting} multivariate time series anomaly detection for online service systems M Ma, S Zhang, J Chen, J Xu, H Li, Y Lin, X Nie, B Zhou, Y Wang, D Pei 2021 USENIX Annual Technical Conference (USENIX ATC 21), 413-426, 2021 | 63 | 2021 |
Hotspot: Anomaly localization for additive kpis with multi-dimensional attributes Y Sun, Y Zhao, Y Su, D Liu, X Nie, Y Meng, S Cheng, D Pei, S Zhang, ... IEEE Access 6, 10909-10923, 2018 | 62 | 2018 |
Understanding and handling alert storm for online service systems N Zhao, J Chen, X Peng, H Wang, X Wu, Y Zhang, Z Chen, X Zheng, ... Proceedings of the ACM/IEEE 42nd International Conference on Software …, 2020 | 54 | 2020 |
Real-time incident prediction for online service systems N Zhao, J Chen, Z Wang, X Peng, G Wang, Y Wu, F Zhou, Z Feng, X Nie, ... Proceedings of the 28th ACM Joint Meeting on European Software Engineering …, 2020 | 51 | 2020 |
Actionable and interpretable fault localization for recurring failures in online service systems Z Li, N Zhao, M Li, X Lu, L Wang, D Chang, X Nie, L Cao, W Zhang, K Sui, ... Proceedings of the 30th ACM Joint European Software Engineering Conference …, 2022 | 48 | 2022 |
Causal inference-based root cause analysis for online service systems with intervention recognition M Li, Z Li, K Yin, X Nie, W Zhang, K Sui, D Pei Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and …, 2022 | 46 | 2022 |
Fluxrank: A widely-deployable framework to automatically localizing root cause machines for software service failure mitigation P Liu, Y Chen, X Nie, J Zhu, S Zhang, K Sui, M Zhang, D Pei 2019 IEEE 30th International Symposium on Software Reliability Engineering …, 2019 | 44 | 2019 |
BDS: A centralized near-optimal overlay network for inter-datacenter data replication Y Zhang, J Jiang, K Xu, X Nie, MJ Reed, H Wang, G Yao, M Zhang, ... Proceedings of the Thirteenth EuroSys Conference, 1-14, 2018 | 44 | 2018 |
Practical root cause localization for microservice systems via trace analysis. In 2021 IEEE/ACM 29th International Symposium on Quality of Service (IWQOS) Z Li, J Chen, R Jiao, N Zhao, Z Wang, S Zhang, Y Wu, L Jiang, L Yan, ... IEEE, Tokyo, Japan, 1-10, 2021 | 32 | 2021 |
Identifying root-cause metrics for incident diagnosis in online service systems C Wu, N Zhao, L Wang, X Yang, S Li, M Zhang, X Jin, X Wen, X Nie, ... 2021 IEEE 32nd International Symposium on Software Reliability Engineering …, 2021 | 27 | 2021 |
Mining causality graph for automatic web-based service diagnosis X Nie, Y Zhao, K Sui, D Pei, Y Chen, X Qu 2016 IEEE 35th International Performance Computing and Communications …, 2016 | 23 | 2016 |
Bds+: An inter-datacenter data replication system with dynamic bandwidth separation Y Zhang, X Nie, J Jiang, W Wang, K Xu, Y Zhao, MJ Reed, K Chen, ... IEEE/ACM Transactions on Networking 29 (2), 918-934, 2021 | 19 | 2021 |
Reducing web latency through dynamically setting TCP initial window with reinforcement learning X Nie, Y Zhao, D Pei, G Chen, K Sui, J Zhang 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS), 1-10, 2018 | 18 | 2018 |
Tcp wise: One initial congestion window is not enough X Nie, Y Zhao, G Chen, K Sui, Y Chen, D Pei, M Zhang, J Zhang 2017 IEEE 36th International Performance Computing and Communications …, 2017 | 18 | 2017 |
Generic and Robust Performance Diagnosis via Causal Inference for OLTP Database Systems X Lu, Z Xie, Z Li, M Li, X Nie, N Zhao, Q Yu, S Zhang, K Sui, L Zhu, D Pei 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet …, 2022 | 10 | 2022 |
OpsEval: A Comprehensive Task-Oriented AIOps Benchmark for Large Language Models Y Liu, C Pei, L Xu, B Chen, M Sun, Z Zhang, Y Sun, S Zhang, K Wang, ... arXiv preprint arXiv:2310.07637, 2023 | 6 | 2023 |
Piebridge: A cross-dr scale large data transmission scheduling system Y Zhang, K Xu, G Yao, M Zhang, X Nie Proceedings of the 2016 ACM SIGCOMM Conference, 553-554, 2016 | 6 | 2016 |
Mining fluctuation propagation graph among time series with active learning M Li, M Ma, X Nie, K Yin, L Cao, X Wen, Z Yuan, D Wu, G Li, W Liu, ... International Conference on Database and Expert Systems Applications, 220-233, 2022 | 5 | 2022 |