Accelerating sparse dnn models without hardware-support via tile-wise sparsity C Guo, BY Hsueh, J Leng, Y Qiu, Y Guan, Z Wang, X Jia, X Li, M Guo, ... Proceedings of the International Conference for High Performance Computing …, 2020 | 61 | 2020 |
Adversarial defense through network profiling based path extraction Y Qiu, J Leng, C Guo, Q Chen, C Li, M Guo, Y Zhu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 50 | 2019 |
SQuant: On-the-fly data-free quantization via diagonal hessian approximation C Guo, Y Qiu, J Leng, X Gao, C Zhang, Y Liu, F Yang, Y Zhu, M Guo arXiv preprint arXiv:2202.07471, 2022 | 35 | 2022 |
Dual-side sparse tensor core Y Wang, C Zhang, Z Xie, C Guo, Y Liu, J Leng 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021 | 33 | 2021 |
Balancing efficiency and flexibility for DNN acceleration via temporal GPU-systolic array integration C Guo, Y Zhou, J Leng, Y Zhu, Z Du, Q Chen, C Li, B Yao, M Guo 2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020 | 20 | 2020 |
Characterizing and demystifying the implicit convolution algorithm on commercial matrix-multiplication accelerators Y Zhou, M Yang, C Guo, J Leng, Y Liang, Q Chen, M Guo, Y Zhu 2021 IEEE International Symposium on Workload Characterization (IISWC), 214-225, 2021 | 19 | 2021 |
Ant: Exploiting adaptive numerical data type for low-bit deep neural network quantization C Guo, C Zhang, J Leng, Z Liu, F Yang, Y Liu, M Guo, Y Zhu 2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO …, 2022 | 8 | 2022 |
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training C Guo, Y Qiu, J Leng, C Zhang, Y Cao, Q Zhang, Y Liu, F Yang, M Guo 2022 IEEE 40th International Conference on Computer Design (ICCD), 738-745, 2022 | 3 | 2022 |
Efficient activation quantization via adaptive rounding border for post-training quantization Z Li, C Guo, Z Zhu, Y Zhou, Y Qiu, X Gao, J Leng, M Guo arXiv preprint arXiv:2208.11945, 2022 | 3 | 2022 |
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization C Guo, J Tang, W Hu, J Leng, C Zhang, F Yang, Y Liu, M Guo, Y Zhu Proceedings of the 50th Annual International Symposium on Computer …, 2023 | 2 | 2023 |
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs Y Zhou, Y Song, J Leng, Z Liu, W Cui, Z Zhang, C Guo, Q Chen, L Li, ... arXiv preprint arXiv:2305.17408, 2023 | | 2023 |
Towards Reliable AI Applications via Algorithm-Based Fault Tolerance on NVDLA MT Sanic, C Guo, J Leng, M Guo, W Ma 2022 18th International Conference on Mobility, Sensing and Networking (MSN …, 2022 | | 2022 |