Cong Guo
Cited by
Cited by
Accelerating sparse dnn models without hardware-support via tile-wise sparsity
C Guo, BY Hsueh, J Leng, Y Qiu, Y Guan, Z Wang, X Jia, X Li, M Guo, ...
Proceedings of the International Conference for High Performance Computing …, 2020
Adversarial defense through network profiling based path extraction
Y Qiu, J Leng, C Guo, Q Chen, C Li, M Guo, Y Zhu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
SQuant: On-the-fly data-free quantization via diagonal hessian approximation
C Guo, Y Qiu, J Leng, X Gao, C Zhang, Y Liu, F Yang, Y Zhu, M Guo
arXiv preprint arXiv:2202.07471, 2022
Dual-side sparse tensor core
Y Wang, C Zhang, Z Xie, C Guo, Y Liu, J Leng
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021
Balancing efficiency and flexibility for DNN acceleration via temporal GPU-systolic array integration
C Guo, Y Zhou, J Leng, Y Zhu, Z Du, Q Chen, C Li, B Yao, M Guo
2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020
Characterizing and demystifying the implicit convolution algorithm on commercial matrix-multiplication accelerators
Y Zhou, M Yang, C Guo, J Leng, Y Liang, Q Chen, M Guo, Y Zhu
2021 IEEE International Symposium on Workload Characterization (IISWC), 214-225, 2021
Ant: Exploiting adaptive numerical data type for low-bit deep neural network quantization
C Guo, C Zhang, J Leng, Z Liu, F Yang, Y Liu, M Guo, Y Zhu
2022 55th IEEE/ACM International Symposium on Microarchitecture (MICRO …, 2022
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training
C Guo, Y Qiu, J Leng, C Zhang, Y Cao, Q Zhang, Y Liu, F Yang, M Guo
2022 IEEE 40th International Conference on Computer Design (ICCD), 738-745, 2022
Efficient activation quantization via adaptive rounding border for post-training quantization
Z Li, C Guo, Z Zhu, Y Zhou, Y Qiu, X Gao, J Leng, M Guo
arXiv preprint arXiv:2208.11945, 2022
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization
C Guo, J Tang, W Hu, J Leng, C Zhang, F Yang, Y Liu, M Guo, Y Zhu
Proceedings of the 50th Annual International Symposium on Computer …, 2023
AdaptGear: Accelerating GNN Training via Adaptive Subgraph-Level Kernels on GPUs
Y Zhou, Y Song, J Leng, Z Liu, W Cui, Z Zhang, C Guo, Q Chen, L Li, ...
arXiv preprint arXiv:2305.17408, 2023
Towards Reliable AI Applications via Algorithm-Based Fault Tolerance on NVDLA
MT Sanic, C Guo, J Leng, M Guo, W Ma
2022 18th International Conference on Mobility, Sensing and Networking (MSN …, 2022
The system can't perform the operation now. Try again later.
Articles 1–12