Delta-DNN: Efficiently compressing deep neural networks via exploiting floats similarity | Z Hu, X Zou, W Xia, S Jin, D Tao, Y Liu, W Zhang, Z Zhang | Proceedings of the 49th International Conference on Parallel Processing, 1-12, 2020 | 17 | 2020 |
Smart-DNN+: A Memory-efficient Neural Networks Compression Framework for the Model Inference | D Wu, W Yang, X Zou, W Xia, S Li, Z Hu, W Zhang, B Fang | ACM Transactions on Architecture and Code Optimization 20 (4), 1-24, 2023 | 3 | 2023 |