Nyströmformer: A nyström-based algorithm for approximating self-attention Y Xiong, Z Zeng, R Chakraborty, M Tan, G Fung, Y Li, V Singh Proceedings of the AAAI conference on artificial intelligence 35 (16), 14138 …, 2021 | 517 | 2021 |
You only sample (almost) once: Linear cost self-attention via bernoulli sampling Z Zeng, Y Xiong, S Ravi, S Acharya, GM Fung, V Singh International conference on machine learning, 12321-12332, 2021 | 25 | 2021 |
Large-field-of-view visualization utilizing multiple miniaturized cameras for laparoscopic surgery JJ Kim, A Watras, H Liu, Z Zeng, JA Greenberg, CP Heise, YH Hu, H Jiang Micromachines 9 (9), 431, 2018 | 23 | 2018 |
Multi resolution analysis (mra) for approximate self-attention Z Zeng, S Pal, J Kline, GM Fung, V Singh International Conference on Machine Learning, 25955-25972, 2022 | 10 | 2022 |
VCC: scaling transformers to 128k tokens or more by prioritizing important tokens Z Zeng, C Hawkins, M Hong, A Zhang, N Pappas, V Singh, S Zheng Advances in Neural Information Processing Systems 36, 20260-20286, 2023 | 9 | 2023 |
Framequant: Flexible low-bit quantization for transformers H Adepu, Z Zeng, L Zhang, V Singh arXiv preprint arXiv:2403.06082, 2024 | 7 | 2024 |
Lookupffn: making transformers compute-lite for cpu inference Z Zeng, M Davies, P Pulijala, K Sankaralingam, V Singh International Conference on Machine Learning, 40707-40718, 2023 | 6 | 2023 |
Parallax mitigation for real-time close field video stitching A Watras, J Ke, Z Zeng, JJ Kim, H Liu, H Jiang, YH Hu 2017 International Conference on Computational Science and Computational …, 2017 | 4 | 2017 |
IM-unpack: training and inference with arbitrarily low precision integers Z Zeng, K Sankaralingam, V Singh arXiv preprint arXiv:2403.07339, 2024 | 1 | 2024 |
Controlled differential equations on long sequences via non-standard wavelets S Pal, Z Zeng, SN Ravi, V Singh International Conference on Machine Learning, 26820-26836, 2023 | 1 | 2023 |
Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers Y Zhong, Y Zhou, Y Zhang, S Li, Y Li, F Chao, Z Zeng, R Ji arXiv preprint arXiv:2412.16553, 2024 | | 2024 |
On the Efficiency of Transformers Z Zeng The University of Wisconsin-Madison, 2024 | | 2024 |