Improving neural network quantization without retraining using outlier channel splitting R Zhao, Y Hu, J Dotzel, C De Sa, Z Zhang International conference on machine learning, 7543-7552, 2019 | 370 | 2019 |
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design C Hao, J Dotzel, J Xiong, L Benini, Z Zhang, D Chen IEEE Design & Test, 2021 | 49 | 2021 |
Logic synthesis meets machine learning: Trading exactness for generalization S Rai, WL Neto, Y Miyasaka, X Zhang, M Yu, Q Yi, M Fujita, GB Manske, ... 2021 Design, Automation & Test in Europe Conference & Exhibition (DATE …, 2021 | 40 | 2021 |
Building efficient deep neural networks with unitary group convolutions R Zhao, Y Hu, J Dotzel, CD Sa, Z Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 34 | 2019 |
Improving neural network quantization using outlier channel splitting R Zhao, Y Hu, J Dotzel, C De Sa, Z Zhang arXiv preprint arXiv:1901.09504 1 (2), 2019 | 13 | 2019 |
Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs J Dotzel, Y Chen, B Kotb, S Prasad, G Wu, S Li, MS Abdelfattah, Z Zhang International Conference on Machine Learning, 2024 | 7 | 2024 |
M4BRAM: Mixed-Precision Matrix-Matrix Multiplication in FPGA Block RAMs Y Chen, J Dotzel, MS Abdelfattah 2023 International Conference on Field Programmable Technology (ICFPT), 69-78, 2023 | 7 | 2023 |
OverQ: Opportunistic outlier quantization for neural network accelerators R Zhao, J Dotzel, Z Hu, P Ivanov, C De Sa, Z Zhang arXiv preprint arXiv:1910.06909, 2019 | 5 | 2019 |
ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models Y Akhauri, AF AbouElhamayed, J Dotzel, Z Zhang, AM Rush, S Huda, ... Conference on Empirical Methods in Natural Language Processing, 2024 | 4 | 2024 |
FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search J Dotzel, G Wu, A Li, M Umar, Y Ni, MS Abdelfattah, Z Zhang, L Cheng, ... International Conference on Automated Machine Learning, 2024 | 1 | 2024 |
Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel J Dotzel, B Kotb, J Dotzel, MS Abdelfattah, Z Zhang The Second Tiny Papers Track at ICLR 2024, 2024 | 1 | 2024 |
Radial Networks: Dynamic Layer Routing for High-Performance Large Language Models J Dotzel, Y Akhauri, AS AbouElhamayed, C Jiang, M Abdelfattah, Z Zhang arXiv preprint arXiv:2404.04900, 2024 | 1 | 2024 |
Opportunities for Post-Training Dynamic Layer Sparsity in Large Vision and Language Models J Dotzel, C Jiang, M Abdelfattah, Z Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |