팔로우
Zhenyu "Allen" Zhang
Zhenyu "Allen" Zhang
다른 이름Zhenyu Zhang
utexas.edu의 이메일 확인됨 - 홈페이지
제목
인용
인용
연도
H2o: Heavy-hitter oracle for efficient generative inference of large language models
Z Zhang, Y Sheng, T Zhou, T Chen, L Zheng, R Cai, Z Song, Y Tian, C Ré, ...
(NeurIPS) Advances in Neural Information Processing Systems 36, 34661-34710, 2023
2812023
Robust overfitting may be mitigated by properly learned smoothening
T Chen*, Z Zhang*, S Liu, S Chang, Z Wang
(* Equal Contribution, ICLR) International Conference on Learning …, 2021
2142021
Galore: Memory-efficient llm training by gradient low-rank projection
J Zhao, Z Zhang, B Chen, Z Wang, A Anandkumar, Y Tian
(ICML Oral) International Conference on Machine Learning, 2024
138*2024
Long live the lottery: The existence of winning tickets in lifelong learning
T Chen*, Z Zhang*, S Liu, S Chang, Z Wang
(* Equal Contribution, ICLR) International Conference on Learning …, 2021
752021
Efficient lottery ticket finding: Less data is more
Z Zhang*, X Chen*, T Chen*, Z Wang
(* Equal Contribution, ICML) International Conference on Machine Learning, 2021
602021
Gans can play lottery tickets too
X Chen*, Z Zhang*, Y Sui, T Chen
(* Equal Contribution, ICLR) International Conference on Learning …, 2021
602021
" BNN-BN=?": Training Binary Neural Networks Without Batch Normalization
T Chen, Z Zhang, X Ouyang, Z Liu, Z Shen, Z Wang
(CVPR Workshops Spotlight) IEEE Conference on Computer Vision and Pattern …, 2021
552021
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity
L Yin, Y Wu, Z Zhang, CY Hsieh, Y Wang, Y Jia, M Pechenizkiy, Y Liang, ...
(ICML) International Conference on Machine Learning, 2023
502023
Sparsity Winning Twice: Better Robust Generalization from More Efficient Training
T Chen*, Z Zhang*, P Wang*, S Balachandra*, H Ma*, Z Wang, Z Wang
(* Equal Contribution, ICLR) International Conference on Learning …, 2022
47*2022
Joma: Demystifying multilayer transformers via joint dynamics of mlp and attention
Y Tian, Y Wang, Z Zhang, B Chen, S Du
(ICLR) International Conference on Learning Representations, 2023
462023
The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy
T Chen, Z Zhang, Y Cheng, A Awadallah, Z Wang
(CVPR) IEEE Conference on Computer Vision and Pattern Recognition, 2022
452022
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers
T Chen*, Z Zhang*, AK JAISWAL, S Liu, Z Wang
(* Equal Contribution, ICLR Spotlight) International Conference on Learning …, 2023
44*2023
Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference
H Dong, X Yang, Z Zhang, Z Wang, Y Chi, B Chen
(ICML) International Conference on Machine Learning, 2024
33*2024
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
S Liu, T Chen, Z Zhang, X Chen, T Huang, AK JAISWAL, Z Wang
(ICLR Spotlight) International Conference on Learning Representations, 2023
312023
You are caught stealing my winning lottery ticket! making a lottery ticket claim its ownership
X Chen, T Chen, Z Zhang, Z Wang
(NeurIPS) Advances in Neural Information Processing Systems 34, 1780-1791, 2021
272021
Quarantine: Sparsity can uncover the trojan attack trigger for free
T Chen*, Z Zhang*, Y Zhang*, S Chang, S Liu, Z Wang
(* Equal Contribution, CVPR) IEEE Conference on Computer Vision and Pattern …, 2022
262022
Sparsity-Guided Holistic Explanation for LLMs with Interpretable Inference-Time Intervention
Z Tan, T Chen, Z Zhang, H Liu
(AAAI) The 38th Annual AAAI Conference on Artificial Intelligence, 2023
232023
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
P Li, Z Zhang, P Yadav, YL Sung, Y Cheng, M Bansal, T Chen
(ICLR Spotlight) International Conference on Learning Representations, 2023
222023
Found in the middle: How language models use long contexts better via plug-and-play positional encoding
Z Zhang, R Chen, S Liu, Z Yao, O Ruwase, B Chen, X Wu, Z Wang
(NeurIPS) Advances in Neural Information Processing Systems, 2024
172024
Can You Win Everything with A Lottery Ticket?
T Chen, Z Zhang, J Wu, R Huang, S Liu, S Chang, Z Wang
(TMLR) Transactions on Machine Learning Research, 2022
162022
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20