Följ
Hongyu Wang
Hongyu Wang
University of Chinese Academy of Sciences
Verifierad e-postadress på mail.ustc.edu.cn - Startsida
Titel
Citeras av
Citeras av
År
Bitnet: Scaling 1-bit transformers for large language models
H Wang, S Ma, L Dong, S Huang, H Wang, L Ma, F Yang, R Wang, Y Wu, ...
arXiv preprint arXiv:2310.11453, 2023
1812023
Deepnet: Scaling transformers to 1,000 layers
H Wang, S Ma, L Dong, S Huang, D Zhang, F Wei
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
1772024
The era of 1-bit llms: All large language models are in 1.58 bits
S Ma, H Wang, L Ma, L Wang, W Wang, S Huang, L Dong, R Wang, J Xue, ...
arXiv preprint arXiv:2402.17764 1, 2024
1752024
Magneto: A Foundation Transformer
Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng ...
International Conference on Machine Learning, 2023
43*2023
TorchScale: Transformers at scale
S Ma, H Wang, S Huang, W Wang, Z Chi, L Dong, A Benhaim, B Patra, ...
arXiv preprint arXiv:2211.13184, 2022
132022
Q-sparse: All large language models can be fully sparsely-activated
H Wang, S Ma, R Wang, F Wei
arXiv preprint arXiv:2407.10969, 2024
52024
M4u: Evaluating multilingual understanding and reasoning for large multimodal models
H Wang, J Xu, S Xie, R Wang, J Li, Z Xie, B Zhang, C Xiong, X Chen
arXiv preprint arXiv:2405.15638, 2024
42024
Bitnet. cpp: Efficient Edge Inference for Ternary LLMs
J Wang, H Zhou, T Song, S Cao, Y Xia, T Cao, J Wei, S Ma, H Wang, ...
arXiv preprint arXiv:2502.11880, 2025
1*2025
BitNet a4. 8: 4-bit Activations for 1-bit LLMs
H Wang, S Ma, F Wei
arXiv preprint arXiv:2411.04965, 2024
12024
Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation
S Xie, H Wang, Z Xiao, R Wang, X Chen
arXiv preprint arXiv:2501.04268, 2025
2025
Transformer network with normalization including scaling parameter
Shuming MA, Li Dong, Shaohan Huang, Dongdong Zhang, Furu Wei, Hongyu Wang
US Patent App. 18/176,037, 2024
2024
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–11