Obserwuj
Yuhui Xu
Yuhui Xu
Inne imiona/nazwiska徐 宇辉, Evan Xu
Salesforce Research
Zweryfikowany adres z salesforce.com - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search
Y Xu, L Xie, X Zhang, X Chen, GJ Qi, Q Tian, H Xiong
International Conference on Learning Representations, 2020
8822020
Deep neural network compression with single and multiple level quantization
Y Xu, Y Wang, A Zhou, W Lin, H Xiong
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
1502018
Trp: Trained rank pruning for efficient deep neural networks
Y Xu, Y Li, S Zhang, W Wen, B Wang, Y Qi, Y Chen, W Lin, H Xiong
IJCAI 2020, 2020
149*2020
Qa-lora: Quantization-aware low-rank adaptation of large language models
Y Xu, L Xie, X Gu, X Chen, H Chang, H Zhang, Z Chen, X Zhang, Q Tian
ICLR 2024, 2023
1352023
Weight-sharing neural architecture search: A battle to shrink the optimization gap
L Xie, X Chen, K Bi, L Wei, Y Xu, L Wang, Z Chen, A Xiao, J Chang, ...
ACM Computing Surveys (CSUR) 54 (9), 1-37, 2021
127*2021
Partially-connected neural architecture search for reduced computational redundancy
Y Xu, L Xie, W Dai, X Zhang, X Chen, GJ Qi, H Xiong, Q Tian
IEEE Transactions on Pattern Analysis and Machine Intelligence 43 (9), 2953-2970, 2021
692021
Latency-aware differentiable neural architecture search
Y Xu, L Xie, X Zhang, X Chen, B Shi, Q Tian, H Xiong
arXiv preprint arXiv:2001.06392, 2020
442020
Filter level pruning based on similar feature extraction for convolutional neural networks
L Li, Y Xu, J Zhu
IEICE TRANSACTIONS on Information and Systems 101 (4), 1203-1206, 2018
282018
Iterative deep neural network quantization with lipschitz constraint
Y Xu, W Dai, Y Qi, J Zou, H Xiong
IEEE Transactions on Multimedia 22 (7), 1874-1888, 2019
212019
Fitting the search space of weight-sharing nas with graph convolutional networks
X Chen, L Xie, J Wu, L Wei, Y Xu, Q Tian
Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7064-7072, 2021
202021
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models
X Lu, Q Liu, Y Xu, A Zhou, S Huang, B Zhang, J Yan, H Li
ACL 2024, 2024
192024
Bnet: Batch normalization with enhanced linear transformation
Y Xu, L Xie, C Xie, W Dai, J Mei, S Qiao, W Shen, H Xiong, A Yuille
IEEE transactions on pattern analysis and machine intelligence 45 (7), 9225-9232, 2023
18*2023
DNQ: Dynamic Network Quantization
Y Xu, S Zhang, Y Qi, J Guo, W Lin, H Xiong
Data Compression Conference (DCC2019), 2018
122018
Think: Thinner key cache by query-driven pruning
Y Xu, Z Jie, H Dong, L Wang, X Lu, A Zhou, A Saha, C Xiong, D Sahoo
ICLR 2025, 2024
102024
Fedexg: Federated learning with model exchange
Z Mao, W Dai, C Li, Y Xu, S Wang, J Zou, H Xiong
2020 IEEE International Symposium on Circuits and Systems (ISCAS), 1-5, 2020
102020
Dynamic-stride-net: Deep convolutional neural network with dynamic stride
Z Yang, Y Xu, W Dai, H Xiong
Optoelectronic Imaging and Multimedia Technology VI 11187, 42-53, 2019
102019
Noise-to-compression variational autoencoder for efficient end-to-end optimized image coding
J Luo, S Li, W Dai, Y Xu, D Cheng, G Li, H Xiong
2020 Data Compression Conference (DCC), 33-42, 2020
82020
Tiny-hourglassnet: An efficient design for 3d human pose estimation
B Shi, Y Xu, W Dai, B Wang, S Zhang, C Li, J Zou, H Xiong
2020 IEEE international conference on image processing (ICIP), 1491-1495, 2020
52020
SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models
X Lu, A Zhou, Y Xu, R Zhang, P Gao, H Li
ICML 2024, 2024
42024
可解释化, 结构化, 多模态化的深度神经网络
熊红凯, 高星, 李劭辉, 徐宇辉, 王涌壮, 余豪阳, 刘昕, 张云飞
模式识别与人工智能 31 (1), 1-11, 2018
42018
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20