Large language model alignment: A survey T Shen, R Jin, Y Huang, C Liu, W Dong, Z Guo, X Wu, Y Liu, D Xiong arXiv preprint arXiv:2309.15025, 2023 | 161 | 2023 |
Evaluating large language models: A comprehensive survey Z Guo, R Jin, C Liu, Y Huang, D Shi, L Yu, Y Liu, J Li, B Xiong, D Xiong arXiv preprint arXiv:2310.19736, 2023 | 133 | 2023 |
A comprehensive evaluation of quantization strategies for large language models R Jin, J Du, W Huang, W Liu, J Luan, B Wang, D Xiong Findings of the Association for Computational Linguistics ACL 2024, 12186-12215, 2024 | 27* | 2024 |
M3ke: A massive multi-level multi-subject knowledge evaluation benchmark for chinese large language models C Liu, R Jin, Y Ren, L Yu, T Dong, X Peng, S Zhang, J Peng, P Zhang, ... arXiv preprint arXiv:2305.10263, 2023 | 27 | 2023 |
Informative language representation learning for massively multilingual neural machine translation R Jin, D Xiong arXiv preprint arXiv:2209.01530, 2022 | 11 | 2022 |
Ircan: Mitigating knowledge conflicts in llm generation via identifying and reweighting context-aware neurons D Shi, R Jin, T Shen, W Dong, X Wu, D Xiong Advances in Neural Information Processing Systems 37, 4997-5024, 2025 | 5 | 2025 |
ConTrans: Weak-to-strong alignment engineering via concept transplantation W Dong, X Wu, R Jin, S Xu, D Xiong arXiv preprint arXiv:2405.13578, 2024 | 4 | 2024 |
Openeval: benchmarking Chinese LLMs across capability, alignment and safety C Liu, L Yu, J Li, R Jin, Y Huang, L Shi, J Zhang, X Ji, T Cui, T Liu, J Song, ... arXiv preprint arXiv:2403.12316, 2024 | 4 | 2024 |
Do Large Language Models Mirror Cognitive Language Processing? Y Ren, R Jin, T Zhang, D Xiong arXiv preprint arXiv:2402.18023, 2024 | 4 | 2024 |
Followeval: A multi-dimensional benchmark for assessing the instruction-following capability of large language models Y Jing, R Jin, J Hu, H Qiu, X Wang, P Wang, D Xiong arXiv preprint arXiv:2311.09829, 2023 | 4 | 2023 |
Finemath: A fine-grained mathematical evaluation benchmark for chinese large language models Y Liu, R Jin, L Shi, Z Yao, D Xiong arXiv preprint arXiv:2403.07747, 2024 | 3 | 2024 |
Large language model safety: A holistic survey D Shi, T Shen, Y Huang, Z Li, Y Leng, R Jin, C Liu, X Wu, Z Guo, L Yu, ... arXiv preprint arXiv:2412.17686, 2024 | 2 | 2024 |
Multilingual Large Language Models: A Systematic Survey S Zhu, S Xu, H Sun, L Pan, M Cui, J Du, R Jin, A Branco, D Xiong arXiv preprint arXiv:2411.11072, 2024 | 2 | 2024 |
Evaluating Chinese large language models on discipline knowledge acquisition via memorization and robustness assessment C Liu, R Jin, M Steedman, D Xiong Proceedings of the 1st Workshop on Data Contamination (CONDA), 1-12, 2024 | 2 | 2024 |
Lhmke: A large-scale holistic multi-subject knowledge evaluation benchmark for chinese large language models C Liu, R Jin, Y Ren, D Xiong arXiv preprint arXiv:2403.12601, 2024 | 2 | 2024 |
FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data H Sun, R Jin, S Xu, L Pan, M Cui, J Du, Y Lei, L Yang, L Shi, J Xiao, S Zhu, ... arXiv preprint arXiv:2408.06273, 2024 | 1 | 2024 |
Joint training and decoding for multilingual end-to-end simultaneous speech translation W Huang, R Jin, W Zhang, J Luan, B Wang, D Xiong ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Star-Agents: Automatic Data Optimization with LLM Agents for Instruction Tuning H Zhou, Y Tang, H Qin, Y Yang, R Jin, D Xiong, K Han, Y Wang Advances in Neural Information Processing Systems 37, 4575-4597, 2025 | | 2025 |
Empirical Study on Data Attributes Insufficiency of Evaluation Benchmarks for LLMs C Liu, R Jin, Z Yao, T Li, L Cheng, M Steedman, D Xiong Proceedings of the 31st International Conference on Computational …, 2025 | | 2025 |
CS2W: A Chinese Spoken-to-Written Style Conversion Dataset with Multiple Conversion Types Z Guo, L Yu, M Xu, R Jin, D Xiong Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | | 2023 |