PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization Y Wang*, Z Yu*, Z Zeng, L Yang, C Wang, H Chen, C Jiang, R Xie, ... International Conference on Learning Representations (ICLR 2024), 2023 | 206 | 2023 |
Exploring Vision-Language Models for Imbalanced Learning Y Wang, Z Yu, J Wang, Q Heng, H Chen, W Ye, R Xie, X Xie, S Zhang International Journal of Computer Vision 132 (1), 224-237, 2024 | 31 | 2024 |
TextBox: A unified, modularized, and extensible framework for text generation J Li, T Tang, G He, J Jiang, X Hu, P Xie, Z Chen, Z Yu, WX Zhao, JR Wen Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 28 | 2021 |
KIEval: A Knowledge-grounded Interactive Evaluation Framework for Large Language Models Z Yu, C Gao, W Yao, Y Wang, W Ye, J Wang, X Xie, Y Zhang, S Zhang Annual Meeting of the Association for Computational Linguistics (ACL 2024), 2024 | 23 | 2024 |
Supervised knowledge makes large language models better in-context learners L Yang*, S Zhang*, Z Yu*, G Bao, Y Wang, J Wang, R Xu, W Ye, X Xie, ... International Conference on Learning Representations (ICLR 2024), 2023 | 16 | 2023 |
TextBox 2.0: A text generation library with pre-trained language models T Tang, J Li, Z Chen, Y Hu, Z Yu, W Dai, Z Dong, X Cheng, Y Wang, ... Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 12 | 2022 |
CodeShell technical report R Xie, Z Zeng, Z Yu, C Gao, S Zhang, W Ye arXiv preprint arXiv:2403.15747, 2024 | 8 | 2024 |
ElitePLM: An empirical study on general language ability evaluation of pretrained language models J Li, T Tang, Z Gong, L Yang, Z Yu, Z Chen, J Wang, WX Zhao, JR Wen Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 6 | 2022 |
LLMTune: Accelerate database knob tuning with large language models X Huang, H Li, J Zhang, X Zhao, Z Yao, Y Li, Z Yu, T Zhang, H Chen, C Li arXiv preprint arXiv:2404.11581, 2024 | 5 | 2024 |
FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models Z Yu, C Gao, W Yao, Y Wang, Z Zeng, W Ye, J Wang, Y Zhang, S Zhang Empirical Methods in Natural Language Processing (EMNLP 2024, Demo Track), 2024 | 4 | 2024 |
PURE: Aligning LLM via pluggable query reformulation for enhanced helpfulness W Yao, Y Wang, Z Yu, R Xie, S Zhang, W Ye Findings of the Association for Computational Linguistics: EMNLP 2024, 8721-8744, 2024 | 1 | 2024 |
An Empirical Analysis of Uncertainty in Large Language Model Evaluations Q Xie, Q Li, Z Yu, Y Zhang, Y Zhang, L Yang arXiv preprint arXiv:2502.10709, 2025 | | 2025 |
Outcome-Refining Process Supervision for Code Generation Z Yu, W Gu, Y Wang, Z Zeng, J Wang, W Ye, S Zhang arXiv preprint arXiv:2412.15118, 2024 | | 2024 |