A survey of large language models WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou, Y Min, B Zhang, J Zhang, ... arXiv preprint arXiv:2303.18223, 2023 | 4208* | 2023 |
Don't make your llm an evaluation benchmark cheater K Zhou, Y Zhu, Z Chen, W Chen, WX Zhao, X Chen, Y Lin, JR Wen, J Han arXiv preprint arXiv:2311.01964, 2023 | 127 | 2023 |
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models Z Chen, K Zhou, B Zhang, Z Gong, WX Zhao, JR Wen arXiv preprint arXiv:2305.14323, 2023 | 49 | 2023 |
Textbox: A unified, modularized, and extensible framework for text generation J Li, T Tang, G He, J Jiang, X Hu, P Xie, Z Chen, Z Yu, WX Zhao, JR Wen arXiv preprint arXiv:2101.02046, 2021 | 27 | 2021 |
JiuZhang 3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models K Zhou, B Zhang, J Wang, Z Chen, WX Zhao, J Sha, Z Sheng, S Wang, ... arXiv preprint arXiv:2405.14365, 2024 | 19 | 2024 |
Improving large language models via fine-grained reinforcement learning with minimum editing constraint Z Chen, K Zhou, WX Zhao, J Wan, F Zhang, D Zhang, JR Wen arXiv preprint arXiv:2401.06081, 2024 | 19 | 2024 |
Textbox 2.0: A text generation library with pre-trained language models T Tang, J Li, Z Chen, Y Hu, Z Yu, W Dai, Z Dong, X Cheng, Y Wang, ... arXiv preprint arXiv:2212.13005, 2022 | 12 | 2022 |
Jiuzhang 2.0: A unified chinese pre-trained language model for multi-task mathematical problem solving X Zhao, K Zhou, B Zhang, Z Gong, Z Chen, Y Zhou, JR Wen, J Sha, ... Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023 | 8 | 2023 |
Imitate, explore, and self-improve: A reproduction report on slow-thinking reasoning systems Y Min, Z Chen, J Jiang, J Chen, J Deng, Y Hu, Y Tang, J Wang, X Cheng, ... arXiv preprint arXiv:2412.09413, 2024 | 7 | 2024 |
Eliteplm: an empirical study on general language ability evaluation of pretrained language models J Li, T Tang, Z Gong, L Yang, Z Yu, Z Chen, J Wang, WX Zhao, JR Wen arXiv preprint arXiv:2205.01523, 2022 | 5 | 2022 |
Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment Z Chen, K Zhou, WX Zhao, J Wang, JR Wen Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 3* | 2024 |
Technical report: Enhancing llm reasoning with reward-guided tree search J Jiang, Z Chen, Y Min, J Chen, X Cheng, J Wang, Y Tang, H Sun, J Deng, ... arXiv preprint arXiv:2411.11694, 2024 | 1 | 2024 |
Towards Effective and Efficient Continual Pre-training of Large Language Models J Chen, Z Chen, J Wang, K Zhou, Y Zhu, J Jiang, Y Min, WX Zhao, Z Dou, ... arXiv preprint arXiv:2407.18743, 2024 | 1 | 2024 |
Yulan: An open-source large language model Y Zhu, K Zhou, K Mao, W Chen, Y Sun, Z Chen, Q Cao, Y Wu, Y Chen, ... arXiv preprint arXiv:2406.19853, 2024 | 1 | 2024 |
Extracting and Transferring Abilities For Building Multi-lingual Ability-enhanced Large Language Models Z Chen, L Song, K Zhou, WX Zhao, B Wang, W Chen, JR Wen arXiv preprint arXiv:2410.07825, 2024 | | 2024 |