‪Sining Zhoubian‬ - ‪Academic Search‬

קבלת פרופיל משלי

צוטט על ידי

	הכל	מאז 2020
ציטוטים ביבליוגרפיים	119	118
H-index	5	5
i10-index	3	3

0

70

35

2023202420252 66 49

עקוב אחר

Sining Zhoubian

Sining Zhoubian

Tsinghua University, ZhipuAI

כתובת אימייל מאומתת בדומיין mails.tsinghua.edu.cn - דף הבית

machine learning large language models reinforcement learning


כותרת מיון לפי ציטוט ביבליוגרפי מיון לפי שנה מיון לפי כותרת	צוטט על ידי צוטט על ידי	שנה
Rest-mcts*: Llm self-training via process reward guided tree search‏ D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang‏ Advances in Neural Information Processing Systems 37, 64735-64772, 2025‏	77	2025
Sciglm: Training scientific language models with self-reflective instruction annotation and tuning‏ D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ...‏ arXiv preprint arXiv:2401.07950, 2024‏	14	2024
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. Sciglm: Training scientific language models with self-reflective instruction annotation and …‏ D Zhang, Z Hu, S Zhoubian‏ arXiv preprint arXiv:2401.07950, 2024‏	13	2024
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. Sciglm: Training scientific language models with self-reflective instruction annotation and tuning‏ D Zhang, Z Hu, S Zhoubian‏ arXiv preprint arXiv:2401.07950, 2024‏	6	2024
Rest-mcts*: Llm self-training via process reward guided tree search, 2024a‏ D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang‏ URL https://arxiv. org/abs/2406.03816, 0‏	6
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language …‏ D Zhang, Z Hu, S Zhoubian‏ The Thirty-eight Conference on Neural Information Processing Systems …, 0‏	2
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models‏ D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ...‏ Advances in Neural Information Processing Systems 37, 1443-1473, 2025‏	1	2025
DataSciBench: An LLM Agent Benchmark for Data Science‏ D Zhang, S Zhoubian, M Cai, F Li, L Yang, W Wang, T Dong, Z Hu, J Tang, ...‏ arXiv preprint arXiv:2502.13897, 2025‏		2025
Rock Classification Based on Residual Networks‏ S Zhoubian, Y Wang, Z Jiang‏ arXiv preprint arXiv:2402.11831, 2024‏		2024

המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.

מאמרים 1–9