עקוב אחר
Sining Zhoubian
Sining Zhoubian
Tsinghua University, ZhipuAI
כתובת אימייל מאומתת בדומיין mails.tsinghua.edu.cn - דף הבית
כותרת
צוטט על ידי
צוטט על ידי
שנה
Rest-mcts*: Llm self-training via process reward guided tree search
D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang
Advances in Neural Information Processing Systems 37, 64735-64772, 2025
772025
Sciglm: Training scientific language models with self-reflective instruction annotation and tuning
D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ...
arXiv preprint arXiv:2401.07950, 2024
142024
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. Sciglm: Training scientific language models with self-reflective instruction annotation and …
D Zhang, Z Hu, S Zhoubian
arXiv preprint arXiv:2401.07950, 2024
132024
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. Sciglm: Training scientific language models with self-reflective instruction annotation and tuning
D Zhang, Z Hu, S Zhoubian
arXiv preprint arXiv:2401.07950, 2024
62024
Rest-mcts*: Llm self-training via process reward guided tree search, 2024a
D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang
URL https://arxiv. org/abs/2406.03816, 0
6
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language …
D Zhang, Z Hu, S Zhoubian
The Thirty-eight Conference on Neural Information Processing Systems …, 0
2
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models
D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ...
Advances in Neural Information Processing Systems 37, 1443-1473, 2025
12025
DataSciBench: An LLM Agent Benchmark for Data Science
D Zhang, S Zhoubian, M Cai, F Li, L Yang, W Wang, T Dong, Z Hu, J Tang, ...
arXiv preprint arXiv:2502.13897, 2025
2025
Rock Classification Based on Residual Networks
S Zhoubian, Y Wang, Z Jiang
arXiv preprint arXiv:2402.11831, 2024
2024
המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.
מאמרים 1–9