Sining Zhoubian
Tsinghua University, ZhipuAI
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang
Advances in Neural Information Processing Systems 37, 64735-64772, 2025
Cited by: 81 | Year: 2025
SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning
D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ...
arXiv preprint arXiv:2401.07950, 2024
Cited by: 15 | Year: 2024
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. SciGLM: Training scientific language models with self-reflective instruction annotation and …
D Zhang, Z Hu, S Zhoubian
arXiv preprint arXiv:2401.07950, 2024
Cited by: 13 | Year: 2024
ReST-MCTS*: LLM self-training via process reward guided tree search, 2024a
D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang
URL https://arxiv.org/abs/2406.03816
Cited by: 8
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. SciGLM: Training scientific language models with self-reflective instruction annotation and tuning
D Zhang, Z Hu, S Zhoubian
arXiv preprint arXiv:2401.07950, 2024
Cited by: 6 | Year: 2024
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language …
D Zhang, Z Hu, S Zhoubian
The Thirty-eighth Conference on Neural Information Processing Systems …
Cited by: 2
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models
D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ...
Advances in Neural Information Processing Systems 37, 1443-1473, 2025
Cited by: 1 | Year: 2025
DataSciBench: An LLM Agent Benchmark for Data Science
D Zhang, S Zhoubian, M Cai, F Li, L Yang, W Wang, T Dong, Z Hu, J Tang, ...
arXiv preprint arXiv:2502.13897, 2025
Year: 2025
Rock Classification Based on Residual Networks
S Zhoubian, Y Wang, Z Jiang
arXiv preprint arXiv:2402.11831, 2024
Year: 2024
Articles 1–9