Rest-mcts*: Llm self-training via process reward guided tree search D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang Advances in Neural Information Processing Systems 37, 64735-64772, 2025 | 77 | 2025 |
Sciglm: Training scientific language models with self-reflective instruction annotation and tuning D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ... arXiv preprint arXiv:2401.07950, 2024 | 14 | 2024 |
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. Sciglm: Training scientific language models with self-reflective instruction annotation and … D Zhang, Z Hu, S Zhoubian arXiv preprint arXiv:2401.07950, 2024 | 13 | 2024 |
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. Sciglm: Training scientific language models with self-reflective instruction annotation and tuning D Zhang, Z Hu, S Zhoubian arXiv preprint arXiv:2401.07950, 2024 | 6 | 2024 |
Rest-mcts*: Llm self-training via process reward guided tree search, 2024a D Zhang, S Zhoubian, Z Hu, Y Yue, Y Dong, J Tang URL https://arxiv. org/abs/2406.03816, 0 | 6 | |
Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, and Jie Tang. 2024. SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language … D Zhang, Z Hu, S Zhoubian The Thirty-eight Conference on Neural Information Processing Systems …, 0 | 2 | |
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models D Zhang, Z Hu, S Zhoubian, Z Du, K Yang, Z Wang, Y Yue, Y Dong, ... Advances in Neural Information Processing Systems 37, 1443-1473, 2025 | 1 | 2025 |
DataSciBench: An LLM Agent Benchmark for Data Science D Zhang, S Zhoubian, M Cai, F Li, L Yang, W Wang, T Dong, Z Hu, J Tang, ... arXiv preprint arXiv:2502.13897, 2025 | | 2025 |
Rock Classification Based on Residual Networks S Zhoubian, Y Wang, Z Jiang arXiv preprint arXiv:2402.11831, 2024 | | 2024 |