Runji Lin

Cited by

	All	Since 2020
Citations	3213	3210
h-index	10	10
i10-index	11	11

Citations per year: 2022: 9; 2023: 149; 2024: 2725; 2025: 322

Public access

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yaodong Yang - BOYA (博雅) Assistant Professor at Peking University - Verified email at pku.edu.cn
Keming Lu - University of Southern California - Verified email at usc.edu
Junyang Lin - Qwen Team, Alibaba Group & Peking University - Verified email at alibaba-inc.com
Jun Wang - Professor, Computer Science, University College London - Verified email at cs.ucl.ac.uk
Chang Zhou - Peking University ([email protected]) - Verified email at pku.edu.cn
Weinan Zhang - Professor, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Ying Wen - Associate Professor, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Muning Wen - PhD student, Shanghai Jiao Tong University - Verified email at sjtu.edu.cn
Haifeng Zhang - Institute of Automation, Chinese Academy of Sciences - Verified email at ia.ac.cn
Bowen Yu - Qwen Team, Alibaba Group - Verified email at alibaba-inc.com
Jakub Grudzien Kuba - UC Berkeley - Verified email at berkeley.edu
Yali Du - Turing Fellow, Associate professor, King's College London - Verified email at kcl.ac.uk
Xidong Feng - Google DeepMind - Verified email at google.com
Dixia Fan (范迪夏) - Assistant Professor, Westlake University - Verified email at westlake.edu.cn
Zhipeng Wang - Ph.D. student of Queen's University - Verified email at queensu.ca
Yanbing Yang - College of Computer Science, Sichuan University - Verified email at scu.edu.cn

Runji Lin

Institute of Automation, Chinese Academy of Sciences

Verified email at ia.ac.cn

Reinforcement Learning, Multi-Agent System, Large Language Model


Title	Cited by	Year
Qwen technical report - J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng, Y Fan, W Ge, Y Han, F Huang, ... - arXiv preprint arXiv:2309.16609, 2023	1882	2023
Qwen2.5 technical report - A Yang, B Yang, B Zhang, B Hui, B Zheng, B Yu, C Li, D Liu, F Huang, ... - arXiv preprint arXiv:2412.15115, 2024	844	2024
Multi-Agent Reinforcement Learning is a Sequence Modeling Problem - M Wen, JG Kuba, R Lin, W Zhang, Y Wen, J Wang, Y Yang - NeurIPS 2022, 2022	198	2022
#InsTag: Instruction tagging for analyzing supervised fine-tuning of large language models - K Lu, H Yuan, Z Yuan, R Lin, J Lin, C Tan, C Zhou, J Zhou - The Twelfth International Conference on Learning Representations, 2023	75	2023
Routing to the expert: Efficient reward-guided ensemble of large language models - K Lu, H Yuan, R Lin, J Lin, Z Yuan, C Zhou, J Zhou - NAACL, 2023	48	2023
Qwen2.5-Math technical report: Toward mathematical expert model via self-improvement - A Yang, B Zhang, B Hui, B Gao, B Yu, C Li, D Liu, J Tu, J Zhou, J Lin, K Lu, ... - arXiv preprint arXiv:2409.12122, 2024	41	2024
Large language models play StarCraft II: Benchmarks and a chain of summarization approach - W Ma, Q Mi, X Yan, Y Wu, R Lin, H Zhang, J Wang - NeurIPS 2024, 2023	36	2023
Large Sequence Models for Sequential Decision-Making: A Survey - M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai, J Wang, H Zhang, ... - Frontiers of Computer Science, 2023	33	2023
Online merging optimizers for boosting rewards and mitigating tax in alignment - K Lu, B Yu, F Huang, Y Fan, R Lin, C Zhou - arXiv preprint arXiv:2405.17931, 2024	15	2024
Contextual Transformer for Offline Meta Reinforcement Learning - R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang - NeurIPS 2022 Workshop: Foundation Models for Decision Making, 2022	11	2022
Learn to flap: foil non-parametric path planning via deep reinforcement learning - ZP Wang, RJ Lin, ZY Zhao, X Chen, PM Guo, N Yang, ZC Wang, DX Fan - Journal of Fluid Mechanics 984, A9, 2024	10	2024
Scalable Model-based Policy Optimization for Decentralized Networked Systems - Y Du, C Ma, Y Liu, R Lin, H Dong, J Wang, Y Yang - IROS 2022, 2022	8*	2022
ProcessBench: Identifying process errors in mathematical reasoning - C Zheng, Z Zhang, B Zhang, R Lin, K Lu, B Yu, D Liu, J Zhou, J Lin - arXiv preprint arXiv:2412.06559, 2024	6	2024
LLM critics help catch bugs in mathematics: Towards a better mathematical verifier with natural language feedback - B Gao, Z Cai, R Xu, P Wang, C Zheng, R Lin, K Lu, J Lin, C Zhou, W Xiao, ... - arXiv preprint arXiv:2406.14024, 2024	5	2024
Increasing the Data Rate for Reflected Optical Camera Communication Using Uniform LED Light - Z Chen, R Lin, H Duan, Y Chen, Y Yang, R Wu, L Chen - IEEE INFOCOM 2020-IEEE Conference on Computer Communications Workshops …, 2020	1	2020
The Lessons of Developing Process Reward Models in Mathematical Reasoning - Z Zhang, C Zheng, Y Wu, B Zhang, R Lin, B Yu, D Liu, J Zhou, J Lin - arXiv preprint arXiv:2501.07301, 2025		2025
Online Decision MetaMorphFormer: A Casual Transformer-Based Reinforcement Learning Framework of Universal Embodied Intelligence - L Ji, R Lin - arXiv preprint arXiv:2409.07341, 2024		2024
Learning Robust Communication by Adversarial Training in Networked System Control - R Lin, H Zhang - Chinese Conference on Swarm Intelligence and Cooperative Control, 605-619, 2023		2023

Articles 1–18
