Qisen Yang (杨琪森)

Citace

	Všechny	Od 2020
Citace	235	234
h-index	9	9
i10-index	9	9

140

105

202220232024202520 53 140 21

Veřejný přístup

Zobrazit všechny

8 článků

1 článek

dostupné

nedostupné

Vychází ze zplnomocnění pro financování

Spoluautoři

Shiji SongTsinghua UniversityE-mailová adresa ověřena na: tsinghua.edu.cn
Gao Huang （黄高）Associate Professor, Tsinghua UniversityE-mailová adresa ověřena na: tsinghua.edu.cn
Chaofei WangTsinghua UniversityE-mailová adresa ověřena na: mails.tsinghua.edu.cn
Zilong Zheng (郑子隆)UCLA CS PhDE-mailová adresa ověřena na: ucla.edu
Matthieu LinTsinghua UniversityE-mailová adresa ověřena na: mails.tsinghua.edu.cn
Zekun Moore WangBeihang University, ByteDance, M-A-PE-mailová adresa ověřena na: buaa.edu.cn
Yifan Pu (浦一凡)Department of Automation, Tsinghua UniversityE-mailová adresa ověřena na: mails.tsinghua.edu.cn
Yizeng HanAlibaba DAMO AcademyE-mailová adresa ověřena na: alibaba-inc.com
Shenzhi Wang (王慎执)PhD Candidate at Tsinghua UniversityE-mailová adresa ověřena na: mails.tsinghua.edu.cn
Qihang ZhangUniversity of British ColumbiaE-mailová adresa ověřena na: student.ubc.ca

Sledovat

Qisen Yang (杨琪森)

Tsinghua University

E-mailová adresa ověřena na: mails.tsinghua.edu.cn - Domovská stránka

Reinforcement Learning Large Language Model Efficient Deep Learning Psychology


Název Seřadit podle citací Seřadit podle roku Seřadit podle názvu	Citace Citace	Rok
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling S Wang, C Liu, Z Zheng, S Qi, S Chen, Q Yang, A Zhao, C Wang, S Song, ... Findings of the Association for Computational Linguistics ACL 2024, 9909-9953, 2024	61*	2024
Efficient knowledge distillation from model checkpoints C Wang, Q Yang, R Huang, S Song, G Huang Advances in Neural Information Processing Systems 35, 607-619, 2022	50	2022
Towards learning spatially discriminative feature representations C Wang, J Xiao, Y Han, Q Yang, S Song, G Huang Proceedings of the IEEE/CVF international conference on computer vision …, 2021	27	2021
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents Q Yang, Z Wang, H Chen, S Wang, Y Pu, X Gao, W Huang, S Song, ... Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	19*	2024
Fine-grained few shot learning with foreground object transformation C Wang, S Song, Q Yang, X Li, G Huang Neurocomputing 466, 16-26, 2021	19	2021
Train once, get a family: State-adaptive balances for offline-to-online reinforcement learning S Wang, Q Yang, J Gao, M Lin, H Chen, L Wu, N Jia, S Song, G Huang Advances in Neural Information Processing Systems 36, 47081-47104, 2023	14	2023
Boosting Offline Reinforcement Learning with Action Preference Query Q Yang, S Wang, MG Lin, S Song, G Huang International Conference on Machine Learning 202, 39509--39523, 2023	10	2023
复杂开放水域下智能船舶路径规划与避障方法杨琪森，王慎执，桑金楠，王朝飞，黄高，吴澄，宋士吉计算机集成制造系统 28 (7), 2030, 2022	10	2022
Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment Y Qisen, W Shenzhi, S Jinnan, W Chaofei, H Gao, WU Cheng, S Shiji Computer Integrated Manufacturing System 28 (7), 2030, 2022	10	2022
Hundreds guide millions: Adaptive offline reinforcement learning with expert guidance Q Yang, S Wang, Q Zhang, G Huang, S Song IEEE Transactions on Neural Networks and Learning Systems 35 (11), 16288-16300, 2023	7	2023
Leveraging reward consistency for interpretable feature discovery in reinforcement learning Q Yang, H Wang, M Tong, W Shi, G Huang, S Song IEEE Transactions on Systems, Man, and Cybernetics: Systems 54 (2), 1014-1025, 2023	7	2023
Decoupled Prioritized Resampling for Offline RL Y Yue, B Kang, X Ma, Q Yang, G Huang, S Song, S Yan IEEE Transactions on Neural Networks and Learning Systems, 2024	1	2024

Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.

Články 1–12

Citace za rok

Duplicitní citace

Sloučené citace

Přidat spoluautorySpoluautoři

Sledovat

Citace

Spoluautoři