Sledovat
Qisen Yang (杨琪森)
Qisen Yang (杨琪森)
E-mailová adresa ověřena na: mails.tsinghua.edu.cn - Domovská stránka
Název
Citace
Citace
Rok
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling
S Wang, C Liu, Z Zheng, S Qi, S Chen, Q Yang, A Zhao, C Wang, S Song, ...
Findings of the Association for Computational Linguistics ACL 2024, 9909-9953, 2024
61*2024
Efficient knowledge distillation from model checkpoints
C Wang, Q Yang, R Huang, S Song, G Huang
Advances in Neural Information Processing Systems 35, 607-619, 2022
502022
Towards learning spatially discriminative feature representations
C Wang, J Xiao, Y Han, Q Yang, S Song, G Huang
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
272021
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
Q Yang, Z Wang, H Chen, S Wang, Y Pu, X Gao, W Huang, S Song, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
19*2024
Fine-grained few shot learning with foreground object transformation
C Wang, S Song, Q Yang, X Li, G Huang
Neurocomputing 466, 16-26, 2021
192021
Train once, get a family: State-adaptive balances for offline-to-online reinforcement learning
S Wang, Q Yang, J Gao, M Lin, H Chen, L Wu, N Jia, S Song, G Huang
Advances in Neural Information Processing Systems 36, 47081-47104, 2023
142023
Boosting Offline Reinforcement Learning with Action Preference Query
Q Yang, S Wang, MG Lin, S Song, G Huang
International Conference on Machine Learning 202, 39509--39523, 2023
102023
复杂开放水域下智能船舶路径规划与避障方法
杨琪森, 王慎执, 桑金楠, 王朝飞, 黄高, 吴澄, 宋士吉
计算机集成制造系统 28 (7), 2030, 2022
102022
Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment
Y Qisen, W Shenzhi, S Jinnan, W Chaofei, H Gao, WU Cheng, S Shiji
Computer Integrated Manufacturing System 28 (7), 2030, 2022
102022
Hundreds guide millions: Adaptive offline reinforcement learning with expert guidance
Q Yang, S Wang, Q Zhang, G Huang, S Song
IEEE Transactions on Neural Networks and Learning Systems 35 (11), 16288-16300, 2023
72023
Leveraging reward consistency for interpretable feature discovery in reinforcement learning
Q Yang, H Wang, M Tong, W Shi, G Huang, S Song
IEEE Transactions on Systems, Man, and Cybernetics: Systems 54 (2), 1014-1025, 2023
72023
Decoupled Prioritized Resampling for Offline RL
Y Yue, B Kang, X Ma, Q Yang, G Huang, S Song, S Yan
IEEE Transactions on Neural Networks and Learning Systems, 2024
12024
Systém momentálně nemůže danou operaci provést. Zkuste to znovu později.
Články 1–12