Seguir
Qisen Yang (杨琪森)
Qisen Yang (杨琪森)
Dirección de correo verificada de mails.tsinghua.edu.cn - Página principal
Título
Citado por
Citado por
Año
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling
S Wang, C Liu, Z Zheng, S Qi, S Chen, Q Yang, A Zhao, C Wang, S Song, ...
Findings of the Association for Computational Linguistics ACL 2024, 9909-9953, 2024
56*2024
Efficient knowledge distillation from model checkpoints
C Wang, Q Yang, R Huang, S Song, G Huang
Advances in Neural Information Processing Systems 35, 607-619, 2022
482022
Towards learning spatially discriminative feature representations
C Wang, J Xiao, Y Han, Q Yang, S Song, G Huang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
272021
Fine-grained few shot learning with foreground object transformation
C Wang, S Song, Q Yang, X Li, G Huang
Neurocomputing 466, 16-26, 2021
192021
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
Q Yang, Z Wang, H Chen, S Wang, Y Pu, X Gao, W Huang, S Song, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
15*2024
Train once, get a family: State-adaptive balances for offline-to-online reinforcement learning
S Wang, Q Yang, J Gao, M Lin, H Chen, L Wu, N Jia, S Song, G Huang
Advances in Neural Information Processing Systems 36, 2024
122024
Boosting Offline Reinforcement Learning with Action Preference Query
Q Yang, S Wang, MG Lin, S Song, G Huang
International Conference on Machine Learning 202, 39509--39523, 2023
102023
复杂开放水域下智能船舶路径规划与避障方法
杨琪森, 王慎执, 桑金楠, 王朝飞, 黄高, 吴澄, 宋士吉
计算机集成制造系统 28 (7), 2030, 2022
102022
Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment
Y Qisen, W Shenzhi, S Jinnan, W Chaofei, H Gao, WU Cheng, S Shiji
Computer Integrated Manufacturing System 28 (7), 2030, 2022
102022
Hundreds guide millions: Adaptive offline reinforcement learning with expert guidance
Q Yang, S Wang, Q Zhang, G Huang, S Song
IEEE Transactions on Neural Networks and Learning Systems, 2023
72023
Leveraging reward consistency for interpretable feature discovery in reinforcement learning
Q Yang, H Wang, M Tong, W Shi, G Huang, S Song
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2023
62023
Decoupled Prioritized Resampling for Offline RL
Y Yue, B Kang, X Ma, Q Yang, G Huang, S Song, S Yan
IEEE Transactions on Neural Networks and Learning Systems, 2024
2024
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–12