팔로우
Shihan Dou
Shihan Dou
m.fudan.edu.cn의 이메일 확인됨
제목
인용
인용
연도
The rise and potential of large language model based agents: A survey
Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ...
Science China Information Sciences 68 (2), 121101, 2025
7802025
Secrets of RLHF in large language models part I: PPO
R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ...
arXiv preprint arXiv:2307.04964, 2023
134*2023
Vulcnn: An image-inspired scalable vulnerability detection system
Y Wu, D Zou, S Dou, W Yang, D Xu, H Jin
Proceedings of the 44th International Conference on Software Engineering …, 2022
1332022
LoRAMoE: Alleviating world knowledge forgetting in large language models via MoE-style plugin
S Dou, E Zhou, Y Liu, S Gao, W Shen, L Xiong, Y Zhou, X Wang, Z Xi, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
89*2024
Secrets of RLHF in Large Language Models Part II: Reward Modeling
B Wang, R Zheng, L Chen, Y Liu, S Dou, C Huang, W Shen, S Jin, E Zhou, ...
arXiv preprint arXiv:2401.06080, 2024
86*2024
SCDetector: Software functional clone detection based on semantic tokens analysis
Y Wu, D Zou, S Dou, S Yang, W Yang, F Cheng, H Liang, H Jin
Proceedings of the 35th IEEE/ACM international conference on automated …, 2020
662020
IntDroid: Android malware detection based on API intimacy analysis
D Zou, Y Wu, S Yang, A Chauhan, W Yang, J Zhong, S Dou, H Jin
ACM Transactions on Software Engineering and Methodology (TOSEM) 30 (3), 1-32, 2021
482021
Codechameleon: Personalized encryption framework for jailbreaking large language models
H Lv, X Wang, Y Zhang, C Huang, S Dou, J Ye, T Gui, Q Zhang, X Huang
arXiv preprint arXiv:2402.16717, 2024
322024
MINER: Improving out-of-vocabulary named entity recognition from an information theoretic perspective
X Wang, S Dou, L Xiong, Y Zou, Q Zhang, T Gui, L Qiao, Z Cheng, ...
arXiv preprint arXiv:2204.04391, 2022
322022
Loose lips sink ships: Mitigating length bias in reinforcement learning from human feedback
W Shen, R Zheng, W Zhan, J Zhao, S Dou, T Gui, Q Zhang, X Huang
arXiv preprint arXiv:2310.05199, 2023
312023
Towards understanding the capability of large language models on code clone detection: a survey
S Dou, J Shan, H Jia, W Deng, Z Xi, W He, Y Wu, T Gui, Y Liu, X Huang
arXiv preprint arXiv:2308.01191, 2023
30*2023
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models
W Zhou, X Wang, L Xiong, H Xia, Y Gu, M Chai, F Zhu, C Huang, S Dou, ...
arXiv preprint arXiv:2403.12171, 2024
262024
Stepcoder: Improve code generation with reinforcement learning from compiler feedback
S Dou, Y Liu, H Jia, L Xiong, E Zhou, W Shen, J Shan, C Huang, X Wang, ...
arXiv preprint arXiv:2402.01391, 2024
26*2024
Obfuscation-resilient android malware analysis based on contrastive learning
Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin
arXiv preprint arXiv:2107.03799, 2021
202021
Tooleyes: Fine-grained evaluation for tool learning capabilities of large language models in real-world scenarios
J Ye, G Li, S Gao, C Huang, Y Wu, S Li, X Fan, S Dou, Q Zhang, T Gui, ...
arXiv preprint arXiv:2401.00741, 2024
172024
Contrastive learning for robust android malware familial classification
Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin
IEEE Transactions on Dependable and Secure Computing, 2022
172022
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study
S Dou, H Jia, S Wu, H Zheng, W Zhou, M Wu, M Chai, J Fan, C Huang, ...
arXiv preprint arXiv:2407.06153, 2024
142024
Open the Pandora's Box of LLMs: Jailbreaking LLMs through Representation Engineering
T Li, S Dou, W Liu, M Wu, C Lv, X Zheng, X Huang
arXiv preprint arXiv:2401.06824, 2024
142024
Training large language models for reasoning through reverse curriculum reinforcement learning
Z Xi, W Chen, B Hong, S Jin, R Zheng, W He, Y Ding, S Liu, X Guo, ...
arXiv preprint arXiv:2402.05808, 2024
122024
MouSi: Poly-Visual-Expert Vision-Language Models
X Fan, T Ji, C Jiang, S Li, S Jin, S Song, J Wang, B Hong, L Chen, ...
arXiv preprint arXiv:2401.17221, 2024
122024
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–20