Shihan Dou

Cited by

	All	Since 2020
Citations	1687	1687
h-index	16	16
i10-index	20	20

Citations per year
Year	2021	2022	2023	2024	2025
Citations	10	40	216	1266	151

Public access: 7 articles available, 0 articles not available (based on funding mandates)

Co-authors

Huang Xuanjing (黄萱菁), Professor of Computer Science, Fudan University (verified email at fudan.edu.cn)
Qi Zhang (张奇), Professor of Computer Science, Fudan University (verified email at fudan.edu.cn)
Tao Gui (桂韬), Fudan University (verified email at fudan.edu.cn)
Hai Jin, Huazhong University of Science and Technology (verified email at hust.edu.cn)
Rui Zheng, Fudan University (verified email at fudan.edu.cn)
Xipeng Qiu (邱锡鹏), Professor of Computer Science, Fudan University (verified email at fudan.edu.cn)
Yueming Wu


Shihan Dou

Fudan University

Verified email at m.fudan.edu.cn

Research interests: Alignment, RLHF, Reward Modeling


Title	Authors	Venue	Cited by	Year
The rise and potential of large language model based agents: A survey	Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ...	Science China Information Sciences 68 (2), 121101, 2025	780	2025
Secrets of RLHF in large language models part I: PPO	R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ...	arXiv preprint arXiv:2307.04964, 2023	134*	2023
VulCNN: An image-inspired scalable vulnerability detection system	Y Wu, D Zou, S Dou, W Yang, D Xu, H Jin	Proceedings of the 44th International Conference on Software Engineering …, 2022	133	2022
LoRAMoE: Alleviating world knowledge forgetting in large language models via MoE-style plugin	S Dou, E Zhou, Y Liu, S Gao, W Shen, L Xiong, Y Zhou, X Wang, Z Xi, ...	Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024	89*	2024
Secrets of RLHF in Large Language Models Part II: Reward Modeling	B Wang, R Zheng, L Chen, Y Liu, S Dou, C Huang, W Shen, S Jin, E Zhou, ...	arXiv preprint arXiv:2401.06080, 2024	86*	2024
SCDetector: Software functional clone detection based on semantic tokens analysis	Y Wu, D Zou, S Dou, S Yang, W Yang, F Cheng, H Liang, H Jin	Proceedings of the 35th IEEE/ACM international conference on automated …, 2020	66	2020
IntDroid: Android malware detection based on API intimacy analysis	D Zou, Y Wu, S Yang, A Chauhan, W Yang, J Zhong, S Dou, H Jin	ACM Transactions on Software Engineering and Methodology (TOSEM) 30 (3), 1-32, 2021	48	2021
CodeChameleon: Personalized encryption framework for jailbreaking large language models	H Lv, X Wang, Y Zhang, C Huang, S Dou, J Ye, T Gui, Q Zhang, X Huang	arXiv preprint arXiv:2402.16717, 2024	32	2024
MINER: Improving out-of-vocabulary named entity recognition from an information theoretic perspective	X Wang, S Dou, L Xiong, Y Zou, Q Zhang, T Gui, L Qiao, Z Cheng, ...	arXiv preprint arXiv:2204.04391, 2022	32	2022
Loose lips sink ships: Mitigating length bias in reinforcement learning from human feedback	W Shen, R Zheng, W Zhan, J Zhao, S Dou, T Gui, Q Zhang, X Huang	arXiv preprint arXiv:2310.05199, 2023	31	2023
Towards understanding the capability of large language models on code clone detection: a survey	S Dou, J Shan, H Jia, W Deng, Z Xi, W He, Y Wu, T Gui, Y Liu, X Huang	arXiv preprint arXiv:2308.01191, 2023	30*	2023
EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models	W Zhou, X Wang, L Xiong, H Xia, Y Gu, M Chai, F Zhu, C Huang, S Dou, ...	arXiv preprint arXiv:2403.12171, 2024	26	2024
StepCoder: Improve code generation with reinforcement learning from compiler feedback	S Dou, Y Liu, H Jia, L Xiong, E Zhou, W Shen, J Shan, C Huang, X Wang, ...	arXiv preprint arXiv:2402.01391, 2024	26*	2024
Obfuscation-resilient Android malware analysis based on contrastive learning	Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin	arXiv preprint arXiv:2107.03799, 2021	20	2021
ToolEyes: Fine-grained evaluation for tool learning capabilities of large language models in real-world scenarios	J Ye, G Li, S Gao, C Huang, Y Wu, S Li, X Fan, S Dou, Q Zhang, T Gui, ...	arXiv preprint arXiv:2401.00741, 2024	17	2024
Contrastive learning for robust Android malware familial classification	Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin	IEEE Transactions on Dependable and Secure Computing, 2022	17	2022
What's Wrong with Your Code Generated by Large Language Models? An Extensive Study	S Dou, H Jia, S Wu, H Zheng, W Zhou, M Wu, M Chai, J Fan, C Huang, ...	arXiv preprint arXiv:2407.06153, 2024	14	2024
Open the Pandora's Box of LLMs: Jailbreaking LLMs through Representation Engineering	T Li, S Dou, W Liu, M Wu, C Lv, X Zheng, X Huang	arXiv preprint arXiv:2401.06824, 2024	14	2024
Training large language models for reasoning through reverse curriculum reinforcement learning	Z Xi, W Chen, B Hong, S Jin, R Zheng, W He, Y Ding, S Liu, X Guo, ...	arXiv preprint arXiv:2402.05808, 2024	12	2024
MouSi: Poly-Visual-Expert Vision-Language Models	X Fan, T Ji, C Jiang, S Li, S Jin, S Song, J Wang, B Hong, L Chen, ...	arXiv preprint arXiv:2401.17221, 2024	12	2024
