Follow
Qin Liu
Title
Cited by
Cited by
Year
The rise and potential of large language model based agents: A survey
Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ...
Science China Information Sciences 68 (2), 121101, 2025
7352025
Secrets of rlhf in large language models part i: Ppo
R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ...
arXiv preprint arXiv:2307.04964, 2023
134*2023
Textflint: Unified multilingual robustness evaluation toolkit for natural language processing
X Wang, Q Liu, T Gui, Q Zhang, Y Zou, X Zhou, J Ye, Y Zhang, R Zheng, ...
Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021
129*2021
Flooding-X: Improving BERT’s resistance to adversarial attacks via loss-restricted fine-tuning
Q Liu, R Zheng, B Rong, J Liu, Z Liu, Z Cheng, L Qiao, T Gui, Q Zhang, ...
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
342022
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
F Wang, X Fu, JY Huang, Z Li, Q Liu, X Liu, MD Ma, N Xu, W Zhou, ...
arXiv preprint arXiv:2406.09411, 2024
332024
From shortcuts to triggers: Backdoor defense with denoised poe
Q Liu, F Wang, C Xiao, M Chen
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024
182024
Test-time backdoor mitigation for black-box large language models with defensive demonstrations
W Mo, J Xu, Q Liu, J Wang, J Yan, C Xiao, M Chen
arXiv preprint arXiv:2311.09763, 2023
152023
Llms assist nlp researchers: Critique paper (meta-) reviewing
J Du, Y Wang, W Zhao, Z Deng, S Liu, R Lou, HP Zou, PN Venkit, ...
Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024
122024
Monotonic paraphrasing improves generalization of language model prompting
Q Liu, F Wang, N Xu, T Yan, T Meng, M Chen
Findings of the Association for Computational Linguistics: EMNLP 2024, 9861–9877, 2024
52024
Overview of argumentative text understanding for ai debater challenge
J Yuan, L Cheng, R He, Y Li, L Bing, Z Wei, Q Liu, C Shen, S Zhang, ...
Natural Language Processing and Chinese Computing: 10th CCF International …, 2021
42021
Securing Multi-turn Conversational Language Models From Distributed Backdoor Attacks
T Tong, Q Liu, J Xu, M Chen
Findings of the Association for Computational Linguistics: EMNLP 2024, 12833 …, 2024
3*2024
Unraveling cross-modality knowledge conflicts in large vision-language models
T Zhu, Q Liu, F Wang, Z Tu, M Chen
arXiv preprint arXiv:2410.03659, 2024
32024
Two heads are better than one: Nested poe for robust defense against multi-backdoors
V Graf, Q Liu, M Chen
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024
32024
Detecting Adversarial Samples through Sharpness of Loss Landscape
R Zheng, S Dou, Y Zhou, Q Liu, T Gui, Q Zhang, Z Wei, XJ Huang, ...
Findings of the Association for Computational Linguistics: ACL 2023, 11282-11298, 2023
32023
Characterizing the impacts of instances on robustness
R Zheng, Z Xi, Q Liu, W Lai, T Gui, Q Zhang, XJ Huang, J Ma, Y Shan, ...
Findings of the Association for Computational Linguistics: ACL 2023, 2314-2332, 2023
32023
Plugat: A plug and play module to defend against textual adversarial attack
R Zheng, R Bao, Q Liu, T Gui, Q Zhang, XJ Huang, R Xie, W Wu
Proceedings of the 29th International Conference on Computational …, 2022
32022
Unraveling and mitigating safety alignment degradation of vision-language models
Q Liu, C Shang, L Liu, N Pappas, J Ma, NA John, S Doss, L Marquez, ...
arXiv preprint arXiv:2410.09047, 2024
22024
Familiarity-aware evidence compression for retrieval augmented generation
D Jung, Q Liu, T Huang, B Zhou, M Chen
arXiv preprint arXiv:2409.12468, 2024
22024
SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment
Q Liu, F Wang, C Xiao, M Chen
arXiv preprint arXiv:2410.14676, 2024
12024
Mitigating backdoor threats to large language models: Advancement and challenges
Q Liu, W Mo, T Tong, J Xu, F Wang, C Xiao, M Chen
2024 60th Annual Allerton Conference on Communication, Control, and …, 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20