The rise and potential of large language model based agents: A survey Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ... Science China Information Sciences 68 (2), 121101, 2025 | 735 | 2025 |
Secrets of rlhf in large language models part i: Ppo R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ... arXiv preprint arXiv:2307.04964, 2023 | 134* | 2023 |
Textflint: Unified multilingual robustness evaluation toolkit for natural language processing X Wang, Q Liu, T Gui, Q Zhang, Y Zou, X Zhou, J Ye, Y Zhang, R Zheng, ... Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 129* | 2021 |
Flooding-X: Improving BERT’s resistance to adversarial attacks via loss-restricted fine-tuning Q Liu, R Zheng, B Rong, J Liu, Z Liu, Z Cheng, L Qiao, T Gui, Q Zhang, ... Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 34 | 2022 |
MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding F Wang, X Fu, JY Huang, Z Li, Q Liu, X Liu, MD Ma, N Xu, W Zhou, ... arXiv preprint arXiv:2406.09411, 2024 | 33 | 2024 |
From shortcuts to triggers: Backdoor defense with denoised poe Q Liu, F Wang, C Xiao, M Chen Proceedings of the 2024 Conference of the North American Chapter of the …, 2024 | 18 | 2024 |
Test-time backdoor mitigation for black-box large language models with defensive demonstrations W Mo, J Xu, Q Liu, J Wang, J Yan, C Xiao, M Chen arXiv preprint arXiv:2311.09763, 2023 | 15 | 2023 |
Llms assist nlp researchers: Critique paper (meta-) reviewing J Du, Y Wang, W Zhao, Z Deng, S Liu, R Lou, HP Zou, PN Venkit, ... Proceedings of the 2024 Conference on Empirical Methods in Natural Language …, 2024 | 12 | 2024 |
Monotonic paraphrasing improves generalization of language model prompting Q Liu, F Wang, N Xu, T Yan, T Meng, M Chen Findings of the Association for Computational Linguistics: EMNLP 2024, 9861–9877, 2024 | 5 | 2024 |
Overview of argumentative text understanding for ai debater challenge J Yuan, L Cheng, R He, Y Li, L Bing, Z Wei, Q Liu, C Shen, S Zhang, ... Natural Language Processing and Chinese Computing: 10th CCF International …, 2021 | 4 | 2021 |
Securing Multi-turn Conversational Language Models From Distributed Backdoor Attacks T Tong, Q Liu, J Xu, M Chen Findings of the Association for Computational Linguistics: EMNLP 2024, 12833 …, 2024 | 3* | 2024 |
Unraveling cross-modality knowledge conflicts in large vision-language models T Zhu, Q Liu, F Wang, Z Tu, M Chen arXiv preprint arXiv:2410.03659, 2024 | 3 | 2024 |
Two heads are better than one: Nested poe for robust defense against multi-backdoors V Graf, Q Liu, M Chen Proceedings of the 2024 Conference of the North American Chapter of the …, 2024 | 3 | 2024 |
Detecting Adversarial Samples through Sharpness of Loss Landscape R Zheng, S Dou, Y Zhou, Q Liu, T Gui, Q Zhang, Z Wei, XJ Huang, ... Findings of the Association for Computational Linguistics: ACL 2023, 11282-11298, 2023 | 3 | 2023 |
Characterizing the impacts of instances on robustness R Zheng, Z Xi, Q Liu, W Lai, T Gui, Q Zhang, XJ Huang, J Ma, Y Shan, ... Findings of the Association for Computational Linguistics: ACL 2023, 2314-2332, 2023 | 3 | 2023 |
Plugat: A plug and play module to defend against textual adversarial attack R Zheng, R Bao, Q Liu, T Gui, Q Zhang, XJ Huang, R Xie, W Wu Proceedings of the 29th International Conference on Computational …, 2022 | 3 | 2022 |
Unraveling and mitigating safety alignment degradation of vision-language models Q Liu, C Shang, L Liu, N Pappas, J Ma, NA John, S Doss, L Marquez, ... arXiv preprint arXiv:2410.09047, 2024 | 2 | 2024 |
Familiarity-aware evidence compression for retrieval augmented generation D Jung, Q Liu, T Huang, B Zhou, M Chen arXiv preprint arXiv:2409.12468, 2024 | 2 | 2024 |
SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment Q Liu, F Wang, C Xiao, M Chen arXiv preprint arXiv:2410.14676, 2024 | 1 | 2024 |
Mitigating backdoor threats to large language models: Advancement and challenges Q Liu, W Mo, T Tong, J Xu, F Wang, C Xiao, M Chen 2024 60th Annual Allerton Conference on Communication, Control, and …, 2024 | 1 | 2024 |