MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding | F Wang, X Fu, JY Huang, Z Li, Q Liu, X Liu, MD Ma, N Xu, W Zhou, ... | arXiv preprint arXiv:2406.09411, 2024 | 33 | 2024 |
A causal view of entity bias in (large) language models | F Wang, W Mo, Y Wang, W Zhou, M Chen | arXiv preprint arXiv:2305.14695, 2023 | 29 | 2023 |
Test-time backdoor mitigation for black-box large language models with defensive demonstrations | W Mo, J Xu, Q Liu, J Wang, J Yan, C Xiao, M Chen | arXiv preprint arXiv:2311.09763, 2023 | 16 | 2023 |
Mitigating backdoor threats to large language models: Advancement and challenges | Q Liu, W Mo, T Tong, J Xu, F Wang, C Xiao, M Chen | 2024 60th Annual Allerton Conference on Communication, Control, and …, 2024 | 1 | 2024 |
Rethinking Backdoor Detection Evaluation for Language Models | J Yan, WJ Mo, X Ren, R Jia | arXiv preprint arXiv:2409.00399, 2024 | 1 | 2024 |