Knowledge Conflicts for LLMs: A Survey R Xu, Z Qi, C Wang, H Wang, Y Zhang, W Xu
EMNLP 2024, 2024
52 2024 The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation R Xu, BS Lin, S Yang, T Zhang, W Shi, T Zhang, Z Fang, W Xu, H Qiu
ACL 2024, Outstanding Paper Award 🏆, 2023
34 2023 How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States Z Zhou, H Yu, X Zhang, R Xu, F Huang, Y Li
EMNLP 2024 Findings, 2024
16 2024 MISO: legacy-compatible privacy-preserving single sign-on using trusted execution environments R Xu, S Yang, F Zhang, Z Fang
EuroS&P 2023, 352-372, 2023
10 2023 MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models Z Zeng, Y Liu, Y Wan, J Li, P Chen, J Dai, Y Yao, R Xu, Z Qi, W Zhao, ...
NeurIPS 2024, 2024
7 2024 Preemptive Answer" Attacks" on Chain-of-Thought Reasoning R Xu, Z Qi, W Xu
ACL 2024 Findings, 2024
6 2024 Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias R Xu, Z Zhou, T Zhang, Z Qi, S Yao, K Xu, W Xu, H Qiu
EMNLP 2024, 2024
5 2024 LSync: A universal event-synchronizing solution for live streaming Y Xu, F Dang, R Xu, X Chen, Y Liu
IEEE INFOCOM 2022-IEEE Conference on Computer Communications, 2188-2197, 2022
5 2022 Debateqa: Evaluating question answering on debatable knowledge R Xu, X Qi, Z Qi, W Xu, Z Guo
arXiv preprint arXiv:2408.01419, 2024
3 2024 Course-Correction: Safety Alignment Using Synthetic Preferences R Xu, Y Cai, Z Zhou, R Gu, H Weng, Y Liu, T Zhang, W Xu, H Qiu
EMNLP 2024, 2024
3 2024 MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs Z Zeng, Y Liu, Y Wan, J Li, P Chen, J Dai, Y Yao, R Xu, Z Qi, W Zhao, ...
The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024
3 2024 Liferec: A mobile app for lifelog recording and ubiquitous recommendation J Li, H Zhang, Z He, R Xu, P Wu, M Zhang, Y Liu, S Ma
Proceedings of the 2022 Conference on Human Information Interaction and …, 2022
3 2022 : A Universal Timeline-Synchronizing Solution for Live Streaming F Dang, Y Xu, R Xu, X Chen, Y Liu
IEEE/ACM Transactions on Networking, 2024
1 2024 Exploring Chinese Humor Generation: A Study on Two-Part Allegorical Sayings R Xu
IJCNN 2024, 2024
1 2024 Rules Created by Symbolic Systems Cannot Constrain a Learning System SW Lin, R Xu, X Li, W Xu
2025 Long RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall Z Qi, R Xu, Z Guo, C Wang, H Zhang, W Xu
arXiv preprint arXiv:2410.23000, 2024
2024 \textsc {Long RAG}: Evaluating Long-Context\& Long-Form Retrieval-Augmented Generation with Key Point Recall Z Qi, R Xu, Z Guo, C Wang, H Zhang, W Xu
EMNLP 2024 Findings, 2024
2024 Sing it, Narrate it: Quality Musical Lyrics Translation Z Ye, J Li, R Xu
EMNLP 2024 Findings, 2024
2024 On the Role of Attention Heads in Large Language Model Safety Z Zhou, H Yu, X Zhang, R Xu, F Huang, K Wang, Y Liu, J Fang, Y Li
arXiv preprint arXiv:2410.13708, 2024
2024 Tempo: Confidentiality Preservation in Cloud-Based Neural Network Training R Xu, Z Fang
IJCNN 2024, 2024
2024