Survey of vulnerabilities in large language models revealed by adversarial attacks E Shayegani, MAA Mamun, Y Fu, P Zaree, Y Dong, N Abu-Ghazaleh arXiv preprint arXiv:2310.10844, 2023 | 142 | 2023 |
Watermarking conditional text generation for ai detection: Unveiling challenges and a semantic-aware watermark remedy Y Fu, D Xiong, Y Dong AAAI 2024, 2023 | 24 | 2023 |
An efficient policy evaluation engine for XACML policy management F Deng, Z Yu, W Liu, X Luo, Y Fu, B Qiang, C Xu, Z Li Information Sciences 547, 1105-1121, 2021 | 8 | 2021 |
Survey of vulnerabilities in large language models revealed by adversarial attacks, 2023 E Shayegani, MAA Mamun, Y Fu, P Zaree, Y Dong, N Abu-Ghazaleh URL https://arxiv. org/abs/2310.10844, 0 | 5 | |
Not all heads matter: A head-level KV cache compression method with integrated retrieval and reasoning Y Fu, Z Cai, A Asi, W Xiong, Y Dong, W Xiao ICLR2025, 2024 | 3 | 2024 |
Trawl: Tensor reduced and approximated weights for large language models Y Luo, H Patel, Y Fu, D Ahn, J Chen, Y Dong, EE Papalexakis arXiv preprint arXiv:2406.17261, 2024 | 2 | 2024 |
Cross-Task Defense: Instruction-Tuning LLMs for Content Safety Y Fu, W Xiao, J Chen, J Li, E Papalexakis, A Chien, Y Dong NAACL2024 TrustNLP Workshop, 2024 | 2 | 2024 |
Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack Y Fu, Y Li, W Xiao, C Liu, Y Dong ACL2024, 2023 | 2 | 2023 |
Inverse Reinforcement Learning for Text Summarization Y Fu, D Xiong, Y Dong Findings of EMNLP 2023, 2023 | 2 | 2023 |
MetaXCR: Reinforcement-Based Meta-Transfer Learning for Cross-Lingual Commonsense Reasoning J He, Y Fu Transfer Learning for Natural Language Processing Workshop, 74-87, 2023 | 2 | 2023 |
Vulnerabilities of Large Language Models to Adversarial Attacks Y Fu, E Shayegan, MM Al Abdullah, P Zaree, N Abu-Ghazaleh, Y Dong Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | 1 | 2024 |
Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning Y Fu, J He, Y Yang, Q Liu, D Xiong arXiv preprint arXiv:2409.19075, 2024 | | 2024 |