Yu Fu
Title | Cited by | Year
Survey of vulnerabilities in large language models revealed by adversarial attacks
E Shayegani, MAA Mamun, Y Fu, P Zaree, Y Dong, N Abu-Ghazaleh
arXiv preprint arXiv:2310.10844, 2023
Cited by: 142 | Year: 2023
Watermarking conditional text generation for ai detection: Unveiling challenges and a semantic-aware watermark remedy
Y Fu, D Xiong, Y Dong
AAAI 2024, 2023
Cited by: 24 | Year: 2023
An efficient policy evaluation engine for XACML policy management
F Deng, Z Yu, W Liu, X Luo, Y Fu, B Qiang, C Xu, Z Li
Information Sciences 547, 1105-1121, 2021
Cited by: 8 | Year: 2021
Survey of vulnerabilities in large language models revealed by adversarial attacks, 2023
E Shayegani, MAA Mamun, Y Fu, P Zaree, Y Dong, N Abu-Ghazaleh
URL https://arxiv.org/abs/2310.10844, 0
Cited by: 5
Not all heads matter: A head-level KV cache compression method with integrated retrieval and reasoning
Y Fu, Z Cai, A Asi, W Xiong, Y Dong, W Xiao
ICLR2025, 2024
Cited by: 3 | Year: 2024
Trawl: Tensor reduced and approximated weights for large language models
Y Luo, H Patel, Y Fu, D Ahn, J Chen, Y Dong, EE Papalexakis
arXiv preprint arXiv:2406.17261, 2024
Cited by: 2 | Year: 2024
Cross-Task Defense: Instruction-Tuning LLMs for Content Safety
Y Fu, W Xiao, J Chen, J Li, E Papalexakis, A Chien, Y Dong
NAACL2024 TrustNLP Workshop, 2024
Cited by: 2 | Year: 2024
Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack
Y Fu, Y Li, W Xiao, C Liu, Y Dong
ACL2024, 2023
Cited by: 2 | Year: 2023
Inverse Reinforcement Learning for Text Summarization
Y Fu, D Xiong, Y Dong
Findings of EMNLP 2023, 2023
Cited by: 2 | Year: 2023
MetaXCR: Reinforcement-Based Meta-Transfer Learning for Cross-Lingual Commonsense Reasoning
J He, Y Fu
Transfer Learning for Natural Language Processing Workshop, 74-87, 2023
Cited by: 2 | Year: 2023
Vulnerabilities of Large Language Models to Adversarial Attacks
Y Fu, E Shayegan, MM Al Abdullah, P Zaree, N Abu-Ghazaleh, Y Dong
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
Cited by: 1 | Year: 2024
Meta-RTL: Reinforcement-Based Meta-Transfer Learning for Low-Resource Commonsense Reasoning
Y Fu, J He, Y Yang, Q Liu, D Xiong
arXiv preprint arXiv:2409.19075, 2024
Year: 2024
Articles 1–12