Privacy in large language models: Attacks, defenses and future directions

H Li, Y Chen, J Luo, J Wang, H Peng, Y Kang… - arxiv preprint arxiv …, 2023 - arxiv.org
The advancement of large language models (LLMs) has significantly enhanced the ability to
effectively tackle various downstream NLP tasks and unify these tasks into generative …

Federated large language model: A position paper

C Chen, X Feng, J Zhou, J Yin, X Zheng - arxiv e-prints, 2023 - ui.adsabs.harvard.edu
Large scale language models (LLM) have received significant attention and found diverse
applications across various domains, but their development encounters challenges in real …

Generating valid and natural adversarial examples with large language models

Z Wang, W Wang, Q Chen, Q Wang… - … Cooperative Work in …, 2024 - ieeexplore.ieee.org
Deep learning-based natural language processing (NLP) models, particularly pre-trained
language models (PLMs), have been revealed to be vulnerable to adversarial attacks …

Integration of large language models and federated learning

C Chen, X Feng, Y Li, L Lyu, J Zhou, X Zheng, J Yin - Patterns, 2024 - cell.com
As the parameter size of large language models (LLMs) continues to expand, there is an
urgent need to address the scarcity of high-quality data. In response, existing research has …

Multi-granular Adversarial Attacks against Black-box Neural Ranking Models

YA Liu, R Zhang, J Guo, M de Rijke, Y Fan… - Proceedings of the 47th …, 2024 - dl.acm.org
Adversarial ranking attacks have gained increasing attention due to their success in probing
vulnerabilities, and, hence, enhancing the robustness, of neural ranking models …

Breaking agents: Compromising autonomous llm agents through malfunction amplification

B Zhang, Y Tan, Y Shen, A Salem, M Backes… - arxiv preprint arxiv …, 2024 - arxiv.org
Recently, autonomous agents built on large language models (LLMs) have experienced
significant development and are being deployed in real-world applications. These agents …

Exposing the Achilles' heel of textual hate speech classifiers using indistinguishable adversarial examples

S Aggarwal, DK Vishwakarma - Expert Systems with Applications, 2024 - Elsevier
The accessibility of online hate speech has increased significantly, making it crucial for
social-media companies to prioritize efforts to curb its spread. Although deep learning …

Following Clues, Approaching the Truth: Explainable Micro-Video Rumor Detection via Chain-of-Thought Reasoning

R Hong, J Lang, J Xu, Z Cheng, T Zhong… - THE WEB CONFERENCE … - openreview.net
The rapid spread of rumor content on online micro-video platforms poses significant threats
to public health and safety. However, existing Micro-Video Rumor Detection (MVRD) …