Jailbreak attacks and defenses against large language models: A survey S Yi, Y Liu, Z Sun, T Cong, X He, J Song, K Xu, Q Li arXiv preprint arXiv:2407.04295, 2024 | 54 | 2024 |
Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media Z Sun, Z Zhang, X Shen, Z Zhang, Y Liu, M Backes, Y Zhang, X He arXiv preprint arXiv:2412.18148, 2024 | 1 | 2024 |
On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing Y Liu, Z Zhong, Y Liao, Z Sun, J Zheng, J Wei, Q Gong, F Tong, Y Chen, ... arXiv preprint arXiv:2412.17242, 2024 | 1* | 2024 |
Quantized Delta Weight Is Safety Keeper Y Liu, Z Sun, X He, X Huang arXiv preprint arXiv:2411.19530, 2024 | 1 | 2024 |
The Rising Threat to Emerging AI-Powered Search Engines Z Luo, Z Peng, Y Liu, Z Sun, M Li, J Zheng, X He arXiv preprint arXiv:2502.04951, 2025 | | 2025 |
SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning H Zhang, Y Liu, X He, J Wu, T Cong, X Huang arXiv preprint arXiv:2502.03801, 2025 | | 2025 |
PEFTGuard: Detecting Backdoor Attacks Against Parameter-Efficient Fine-Tuning Z Sun, T Cong, Y Liu, C Lin, X He, R Chen, X Han, X Huang arXiv preprint arXiv:2411.17453, 2024 | | 2024 |
Revealing the Difficulty in Jailbreak Defense on Language Models for Metaverse Z Kang, Y Liu, J Zheng, Z Sun Proceedings of the Third International Workshop on Social and Metaverse …, 2024 | | 2024 |
AdSpectorX: A Multimodal Expert Spector for Covert Advertising Detection on Chinese Social Media Z Zhang, Y Han, Z Zhang, Y Liu, J Zheng, Z Sun Proceedings of the Third International Workshop on Social and Metaverse …, 2024 | | 2024 |
GENNDTI: Drug-target interaction prediction using graph neural network enhanced by router nodes B Yang, Y Liu, J Wu, F Bai, M Zheng, J Zheng IEEE Journal of Biomedical and Health Informatics (Highlights), 2024 | | 2024 |