Gptfuzzer: Red teaming large language models with auto-generated jailbreak prompts J Yu, X Lin, Z Yu, X Xing arXiv preprint arXiv:2309.10253, 2023 | 244 | 2023 |
Voiceprint mimicry attack towards speaker verification system in smart home L Zhang, Y Meng, J Yu, C Xiang, B Falk, H Zhu IEEE INFOCOM 2020-IEEE conference on computer communications, 377-386, 2020 | 57 | 2020 |
Assessing prompt injection risks in 200+ custom gpts J Yu, Y Wu, D Shu, M Jin, X Xing ICLR 2024 Workshop on Secure and Trustworthy Large Language Models, 2023 | 45 | 2023 |
Speedup robust graph structure learning with low-rank information H Xu, L Xiang, J Yu, A Cao, X Wang Proceedings of the 30th ACM International Conference on Information …, 2021 | 27 | 2021 |
Enhancing jailbreak attack against large language models through silent tokens J Yu, H Luo, JYC Hu, W Guo, H Liu, X Xing arXiv preprint arXiv:2405.20653, 2024 | 15 | 2024 |
Research on Application of Artificial Intelligence Technology in Electrical Automation Control C Jiang, X Xiong, T Zhu, J Cao, J Yu Journal of Physics: Conference Series 1601 (5), 052006, 2020 | 11 | 2020 |
LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks J Yu, X Lin, Z Yu, X Xing 33nd USENIX Security Symposium (USENIX Security 24), 2024 | 10 | 2024 |
Matrix gaussian mechanisms for differentially-private learning J Yang, L Xiang, J Yu, X Wang, B Guo, Z Li, B Li IEEE Transactions on Mobile Computing 22 (2), 1036-1048, 2021 | 10 | 2021 |
Statemask: Explaining deep reinforcement learning through state mask Z Cheng, X Wu, J Yu, W Sun, W Guo, X Xing Advances in Neural Information Processing Systems 36, 2024 | 8 | 2024 |
Decoupled alignment for robust plug-and-play adaptation H Luo, J Yu, W Zhang, J Li, JYC Hu, X Xing, H Liu arXiv preprint arXiv:2406.01514, 2024 | 7 | 2024 |
{AIRS}: Explanation for Deep Reinforcement Learning based Security Applications J Yu, W Guo, Q Qin, G Wang, T Wang, X Xing 32nd USENIX Security Symposium (USENIX Security 23), 7375-7392, 2023 | 7 | 2023 |
RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation Z Cheng, X Wu, J Yu, S Yang, G Wang, X Xing Proceedings of the 41st International Conference on Machine Learning, 2024 | 3 | 2024 |
Promptfuzz: Harnessing fuzzing techniques for robust testing of prompt injection in llms J Yu, Y Shao, H Miao, J Shi, X Xing arXiv preprint arXiv:2409.14729, 2024 | 2 | 2024 |
Soft-Label Integration for Robust Toxicity Classification Z Cheng, X Wu, J Yu, S Han, XQ Cai, X Xing 38th Conference on Neural Information Processing Systems (NeurIPS 2024)., 2024 | 1 | 2024 |
BlockFound: Customized blockchain foundation model for anomaly detection J Yu, X Wu, H Liu, W Guo, X Xing arXiv preprint arXiv:2410.04039, 2024 | 1 | 2024 |
BandFuzz: A Practical Framework for Collaborative Fuzzing with Reinforcement Learning W Shi, H Li, J Yu, W Guo, X Xing Proceedings of the 17th ACM/IEEE International Workshop on Search-Based and …, 2024 | 1 | 2024 |
UTF: Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification J Cai, J Yu, Y Shao, Y Wu, X Xing arXiv preprint arXiv:2410.12318, 2024 | | 2024 |