Követés
Jiahao Yu
Jiahao Yu
E-mail megerősítve itt: northwestern.edu - Kezdőlap
Cím
Hivatkozott rá
Hivatkozott rá
Év
Gptfuzzer: Red teaming large language models with auto-generated jailbreak prompts
J Yu, X Lin, Z Yu, X Xing
arXiv preprint arXiv:2309.10253, 2023
2442023
Voiceprint mimicry attack towards speaker verification system in smart home
L Zhang, Y Meng, J Yu, C Xiang, B Falk, H Zhu
IEEE INFOCOM 2020-IEEE conference on computer communications, 377-386, 2020
572020
Assessing prompt injection risks in 200+ custom gpts
J Yu, Y Wu, D Shu, M Jin, X Xing
ICLR 2024 Workshop on Secure and Trustworthy Large Language Models, 2023
452023
Speedup robust graph structure learning with low-rank information
H Xu, L Xiang, J Yu, A Cao, X Wang
Proceedings of the 30th ACM International Conference on Information …, 2021
272021
Enhancing jailbreak attack against large language models through silent tokens
J Yu, H Luo, JYC Hu, W Guo, H Liu, X Xing
arXiv preprint arXiv:2405.20653, 2024
152024
Research on Application of Artificial Intelligence Technology in Electrical Automation Control
C Jiang, X Xiong, T Zhu, J Cao, J Yu
Journal of Physics: Conference Series 1601 (5), 052006, 2020
112020
LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks
J Yu, X Lin, Z Yu, X Xing
33nd USENIX Security Symposium (USENIX Security 24), 2024
102024
Matrix gaussian mechanisms for differentially-private learning
J Yang, L Xiang, J Yu, X Wang, B Guo, Z Li, B Li
IEEE Transactions on Mobile Computing 22 (2), 1036-1048, 2021
102021
Statemask: Explaining deep reinforcement learning through state mask
Z Cheng, X Wu, J Yu, W Sun, W Guo, X Xing
Advances in Neural Information Processing Systems 36, 2024
82024
Decoupled alignment for robust plug-and-play adaptation
H Luo, J Yu, W Zhang, J Li, JYC Hu, X Xing, H Liu
arXiv preprint arXiv:2406.01514, 2024
72024
{AIRS}: Explanation for Deep Reinforcement Learning based Security Applications
J Yu, W Guo, Q Qin, G Wang, T Wang, X Xing
32nd USENIX Security Symposium (USENIX Security 23), 7375-7392, 2023
72023
RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation
Z Cheng, X Wu, J Yu, S Yang, G Wang, X Xing
Proceedings of the 41st International Conference on Machine Learning, 2024
32024
Promptfuzz: Harnessing fuzzing techniques for robust testing of prompt injection in llms
J Yu, Y Shao, H Miao, J Shi, X Xing
arXiv preprint arXiv:2409.14729, 2024
22024
Soft-Label Integration for Robust Toxicity Classification
Z Cheng, X Wu, J Yu, S Han, XQ Cai, X Xing
38th Conference on Neural Information Processing Systems (NeurIPS 2024)., 2024
12024
BlockFound: Customized blockchain foundation model for anomaly detection
J Yu, X Wu, H Liu, W Guo, X Xing
arXiv preprint arXiv:2410.04039, 2024
12024
BandFuzz: A Practical Framework for Collaborative Fuzzing with Reinforcement Learning
W Shi, H Li, J Yu, W Guo, X Xing
Proceedings of the 17th ACM/IEEE International Workshop on Search-Based and …, 2024
12024
UTF: Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification
J Cai, J Yu, Y Shao, Y Wu, X Xing
arXiv preprint arXiv:2410.12318, 2024
2024
A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.
Cikkek 1–17