Jiahao Yu

Hivatkozott rá

	Összes	2020 óta
Hivatkozások	449	447
h-index	8	8
i10-index	8	7

320

160

240

2020202120222023202420253 11 21 51 312 47

Nyilvános hozzáférés

Összes megtekintése

3 cikk

1 cikk

elérhető

nem érhető el

Finanszírozási megbízások alapján

Követés

Jiahao Yu

Northwestern University

E-mail megerősítve itt: northwestern.edu - Kezdőlap

reinforcement learning security


Cím Rendezés hivatkozások szerint Rendezés év szerint Rendezés cím szerint	Hivatkozott rá Hivatkozott rá	Év
Gptfuzzer: Red teaming large language models with auto-generated jailbreak prompts J Yu, X Lin, Z Yu, X Xing arXiv preprint arXiv:2309.10253, 2023	244	2023
Voiceprint mimicry attack towards speaker verification system in smart home L Zhang, Y Meng, J Yu, C Xiang, B Falk, H Zhu IEEE INFOCOM 2020-IEEE conference on computer communications, 377-386, 2020	57	2020
Assessing prompt injection risks in 200+ custom gpts J Yu, Y Wu, D Shu, M Jin, X Xing ICLR 2024 Workshop on Secure and Trustworthy Large Language Models, 2023	45	2023
Speedup robust graph structure learning with low-rank information H Xu, L Xiang, J Yu, A Cao, X Wang Proceedings of the 30th ACM International Conference on Information …, 2021	27	2021
Enhancing jailbreak attack against large language models through silent tokens J Yu, H Luo, JYC Hu, W Guo, H Liu, X Xing arXiv preprint arXiv:2405.20653, 2024	15	2024
Research on Application of Artificial Intelligence Technology in Electrical Automation Control C Jiang, X Xiong, T Zhu, J Cao, J Yu Journal of Physics: Conference Series 1601 (5), 052006, 2020	11	2020
LLM-Fuzzer: Scaling Assessment of Large Language Model Jailbreaks J Yu, X Lin, Z Yu, X Xing 33nd USENIX Security Symposium (USENIX Security 24), 2024	10	2024
Matrix gaussian mechanisms for differentially-private learning J Yang, L Xiang, J Yu, X Wang, B Guo, Z Li, B Li IEEE Transactions on Mobile Computing 22 (2), 1036-1048, 2021	10	2021
Statemask: Explaining deep reinforcement learning through state mask Z Cheng, X Wu, J Yu, W Sun, W Guo, X Xing Advances in Neural Information Processing Systems 36, 2024	8	2024
Decoupled alignment for robust plug-and-play adaptation H Luo, J Yu, W Zhang, J Li, JYC Hu, X Xing, H Liu arXiv preprint arXiv:2406.01514, 2024	7	2024
{AIRS}: Explanation for Deep Reinforcement Learning based Security Applications J Yu, W Guo, Q Qin, G Wang, T Wang, X Xing 32nd USENIX Security Symposium (USENIX Security 23), 7375-7392, 2023	7	2023
RICE: Breaking Through the Training Bottlenecks of Reinforcement Learning with Explanation Z Cheng, X Wu, J Yu, S Yang, G Wang, X Xing Proceedings of the 41st International Conference on Machine Learning, 2024	3	2024
Promptfuzz: Harnessing fuzzing techniques for robust testing of prompt injection in llms J Yu, Y Shao, H Miao, J Shi, X Xing arXiv preprint arXiv:2409.14729, 2024	2	2024
Soft-Label Integration for Robust Toxicity Classification Z Cheng, X Wu, J Yu, S Han, XQ Cai, X Xing 38th Conference on Neural Information Processing Systems (NeurIPS 2024)., 2024	1	2024
BlockFound: Customized blockchain foundation model for anomaly detection J Yu, X Wu, H Liu, W Guo, X Xing arXiv preprint arXiv:2410.04039, 2024	1	2024
BandFuzz: A Practical Framework for Collaborative Fuzzing with Reinforcement Learning W Shi, H Li, J Yu, W Guo, X Xing Proceedings of the 17th ACM/IEEE International Workshop on Search-Based and …, 2024	1	2024
UTF: Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification J Cai, J Yu, Y Shao, Y Wu, X Xing arXiv preprint arXiv:2410.12318, 2024		2024

A rendszer jelenleg nem tudja elvégezni a műveletet. Próbálkozzon újra később.

Cikkek 1–17

Hivatkozások évente

Ismétlődő hivatkozások

Összevont hivatkozások

Társszerzők hozzáadásaTársszerzők

Követés

Hivatkozott rá