" do anything now": Characterizing and evaluating in-the-wild jailbreak prompts on large language models X Shen, Z Chen, M Backes, Y Shen, Y Zhang Proceedings of the 2024 on ACM SIGSAC Conference on Computer and …, 2024 | 444 | 2024 |
In chatgpt we trust? measuring and characterizing the reliability of chatgpt X Shen, Z Chen, M Backes, Y Zhang arXiv preprint arXiv:2304.08979, 2023 | 137 | 2023 |
Mgtbench: Benchmarking machine-generated text detection X He, X Shen, Z Chen, M Backes, Y Zhang Proceedings of the 2024 on ACM SIGSAC Conference on Computer and …, 2024 | 103 | 2024 |
Comprehensive assessment of toxicity in ChatGPT B Zhang, X Shen, WM Si, Z Sha, Z Chen, A Salem, Y Shen, M Backes, ... arXiv preprint arXiv:2311.14685, 2023 | 5 | 2023 |
Medusa Attack: Exploring Security Hazards of {In-App}{QR} Code Scanning X Han, Y Zhang, X Zhang, Z Chen, M Wang, Y Zhang, S Ma, Y Yu, ... 32nd USENIX Security Symposium (USENIX Security 23), 4607-4624, 2023 | 4 | 2023 |
Simulation: demystifying (insecure) cellular network based one-tap authentication services Z Zhou, X Han, Z Chen, Y Nan, J Li, D Gu 2022 52nd Annual IEEE/IFIP International Conference on Dependable Systems …, 2022 | 4 | 2022 |