Glitch tokens in large language models: Categorization taxonomy and effective detection Y Li, Y Liu, G Deng, Y Zhang, W Song, L Shi, K Wang, Y Li, Y Liu, H Wang Proceedings of the ACM on Software Engineering 1 (FSE), 2075-2097, 2024 | 18 | 2024 |
Lockpicking LLMs: A Logit-Based Jailbreak Using Token-level Manipulation Y Li, Y Liu, Y Li, L Shi, G Deng, S Chen, K Wang arXiv preprint arXiv:2405.13068, 2024 | 8 | 2024 |
GlitchProber: Advancing effective detection and mitigation of glitch tokens in large language models Z Zhang, W Bai, Y Li, MH Meng, K Wang, L Shi, L Li, J Wang, H Wang Proceedings of the 39th IEEE/ACM International Conference on Automated …, 2024 | 6 | 2024 |
EpiCarousel: memory- and time-efficient identification of metacells for atlas-level single-cell chromatin accessibility data S Li, Y Li, Y Sun, Y Li, X Chen, S Tang, S Chen Bioinformatics 40 (4), btae191, 2024 | 2 | 2024 |
Model-Editing-Based Jailbreak against Safety-aligned Large Language Models Y Li, Z Zhang, K Wang, L Shi, H Wang arXiv preprint arXiv:2412.08201, 2024 | | 2024 |