Prompting frameworks for large language models: A survey X Liu, J Wang, J Sun, X Yuan, G Dong, P Di, W Wang, D Wang arXiv preprint arXiv:2311.12785, 2023 | 33 | 2023 |
S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models X Yuan, J Li, D Wang, Y Chen, X Mao, L Huang, H Xue, W Wang, K Ren, ... arXiv preprint arXiv:2405.14191, 2024 | 16 | 2024 |