Folgen
Jiale Cheng
Jiale Cheng
phd student in Tsinghua University
Bestätigte E-Mail-Adresse bei mails.tsinghua.edu.cn
Titel
Zitiert von
Zitiert von
Jahr
Chatglm: A family of large language models from glm-130b to glm-4 all tools
T GLM, A Zeng, B Xu, B Wang, C Zhang, D Yin, D Zhang, D Rojas, G Feng, ...
arXiv preprint arXiv:2406.12793, 2024
2382024
Safety assessment of chinese large language models
H Sun, Z Zhang, J Deng, J Cheng, M Huang
arXiv preprint arXiv:2304.10436, 2023
1192023
On the safety of conversational models: Taxonomy, dataset, and benchmark
H Sun, G Xu, J Deng, J Cheng, C Zheng, H Zhou, N Peng, X Zhu, ...
arXiv preprint arXiv:2110.08466, 2021
802021
Black-box prompt optimization: Aligning large language models without model training
J Cheng, X Liu, K Zheng, P Ke, H Wang, Y Dong, J Tang, M Huang
arXiv preprint arXiv:2311.04155, 2023
562023
Alignbench: Benchmarking chinese alignment of large language models
X Liu, X Lei, S Wang, Y Huang, Z Feng, B Wen, J Cheng, P Ke, Y Xu, ...
arXiv preprint arXiv:2311.18743, 2023
502023
CritiqueLLM: Towards an informative critique generation model for evaluation of large language model generation
P Ke, B Wen, A Feng, X Liu, X Lei, J Cheng, S Wang, A Zeng, Y Dong, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
45*2024
Towards Safer Generative Language Models: A Survey on Safety Risks, Evaluations, and Improvements
J Deng, J Cheng, H Sun, Z Zhang, M Huang
arXiv preprint arXiv:2302.09270, 2023
43*2023
Pal: Persona-augmented emotional support conversation generation
J Cheng, S Sabour, H Sun, Z Chen, M Huang
arXiv preprint arXiv:2212.09235, 2022
252022
Constructing highly inductive contexts for dialogue safety through controllable reverse generation
Z Zhang, J Cheng, H Sun, J Deng, F Mi, Y Wang, L Shang, M Huang
arXiv preprint arXiv:2212.01810, 2022
112022
Autodetect: Towards a unified framework for automated weakness detection in large language models
J Cheng, Y Lu, X Gu, P Ke, X Liu, Y Dong, H Wang, J Tang, M Huang
arXiv preprint arXiv:2406.16714, 2024
52024
InstructSafety: A Unified Framework for Building Multidimensional and Explainable Safety Detector through Instruction Tuning
Z Zhang, J Cheng, H Sun, J Deng, M Huang
Findings of the Association for Computational Linguistics: EMNLP 2023, 10421 …, 2023
52023
Logicgame: Benchmarking rule-based reasoning abilities of large language models
J Gui, Y Liu, J Cheng, X Gu, X Liu, H Wang, Y Dong, J Tang, M Huang
arXiv preprint arXiv:2408.15778, 2024
22024
Visionreward: Fine-grained multi-dimensional human preference learning for image and video generation
J Xu, Y Huang, J Cheng, Y Yang, J Xu, Y Wang, W Duan, S Yang, Q Jin, ...
arXiv preprint arXiv:2412.21059, 2024
12024
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
J Cheng, X Liu, C Wang, X Gu, Y Lu, D Zhang, Y Dong, J Tang, H Wang, ...
arXiv preprint arXiv:2412.11605, 2024
12024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–14