Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents | R Wang, H Li, X Han, Y Zhang, T Baldwin | arXiv preprint arXiv:2402.11651, 2024 | 15 | 2024 |
Against The Achilles' Heel: A Survey on Red Teaming for Generative Models | L Lin, H Mu, Z Zhai, M Wang, Y Wang, R Wang, J Gao, Y Zhang, W Che, ... | arXiv preprint arXiv:2404.00629, 2024 | 13 | 2024 |
Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability | H Li, X Han, Z Zhai, H Mu, H Wang, Z Zhang, Y Geng, S Lin, R Wang, ... | arXiv preprint arXiv:2412.18551, 2024 | 3 | 2024 |
LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch | Z Liu, B Tan, H Wang, W Neiswanger, T Tao, H Li, F Koto, Y Wang, S Sun, ... | arXiv preprint arXiv:2501.07124, 2025 | 1* | 2025 |
ToolGen: Unified Tool Retrieval and Calling via Generation | R Wang, X Han, L Ji, S Wang, T Baldwin, H Li | arXiv preprint arXiv:2410.03439, 2024 | 1 | 2024 |
DialogueGLP: Global-Local Modeling with Prompt-Based Knowledge Enhancement for Emotion Inference in Conversation | R Wang, S Feng | Findings of the Association for Computational Linguistics: EACL 2023, 2075-2082, 2023 | 1* | 2023 |
Explore the Reasoning Capability of LLMs in the Chess Testbed | S Wang, L Ji, R Wang, W Zhao, H Liu, Y Hou, YN Wu | arXiv preprint arXiv:2411.06655, 2024 | | 2024 |
Demystifying Instruction Mixing for Fine-tuning Large Language Models | R Wang, H Li, M Wu, Y Wang, X Han, C Zhang, T Baldwin | Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024 | | 2024 |