CodeEditorBench: Evaluating Code Editing Capability of Large Language Models. J Guo, Z Li, X Liu, K Ma, T Zheng, Z Yu, D Pan, Y Li, R Liu, Y Wang, S Guo, ... arXiv preprint arXiv:2404.03543, 2024 | 8 | 2024 |
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions. Z Li, Q Zang, D Ma, J Guo, T Zheng, M Liu, X Niu, Y Wang, J Yang, J Liu, ... arXiv preprint arXiv:2410.20424, 2024 | 6 | 2024 |