CriticBench: Benchmarking LLMs for Critique-Correct Reasoning Z Lin, Z Gou, T Liang, R Luo, H Liu, Y Yang ACL 2024 Findings, 2024 | 23 | 2024 |
Chain of history: Learning and forecasting with LLMs for temporal knowledge graph completion R Luo, T Gu, H Li, J Li, Z Lin, J Li, Y Yang arXiv preprint arXiv:2401.06072, 2024 | 17 | 2024 |
FFAA: Multimodal large language model based explainable open-world face forgery analysis assistant Z Huang, B Xia, Z Lin, Z Mou, W Yang arXiv preprint arXiv:2408.10072, 2024 | 6 | 2024 |
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Z Lin, T Liang, J Xu, X Wang, R Luo, C Shi, S Li, Y Yang, Z Tu arXiv preprint arXiv:2411.19943, 2024 | 3 | 2024 |
PTD-SQL: Partitioning and targeted drilling with LLMs in text-to-SQL R Luo, L Wang, B Lin, Z Lin, Y Yang EMNLP 2024, 2024 | 3 | 2024 |
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics R Luo, Z Zheng, Y Wang, Y Yu, X Ni, Z Lin, J Zeng, Y Yang arXiv preprint arXiv:2501.04686, 2025 | | 2025 |