Is your model really a good math reasoner? Evaluating mathematical reasoning with checklist

Z Zhou, S Liu, M Ning, W Liu, J Wang, DF Wong… - ar…

… a LLMs-Driven System Based on Human-AI Progressive Code Generation Framework to Assist Mathematics Learning
CYE SIT, Y Yin, WK YEUNG… - … Conference on Computers …, 2024 - library.apsce.net
This paper proposed a system interface based on a novel progressive code generation
framework to produce verified programming codes using natural language for mathematics …