A systematic assessment of OpenAI o1-preview for higher order thinking in education

E Latif, Y Zhou, S Guo, Y Gao, L Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
As artificial intelligence (AI) continues to advance, it demonstrates capabilities comparable
to human intelligence, with significant potential to transform education and workforce …

MetaphorPrompt-An Analogical Reasoning Approach for Extracting Causal Links from Biological Text

P Patel, YC Chiu, Y Hunag, J Zhang - Proceedings of the 15th ACM …, 2024 - dl.acm.org
In recent years, Large Language Models (LLMs) have revolutionized Natural Language
Processing (NLP), offering significant improvements for extracting complex information from …

Benchmarking LLMs for Real-World Applications: From Numerical Metrics to Contextual and Qualitative Evaluation

HI Ashqar - Authorea Preprints, 2025 - techrxiv.org
The evaluation of large language models (LLMs) has traditionally relied on static
benchmarks that prioritize performance metrics such as accuracy, precision, BLEU and …