Large language model for table processing: A survey

W Lu, J Zhang, J Fan, Z Fu, Y Chen, X Du - Frontiers of Computer Science, 2025 - Springer
Tables, typically two-dimensional and structured to store large amounts of data, are
essential in daily activities like database queries, spreadsheet manipulations, Web table …

A survey of table reasoning with large language models

X Zhang, D Wang, L Dou, Q Zhu, W Che - Frontiers of Computer Science, 2025 - Springer
Table reasoning aims to generate inference results based on the user requirement and the
provided table. Enhancing the table reasoning capability of the model can aid in obtaining …

Synthesizing text-to-SQL data from weak and strong LLMs

J Yang, B Hui, M Yang, J Yang, J Lin… - arxiv preprint arxiv …, 2024 - arxiv.org
The capability gap between open-source and closed-source large language models (LLMs)
remains a challenge in text-to-SQL tasks. In this paper, we introduce a synthetic data …

Large language models for tabular data: Progresses and future directions

H Dong, Z Wang - Proceedings of the 47th International ACM SIGIR …, 2024 - dl.acm.org
Tables contain a significant portion of the world's structured information. The ability to
efficiently and accurately understand, process, reason about, analyze, and generate tabular …

Large language models meet nlp: A survey

L Qin, Q Chen, X Feng, Y Wu, Y Zhang, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org
While large language models (LLMs) like ChatGPT have shown impressive capabilities in
Natural Language Processing (NLP) tasks, a systematic investigation of their potential in this …

Promptintern: Saving inference costs by internalizing recurrent prompt during large language model fine-tuning

J Zou, M Zhou, T Li, S Han, D Zhang - arxiv preprint arxiv:2407.02211, 2024 - arxiv.org
Recent advances in fine-tuning large language models (LLMs) have greatly enhanced their
usage in domain-specific tasks. Despite the success, fine-tuning continues to rely on …

Tablerag: Million-token table understanding with language models

SA Chen, L Miculicich, JM Eisenschlos, Z Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent advancements in language models (LMs) have notably enhanced their ability to
reason with tabular data, primarily through program-aided mechanisms that manipulate and …

STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing

J Zou, Q Wang, P Thakur, N Kani - arxiv preprint arxiv:2411.00387, 2024 - arxiv.org
Advances in large language models (LLMs) have spurred research into enhancing their
reasoning capabilities, particularly in math-rich STEM documents. While LLMs can generate …

RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector

Z Wang, W Yang, K Zhou, Y Zhang, W Jia - arxiv preprint arxiv …, 2024 - arxiv.org
The real estate market relies heavily on structured data, such as property details, market
trends, and price fluctuations. However, the lack of specialized Tabular Question Answering …

MATSA: Multi-Agent Table Structure Attribution

P Mathur, A Siu, N Lipka, T Sun - Proceedings of the 2024 …, 2024 - aclanthology.org
Abstract Large Language Models (LLMs) have significantly advanced QA tasks through in-
context learning but often suffer from hallucinations. Attributing supporting evidence …