Large Language Models (LLMs) on Tabular Data: Prediction, Generation, and Understanding--A Survey

X Fang, W Xu, FA Tan, J Zhang, Z Hu, Y Qi… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent breakthroughs in large language modeling have facilitated rigorous exploration of
their application in diverse tasks related to tabular data modeling, such as prediction, tabular …

From tabular data to knowledge graphs: A survey of semantic table interpretation tasks and methods

J Liu, Y Chabot, R Troncy, VP Huynh, T Labbé… - Journal of Web …, 2023 - Elsevier
Tabular data often refers to data that is organized in a table with rows and columns. We
observe that this data format is widely used on the Web and within enterprise data …

Table meets llm: Can large language models understand structured table data? a benchmark and empirical study

Y Sui, M Zhou, M Zhou, S Han, D Zhang - Proceedings of the 17th ACM …, 2024 - dl.acm.org
Large language models (LLMs) are becoming attractive as few-shot reasoners to solve
Natural Language (NL)-related tasks. However, there is still much to learn about how well …

Chain-of-table: Evolving tables in the reasoning chain for table understanding

Z Wang, H Zhang, CL Li, JM Eisenschlos… - arxiv preprint arxiv …, 2024 - arxiv.org
Table-based reasoning with large language models (LLMs) is a promising direction to tackle
many table understanding tasks, such as table-based question answering and fact …

Tablellama: Towards open large generalist models for tables

T Zhang, X Yue, Y Li, H Sun - arxiv preprint arxiv:2311.09206, 2023 - arxiv.org
Semi-structured tables are ubiquitous. There has been a variety of tasks that aim to
automatically interpret, augment, and query tables. Current methods often require …

TAPEX: Table pre-training via learning a neural SQL executor

Q Liu, B Chen, J Guo, M Ziyadi, Z Lin, W Chen… - arxiv preprint arxiv …, 2021 - arxiv.org
Recent progress in language model pre-training has achieved a great success via
leveraging large-scale unstructured textual data. However, it is still a challenge to apply pre …

Hytrel: Hypergraph-enhanced tabular data representation learning

P Chen, S Sarkar, L Lausen… - Advances in …, 2023 - proceedings.neurips.cc
Abstract Language models pretrained on large collections of tabular data have
demonstrated their effectiveness in several downstream tasks. However, many of these …

Annotating columns with pre-trained language models

Y Suhara, J Li, Y Li, D Zhang, Ç Demiralp… - Proceedings of the …, 2022 - dl.acm.org
Inferring meta information about tables, such as column headers or relationships between
columns, is an active research topic in data management as we find many tables are …

MultiHiertt: Numerical reasoning over multi hierarchical tabular and textual data

Y Zhao, Y Li, C Li, R Zhang - arxiv preprint arxiv:2206.01347, 2022 - arxiv.org
Numerical reasoning over hybrid data containing both textual and tabular content (eg,
financial reports) has recently attracted much attention in the NLP community. However …

Transformers for tabular data representation: A survey of models and applications

G Badaro, M Saeed, P Papotti - Transactions of the Association for …, 2023 - direct.mit.edu
In the last few years, the natural language processing community has witnessed advances
in neural representations of free texts with transformer-based language models (LMs). Given …