Table pre-training: A survey on model architectures, pre-training objectives, and downstream tasks

H Dong, Z Cheng, X He, M Zhou, A Zhou… - arxiv preprint arxiv …, 2022 - arxiv.org
Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs,
and various other document types, a flurry of table pre-training frameworks have been …

A survey of table reasoning with large language models

X Zhang, D Wang, L Dou, Q Zhu, W Che - Frontiers of Computer Science, 2025 - Springer
Table reasoning aims to generate inference results based on the user requirement and the
provided table. Enhancing the table reasoning capability of the model can aid in obtaining …

Neurologic a* esque decoding: Constrained text generation with lookahead heuristics

X Lu, S Welleck, P West, L Jiang, J Kasai… - arxiv preprint arxiv …, 2021 - arxiv.org
The dominant paradigm for neural text generation is left-to-right decoding from
autoregressive language models. Constrained or controllable generation under complex …

Folio: Natural language reasoning with first-order logic

S Han, H Schoelkopf, Y Zhao, Z Qi, M Riddell… - arxiv preprint arxiv …, 2022 - arxiv.org
Large language models (LLMs) have achieved remarkable performance on a variety of
natural language understanding tasks. However, existing benchmarks are inadequate in …

ToTTo: A controlled table-to-text generation dataset

AP Parikh, X Wang, S Gehrmann, M Faruqui… - arxiv preprint arxiv …, 2020 - arxiv.org
We present ToTTo, an open-domain English table-to-text dataset with over 120,000 training
examples that proposes a controlled generation task: given a Wikipedia table and a set of …

Large language models are few (1)-shot table reasoners

W Chen - arxiv preprint arxiv:2210.06710, 2022 - arxiv.org
Recent literature has shown that large language models (LLMs) are generally excellent few-
shot reasoners to solve text reasoning tasks. However, the capability of LLMs on table …

Chart-to-text: A large-scale benchmark for chart summarization

S Kantharaj, RTK Leong, X Lin, A Masry… - arxiv preprint arxiv …, 2022 - arxiv.org
Charts are commonly used for exploring data and communicating insights. Generating
natural language summaries from charts can be very helpful for people in inferring key …

Dart: Open-domain structured data record to text generation

L Nan, D Radev, R Zhang, A Rau, A Sivaprasad… - arxiv preprint arxiv …, 2020 - arxiv.org
We present DART, an open domain structured DAta Record to Text generation dataset with
over 82k instances (DARTs). Data-to-Text annotations can be a costly process, especially …

KGPT: Knowledge-grounded pre-training for data-to-text generation

W Chen, Y Su, X Yan, WY Wang - arxiv preprint arxiv:2010.02307, 2020 - arxiv.org
Data-to-text generation has recently attracted substantial interests due to its wide
applications. Existing methods have shown impressive performance on an array of tasks …

Multi-hop question answering

V Mavi, A Jangra, A Jatowt - Foundations and Trends® in …, 2024 - nowpublishers.com
Abstract The task of Question Answering (QA) has attracted significant research interest for a
long time. Its relevance to language understanding and knowledge retrieval tasks, along …