Table pre-training: A survey on model architectures, pre-training objectives, and downstream tasks
Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs,
and various other document types, a flurry of table pre-training frameworks have been …
and various other document types, a flurry of table pre-training frameworks have been …
Deep learning for table detection and structure recognition: A survey
Tables are everywhere, from scientific journals, articles, websites, and newspapers all the
way to items we buy at the supermarket. Detecting them is thus of utmost importance to …
way to items we buy at the supermarket. Detecting them is thus of utmost importance to …
Tuta: Tree-based transformers for generally structured table pre-training
We propose TUTA, a unified pre-training architecture for understanding generally structured
tables. Noticing that understanding a table requires spatial, hierarchical, and semantic …
tables. Noticing that understanding a table requires spatial, hierarchical, and semantic …
Large language models for tabular data: Progresses and future directions
Tables contain a significant portion of the world's structured information. The ability to
efficiently and accurately understand, process, reason about, analyze, and generate tabular …
efficiently and accurately understand, process, reason about, analyze, and generate tabular …
Entrant: A large financial dataset for table understanding
Tabular data is a way to structure, organize, and present information conveniently and
effectively. Real-world tables present data in two dimensions by arranging cells in matrices …
effectively. Real-world tables present data in two dimensions by arranging cells in matrices …
Table understanding: Problem overview
A Shigarov - Wiley Interdisciplinary Reviews: Data Mining and …, 2023 - Wiley Online Library
Tables are probably the most natural way to represent relational data in various media and
formats. They store a large number of valuable facts that could be utilized for question …
formats. They store a large number of valuable facts that could be utilized for question …
Fortap: Using formulas for numerical-reasoning-aware table pretraining
Tables store rich numerical data, but numerical reasoning over tables is still a challenge. In
this paper, we find that the spreadsheet formula, which performs calculations on numerical …
this paper, we find that the spreadsheet formula, which performs calculations on numerical …
GetPt: Graph-enhanced General Table Pre-training with Alternate Attention Network
Tables are widely used for data storage and presentation due to their high flexibility in
layout. The importance of tables as information carriers and the complexity of tabular data …
layout. The importance of tables as information carriers and the complexity of tabular data …
End-to-End Compound Table Understanding with Multi-Modal Modeling
Table is a widely used data form in webpages, spreadsheets, or PDFs to organize and
present structural data. Although studies on table structure recognition have been …
present structural data. Although studies on table structure recognition have been …
Document parsing unveiled: Techniques, challenges, and prospects for structured information extraction
Document parsing is essential for converting unstructured and semi-structured documents-
such as contracts, academic papers, and invoices-into structured, machine-readable data …
such as contracts, academic papers, and invoices-into structured, machine-readable data …