Knowledge graphs
In this article, we provide a comprehensive introduction to knowledge graphs, which have
recently garnered significant attention from both industry and academia in scenarios that …
recently garnered significant attention from both industry and academia in scenarios that …
A comprehensive survey on automatic knowledge graph construction
Automatic knowledge graph construction aims at manufacturing structured human
knowledge. To this end, much effort has historically been spent extracting informative fact …
knowledge. To this end, much effort has historically been spent extracting informative fact …
Data lake management: challenges and opportunities
The ubiquity of data lakes has created fascinating new challenges for data management
research. In this tutorial, we review the state-of-the-art in data management for data lakes …
research. In this tutorial, we review the state-of-the-art in data management for data lakes …
Tuta: Tree-based transformers for generally structured table pre-training
We propose TUTA, a unified pre-training architecture for understanding generally structured
tables. Noticing that understanding a table requires spatial, hierarchical, and semantic …
tables. Noticing that understanding a table requires spatial, hierarchical, and semantic …
From tabular data to knowledge graphs: A survey of semantic table interpretation tasks and methods
Tabular data often refers to data that is organized in a table with rows and columns. We
observe that this data format is widely used on the Web and within enterprise data …
observe that this data format is widely used on the Web and within enterprise data …
Web table extraction, retrieval, and augmentation: A survey
Tables are powerful and popular tools for organizing and manipulating data. A vast number
of tables can be found on the Web, which represent a valuable knowledge resource. The …
of tables can be found on the Web, which represent a valuable knowledge resource. The …
Dataset discovery and exploration: A survey
Data scientists are tasked with obtaining insights from data. However, suitable data is often
not immediately at hand, and there may be many potentially relevant datasets in a data lake …
not immediately at hand, and there may be many potentially relevant datasets in a data lake …
Gittables: A large-scale corpus of relational tables
The success of deep learning has sparked interest in improving relational table tasks, like
data preparation and search, with table representation models trained on large table …
data preparation and search, with table representation models trained on large table …
Ten years of webtables
In 2008, we wrote about WebTables, an effort to exploit the large and diverse set of
structured databases casually published online in the form of HTML tables. The past decade …
structured databases casually published online in the form of HTML tables. The past decade …
Entrant: A large financial dataset for table understanding
Tabular data is a way to structure, organize, and present information conveniently and
effectively. Real-world tables present data in two dimensions by arranging cells in matrices …
effectively. Real-world tables present data in two dimensions by arranging cells in matrices …