Web table extraction, retrieval, and augmentation: A survey

S Zhang, K Balog - ACM Transactions on Intelligent Systems and …, 2020 - dl.acm.org
Tables are powerful and popular tools for organizing and manipulating data. A vast number
of tables can be found on the Web, which represent a valuable knowledge resource. The …

[HTML][HTML] Dataset search: a survey

A Chapman, E Simperl, L Koesten, G Konstantinidis… - The VLDB Journal, 2020 - Springer
Generating value from data requires the ability to find, access and make sense of datasets.
There are many efforts underway to encourage data sharing and reuse, from scientific …

Large language models are versatile decomposers: Decomposing evidence and questions for table-based reasoning

Y Ye, B Hui, M Yang, B Li, F Huang, Y Li - Proceedings of the 46th …, 2023 - dl.acm.org
Table-based reasoning has shown remarkable progress in a wide range of table-based
tasks. It is a challenging task, which requires reasoning over both free-form natural language …

Table2vec: Neural word and entity embeddings for table population and retrieval

L Zhang, S Zhang, K Balog - Proceedings of the 42nd international ACM …, 2019 - dl.acm.org
Tables contain valuable knowledge in a structured form. We employ neural language
modeling approaches to embed tabular data into vector spaces. Specifically, we consider …

Novel entity discovery from web tables

S Zhang, E Meij, K Balog, R Reinanda - Proceedings of The Web …, 2020 - dl.acm.org
When working with any sort of knowledge base (KB) one has to make sure it is as complete
and also as up-to-date as possible. Both tasks are non-trivial as they require recall-oriented …

ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models

B Newman, Y Lee, A Naik, P Siangliulue, R Fok… - arxiv preprint arxiv …, 2024 - arxiv.org
When conducting literature reviews, scientists often create literature review tables-tables
whose rows are publications and whose columns constitute a schema, a set of aspects used …

Auto-completion for data cells in relational tables

S Zhang, K Balog - Proceedings of the 28th ACM International …, 2019 - dl.acm.org
We address the task of auto-completing data cells in relational tables. Such tables describe
entities (in rows) with their attributes (in columns). We present the CellAutoComplete …

SynSetExpan: An iterative framework for joint entity set expansion and synonym discovery

J Shen, W Qiu, J Shang, M Vanni, X Ren… - arxiv preprint arxiv …, 2020 - arxiv.org
Entity set expansion and synonym discovery are two critical NLP tasks. Previous studies
accomplish them separately, without exploring their interdependencies. In this work, we …

Semantics-enabled query performance prediction for ad hoc table retrieval

M Khodabakhsh, E Bagheri - Information Processing & Management, 2021 - Elsevier
Predicting the performance of a retrieval method for a given query is a highly important and
challenging problem in information retrieval. Accurate Query Performance Prediction (QPP) …

Neural relation extraction on wikipedia tables for augmenting knowledge graphs

E Macdonald, D Barbosa - Proceedings of the 29th ACM International …, 2020 - dl.acm.org
Knowledge Graph Augmentation is the task of adding missing facts to an incomplete
knowledge graph to improve its effectiveness in applications such as web search and …