Dataset discovery and exploration: A survey

NW Paton, J Chen, Z Wu - ACM Computing Surveys, 2023 - dl.acm.org
Data scientists are tasked with obtaining insights from data. However, suitable data is often
not immediately at hand, and there may be many potentially relevant datasets in a data lake …

Business data sharing through data marketplaces: A systematic literature review

AE Abbas, W Agahari, M Van de Ven… - Journal of Theoretical …, 2021 - mdpi.com
Data marketplaces are expected to play a crucial role in tomorrow's data economy, but such
marketplaces are seldom commercially viable. Currently, there is no clear understanding of …

Annotating columns with pre-trained language models

Y Suhara, J Li, Y Li, D Zhang, Ç Demiralp… - Proceedings of the …, 2022 - dl.acm.org
Inferring meta information about tables, such as column headers or relationships between
columns, is an active research topic in data management as we find many tables are …

A multitrophic perspective on biodiversity–ecosystem functioning research

N Eisenhauer, H Schielzeth, AD Barnes… - Advances in ecological …, 2019 - Elsevier
Concern about the functional consequences of unprecedented loss in biodiversity has
prompted biodiversity–ecosystem functioning (BEF) research to become one of the most …

A survey on question answering systems over linked data and documents

E Dimitrakis, K Sgontzos, Y Tzitzikas - Journal of intelligent information …, 2020 - Springer
Question Answering (QA) systems aim at supplying precise answers to questions, posed by
users in a natural language form. They are used in a wide range of application areas, from …

Column type annotation using chatgpt

K Korini, C Bizer - arxiv preprint arxiv:2306.00745, 2023 - arxiv.org
Column type annotation is the task of annotating the columns of a relational table with the
semantic type of the values contained in each column. Column type annotation is an …

CHORUS: foundation models for unified data discovery and exploration

M Kayali, A Lykov, I Fountalis, N Vasiloglou… - arxiv preprint arxiv …, 2023 - arxiv.org
We apply foundation models to data discovery and exploration tasks. Foundation models
include large language models (LLMs) that show promising performance on a range of …

Table discovery in data lakes: State-of-the-art and future directions

G Fan, J Wang, Y Li, RJ Miller - … of the 2023 International Conference on …, 2023 - dl.acm.org
Data discovery refers to a set of tasks that enable users and downstream applications to
explore and gain insights from massive collections of data sources such as data lakes. In …

Auctus: A dataset search engine for data augmentation

S Castelo, R Rampin, A Santos, A Bessa… - arxiv preprint arxiv …, 2021 - arxiv.org
The large volumes of structured data currently available, from Web tables to open-data
portals and enterprise data, open up new opportunities for progress in answering many …

Data management in digital twins: a systematic literature review

JB Correia, M Abel, K Becker - Knowledge and Information Systems, 2023 - Springer
Abstract The Internet of Things (IoT) and continuous advances in data-gathering devices
and techniques have significantly increased the amount of relevant data that can be …