Question answering versus named entity recognition for extracting unknown datasets

Y Younes, A Scherp - IEEE Access, 2023 - ieeexplore.ieee.org
Dataset mention extraction is a difficult problem due to the unstructured nature of text, the
sparsity of dataset mentions, and the various ways the same dataset can be mentioned …

A maturity model for catalogues of semantic artefacts

O Corcho, FJ Ekaputra, I Heibi, C Jonquet, A Micsik… - Scientific Data, 2024 - nature.com
This work presents a maturity model for assessing catalogues of semantic artefacts, one of
the keystones that permit semantic interoperability of systems. We defined the dimensions …

DataExpo: A One-Stop Dataset Service for Open Science Research

B Lu, L Wu, L Yang, C Sun, W Liu, X Gan… - … Proceedings of the …, 2023 - dl.acm.org
The large volumes of data on the Internet provide new opportunities for scientific discovery,
especially promoting data-driven open science research. However, due to lack of accurate …

Discovering datasets on the web scale: Challenges and recommendations for Google Dataset Search

K Sostek, DM Russell, N Goyal, T Alrashed, S Dugall… - 2024 - hdsr.mitpress.mit.edu
With the rise of open data in the last two decades, more datasets are online and more
people are using them for projects and research. But how do people find datasets? We …

[BOOK] Human-centered data discovery

K Gregory, L Koesten - 2022 - Springer
Data are everywhere. Across sectors, millions of datasets are available in data repositories,
online marketplaces and from individual publishers (Brickley et al. 2019; Verhulst and …

DSDD: Domain-Specific Dataset Discovery on the Web

H Zhang, A Santos, J Freire - Proceedings of the 30th ACM International …, 2021 - dl.acm.org
With the push for transparency and open data, many datasets and data repositories are
becoming available on the Web. This opens new opportunities for data-driven exploration …

Relationships Are Complicated! An Analysis of Relationships Between Datasets on the Web

K Lin, T Alrashed, N Noy - International Semantic Web Conference, 2024 - Springer
The Web today has millions of datasets, and the number of datasets continues to grow at a
rapid pace. These datasets are not standalone entities; rather, they are intricately connected …

Model Lakes

K Pal, D Bau, RJ Miller - arXiv preprint arXiv:2403.02327, 2024 - arxiv.org
Given a set of deep learning models, it can be hard to find models appropriate to a task,
understand the models, and characterize how models are different one from another …

[PDF] Where are the Datasets? A case study on the German Academic Web Archive

Y Younes, S Tiesler, R Jäschke, B Mathiak - WADL, 2022 - amor.cms.hu-berlin.de
The German Academic Web (GAW) is a longitudinal archive of websites from
German academic institutions, mainly universities. It can support answering research …

[PDF] A maturity model for catalogues of semantic artefacts

S Peroni, E Storti - iris.univpm.it
Oscar Corcho, Fajar J. Ekaputra, Ivan Heibi, Clement Jonquet, Andras Micsik,
Silvio Peroni & Emanuele Storti. This work presents a maturity model for assessing …