IndeGx: A model and a framework for indexing RDF knowledge graphs with SPARQL-based test suits

P Maillot, O Corby, C Faron, F Gandon… - Journal of Web Semantics, 2023 - Elsevier
In recent years, a large number of RDF datasets have been built and published on the Web
in fields as diverse as linguistics or life sciences, as well as general datasets such as …

Dense re-ranking with weak supervision for RDF dataset search

Q Chen, Z Huang, Z Zhang, W Luo, T Lin, Q Shi… - International Semantic …, 2023 - Springer
Dataset search aims to find datasets that are relevant to a keyword query. Existing dataset
search engines rely on conventional sparse retrieval models (eg, BM25). Dense models (eg …

ACORDAR: a test collection for ad hoc content-based (RDF) dataset retrieval

T Lin, Q Chen, G Cheng, A Soylu, B Ell… - Proceedings of the 45th …, 2022 - dl.acm.org
Ad hoc dataset retrieval is a trending topic in IR research. Methods and systems are evolving
from metadata-based to content-based ones which exploit the data itself for improving …

ACORDAR 2.0: A Test Collection for Ad Hoc Dataset Retrieval with Densely Pooled Datasets and Question-Style Queries

Q Chen, W Luo, Z Huang, T Lin, X Wang… - Proceedings of the 47th …, 2024 - dl.acm.org
Dataset search, or more specifically, ad hoc dataset retrieval which is a trending specialized
IR task, has received increasing attention in both academia and industry. While methods …

Towards more usable dataset search: From query characterization to snippet generation

J Chen, X Wang, G Cheng, E Kharlamov… - Proceedings of the 28th …, 2019 - dl.acm.org
Reusing published datasets on the Web is of great interest to researchers and developers.
Their data needs may be met by submitting queries to a dataset search engine to retrieve …

BANDAR: benchmarking snippet generation algorithms for (RDF) dataset search

X Wang, G Cheng, JZ Pan… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The large volume of open data on the Web is expected to be reused and create value.
Finding the right data to reuse is a non-trivial task addressed by the recent dataset search …

Recommending datasets for scientific problem descriptions

M Färber, AK Leisinger - Proceedings of the 30th ACM International …, 2021 - dl.acm.org
The steadily rising number of datasets is making it increasingly difficult for researchers and
practitioners to be aware of all datasets, particularly of the most relevant datasets for a given …

Enabling automatic discovery and querying of web APIs at web scale using linked data standards

F Michel, C Faron-Zucker, O Corby… - … proceedings of the 2019 …, 2019 - dl.acm.org
To help in making sense of the ever-increasing number of data sources available on the
Web, in this article we tackle the problem of enabling automatic discovery and querying of …

Enhancing Dataset Search with Compact Data Snippets

Q Chen, J Chen, X Zhou, G Cheng - … of the 47th International ACM SIGIR …, 2024 - dl.acm.org
In light of the growing availability and significance of open data, the problem of dataset
search has attracted great attention in the field of information retrieval. Nevertheless, current …

Content-based union and complement metrics for dataset search over RDF knowledge graphs

M Mountantonakis, Y Tzitzikas - Journal of Data and Information Quality …, 2020 - dl.acm.org
RDF Knowledge Graphs (or Datasets) contain valuable information that can be exploited for
a variety of real-world tasks. However, due to the enormous size of the available RDF …