Semantic similarity and machine learning with ontologies

M Kulmanov, FZ Smaili, X Gao… - Briefings in …, 2021 - academic.oup.com
Ontologies have long been employed in the life sciences to formally represent and reason
over domain knowledge and they are employed in almost every major biological database …

Modeling community standards for metadata as templates makes data FAIR

MA Musen, MJ O'Connor, E Schultes… - Scientific Data, 2022 - nature.com
It is challenging to determine whether datasets are findable, accessible, interoperable, and
reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria …

[PDF][PDF] Towards Semantic Integration for Explainable Artificial Intelligence in the Biomedical Domain.

C Pesquita - HEALTHINF, 2021 - pdfs.semanticscholar.org
Explainable artificial intelligence typically focuses on data-based explanations, lacking the
semantic context needed to produce human-centric explanations. This is especially relevant …

An empirical meta-analysis of the life sciences linked open data on the web

MR Kamdar, MA Musen - Scientific data, 2021 - nature.com
While the biomedical community has published several “open data” sources in the last
decade, most researchers still endure severe logistical and technical challenges to discover …

Text snippets to corroborate medical relations: an unsupervised approach using a knowledge graph and embeddings

MR Kamdar, CE Stanley, M Carroll… - AMIA Summits on …, 2020 - pmc.ncbi.nlm.nih.gov
Abstract Knowledge graphs have been shown to significantly improve search results.
Usually populated by subject matter experts, relations therein need to keep up to date with …

Use of a Structured Knowledge Base Enhances Metadata Curation by Large Language Models

SS Sundaram, B Solomon, A Khatri, A Laumas… - arxiv preprint arxiv …, 2024 - arxiv.org
Metadata play a crucial role in ensuring the findability, accessibility, interoperability, and
reusability of datasets. This paper investigates the potential of large language models …

Interoperability of Open Science Metadata: What About the Reality?

VN Dang, N Aussenac-Gilles, I Megdiche… - … Conference on Research …, 2023 - Springer
Open Science aims at sharing results and data widely between different research domains.
Interoperability is one of the keys to enable the exchange and crossing of data between …

Making metadata more FAIR using large language models

SS Sundaram, MA Musen - arxiv preprint arxiv:2307.13085, 2023 - arxiv.org
With the global increase in experimental data artifacts, harnessing them in a unified fashion
leads to a major stumbling block-bad metadata. To bridge this gap, this work presents a …

Increasing metadata coverage of SRA BioSample entries using deep learning–based named entity recognition

A Klie, BY Tsui, S Mollah, D Skola, M Dow, CN Hsu… - Database, 2021 - academic.oup.com
High-quality metadata annotations for data hosted in large public repositories are essential
for research reproducibility and for conducting fast, powerful and scalable meta-analyses …

[PDF][PDF] The Life Cycle of Data Labels in Organizational Learning: a Case Study of the Automotive Industry.

J Eirich, D Fischer-Preßler - ECIS, 2022 - researchgate.net
Data labels are an integral input to develop machine learning (ML) models. In complex
domains, labels represent the externalized product of complex knowledge. While prior …