Dataset discovery and exploration: A survey

NW Paton, J Chen, Z Wu - ACM Computing Surveys, 2023 - dl.acm.org
Data scientists are tasked with obtaining insights from data. However, suitable data is often
not immediately at hand, and there may be many potentially relevant datasets in a data lake …

Knowledge graphs on the web–an overview

N Heist, S Hertling, D Ringler… - Knowledge Graphs for …, 2020 - ebooks.iospress.nl
Knowledge Graphs are an emerging form of knowledge representation. While
Google coined the term Knowledge Graph first and promoted it as a means to improve their …

Ultra-fine entity typing with weak supervision from a masked language model

H Dai, Y Song, H Wang - ar…
… typing by using a richer and ultra-fine
set of types, and labeling noun phrases including pronouns and nominal nouns instead of …

Definition modeling: Learning to define word embeddings in natural language

T Noraset, C Liang, L Birnbaum… - Proceedings of the AAAI …, 2017 - ojs.aaai.org
Distributed representations of words have been shown to capture lexical semantics, based
on their effectiveness in word similarity and analogical relation tasks. But, these tasks only …

Nettaxo: Automated topic taxonomy construction from text-rich network

J Shang, X Zhang, L Liu, S Li, J Han - Proceedings of the web …, 2020 - dl.acm.org
The automated construction of topic taxonomies can benefit numerous applications,
including web search, recommendation, and knowledge discovery. One of the major …

Taxogen: Unsupervised topic taxonomy construction by adaptive term embedding and clustering

C Zhang, F Tao, X Chen, J Shen, M Jiang… - Proceedings of the 24th …, 2018 - dl.acm.org
Taxonomy construction is not only a fundamental task for semantic analysis of text corpora,
but also an important step for applications such as information filtering, recommendation …

Advanced semantics for commonsense knowledge extraction

TP Nguyen, S Razniewski, G Weikum - Proceedings of the Web …, 2021 - dl.acm.org
Commonsense knowledge (CSK) about concepts and their properties is useful for AI
applications such as robust chatbots. Prior works like ConceptNet, TupleKB and others …

Inferring concept hierarchies from text corpora via hyperbolic embeddings

M Le, S Roller, L Papaxanthos, D Kiela… - arxiv preprint arxiv …, 2019 - arxiv.org
We consider the task of inferring is-a relationships from large text corpora. For this purpose,
we propose a new method combining hyperbolic embeddings and Hearst patterns. This …

Refined commonsense knowledge from large-scale web contents

TP Nguyen, S Razniewski, J Romero… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Commonsense knowledge (CSK) about concepts and their properties is helpful for AI
applications. Prior works, such as ConceptNet, have compiled large CSK collections …

The limits of word level differential privacy

J Mattern, B Weggenmann, F Kerschbaum - arxiv preprint arxiv …, 2022 - arxiv.org
As the issues of privacy and trust are receiving increasing attention within the research
community, various attempts have been made to anonymize textual data. A significant …