Dataset discovery and exploration: A survey

NW Paton, J Chen, Z Wu - ACM Computing Surveys, 2023 - dl.acm.org
Data scientists are tasked with obtaining insights from data. However, suitable data is often
not immediately at hand, and there may be many potentially relevant datasets in a data lake …

Knowledge graphs on the web–an overview

N Heist, S Hertling, D Ringler… - Knowledge Graphs for …, 2020 - ebooks.iospress.nl
Abstract Knowledge Graphs are an emerging form of knowledge representation. While
Google coined the term Knowledge Graph first and promoted it as a means to improve their …

Definition modeling: Learning to define word embeddings in natural language

T Noraset, C Liang, L Birnbaum… - Proceedings of the AAAI …, 2017 - ojs.aaai.org
Distributed representations of words have been shown to capture lexical semantics, based
on their effectiveness in word similarity and analogical relation tasks. But, these tasks only …

The limits of word level differential privacy

J Mattern, B Weggenmann, F Kerschbaum - ar** with weak supervision from a masked language model
H Dai, Y Song, H Wang - ar** by using a richer and ultra-fine
set of types, and labeling noun phrases including pronouns and nominal nouns instead of …

Taxogen: Unsupervised topic taxonomy construction by adaptive term embedding and clustering

C Zhang, F Tao, X Chen, J Shen, M Jiang… - Proceedings of the 24th …, 2018 - dl.acm.org
Taxonomy construction is not only a fundamental task for semantic analysis of text corpora,
but also an important step for applications such as information filtering, recommendation …

Nettaxo: Automated topic taxonomy construction from text-rich network

J Shang, X Zhang, L Liu, S Li, J Han - Proceedings of the web …, 2020 - dl.acm.org
The automated construction of topic taxonomies can benefit numerous applications,
including web search, recommendation, and knowledge discovery. One of the major …

Inferring concept hierarchies from text corpora via hyperbolic embeddings

M Le, S Roller, L Papaxanthos, D Kiela… - arxiv preprint arxiv …, 2019 - arxiv.org
We consider the task of inferring is-a relationships from large text corpora. For this purpose,
we propose a new method combining hyperbolic embeddings and Hearst patterns. This …

Refined commonsense knowledge from large-scale web contents

TP Nguyen, S Razniewski, J Romero… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Commonsense knowledge (CSK) about concepts and their properties is helpful for AI
applications. Prior works, such as ConceptNet, have compiled large CSK collections …

[PDF][PDF] Taxi at semeval-2016 task 13: a taxonomy induction method based on lexico-syntactic patterns, substrings and focused crawling

A Panchenko, S Faralli, E Ruppert… - Proceedings of the …, 2016 - aclanthology.org
We present a system for taxonomy construction that reached the first place in all subtasks of
the SemEval 2016 challenge on Taxonomy Extraction Evaluation. Our simple yet effective …