YAGO: A multilingual knowledge base from Wikipedia, WordNet, and GeoNames

T Rebele, F Suchanek, J Hoffart, J Biega… - The Semantic Web …, 2016 - Springer
YAGO is a large knowledge base that is built automatically from Wikipedia, WordNet and
GeoNames. The project combines information from Wikipedias in 10 different languages into …

Knowledge bases and language models: Complementing forces

F Suchanek, AT Luu - International Joint Conference on Rules and …, 2023 - Springer
Large language models (LLMs), as a particular instance of generative artificial intelligence,
have revolutionized natural language processing. In this invited paper, we argue that LLMs …

Synthesizing type-detection logic for rich semantic data types using open-source code

C Yan, Y He - Proceedings of the 2018 International Conference on …, 2018 - dl.acm.org
Given a table of data, existing systems can often detect basic atomic types (e.g., strings vs.
numbers) for each column. A new generation of data-analytics and data-preparation …

Knowledge harvesting: achievements and challenges

G Weikum, J Hoffart, F Suchanek - … and Software Science: State of the Art …, 2019 - Springer

Ten Years of Knowledge Harvesting: Lessons and Challenges.

G Weikum, J Hoffart, FM Suchanek - IEEE Data Eng. Bull., 2016 - Citeseer
This article is a retrospective on the theme of knowledge harvesting: automatically
constructing large high-quality knowledge bases from Internet sources. We draw on our …

Auto-Tag: Tagging-Data-By-Example in Data Lakes

Y He, J Song, Y Wang, S Chaudhuri, V Anil… - arXiv preprint arXiv …, 2021 - arxiv.org
As data lakes become increasingly popular in large enterprises today, there is a growing
need to tag or classify data assets (e.g., files and databases) in data lakes with additional …

Big data linkage for product specification pages

D Qiu, L Barbosa, V Crescenzi, P Merialdo… - Proceedings of the …, 2018 - dl.acm.org
An increasing number of product pages are available from thousands of web sources, each
page associated with a product, containing its attributes and one or more product identifiers …

Big Data Integration for Product Specifications.

L Barbosa, V Crescenzi, XL Dong, P Merialdo… - IEEE Data Eng …, 2018 - sites.computer.org
The product domain contains valuable data for many important applications. Given the large
and increasing number of sources that provide data about product specifications and the …

Product classification - a hierarchical approach

M Karlsson, A Karlstedt - LU-CS-EX 2016-31, 2016 - lup.lub.lu.se
The social and environmental impact associated with consuming a product is something that
is becoming increasingly important to consumers and businesses alike. This impact can in …

A hitchhiker's guide to ontology

F Suchanek - DESIRES 2021, 2021 - imt.hal.science
A knowledge base (KB) is a computer-processable collection of knowledge about the world.
In its simplest variant, a KB takes the form of a labeled graph, where the nodes are entities …