Large-scale semantic integration of linked data: A survey

M Mountantonakis, Y Tzitzikas - ACM Computing Surveys (CSUR), 2019‏ - dl.acm.org
A large number of published datasets (or sources) that follow Linked Data principles is
currently available and this number grows rapidly. However, the major target of Linked Data …

The WDC training dataset and gold standard for large-scale product matching

A Primpeli, R Peeters, C Bizer - … Proceedings of The 2019 World Wide …, 2019‏ - dl.acm.org
A current research question in the area of entity resolution (also called link discovery or
duplicate detection) is whether and in which cases embeddings and deep neural network …

The webdatacommons microdata, rdfa and microformat dataset series

R Meusel, P Petrovski, C Bizer - The Semantic Web–ISWC 2014: 13th …, 2014‏ - Springer
In order to support web applications to understand the content of HTML pages an increasing
number of websites have started to annotate structured data within their pages using markup …

A machine learning approach for product matching and categorization

P Ristoski, P Petrovski, P Mika, H Paulheim - Semantic web, 2018‏ - content.iospress.com
Consumers today have the option to purchase products from thousands of e-shops.
However, the completeness of the product specifications and the taxonomies used for …

Extracting attribute-value pairs from product specifications on the web

P Petrovski, C Bizer - Proceedings of the International Conference on …, 2017‏ - dl.acm.org
Comparison shop** portals integrate product offers from large numbers of e-shops in
order to support consumers in their buying decisions. Product offers often consist of a title …

Heuristics for Fixing Common Errors in Deployed schema.org Microdata

R Meusel, H Paulheim - European Semantic Web Conference, 2015‏ - Springer
Being promoted by major search engines such as Google, Yahoo!, Bing, and Yandex,
Microdata embedded in web pages, especially using schema. org, has become one of the …

The WDC gold standards for product feature extraction and product matching

P Petrovski, A Primpeli, R Meusel, C Bizer - International Conference on …, 2016‏ - Springer
Finding out which e-shops offer a specific product is a central challenge for building
integrated product catalogs and comparison shop** portals. Determining whether two …

Enriching product ads with metadata from html annotations

P Ristoski, P Mika - The Semantic Web. Latest Advances and New …, 2016‏ - Springer
Product ads are a popular form of search advertizing offered by major search engines,
including Yahoo, Google and Bing. Unlike traditional search ads, product ads include …

Using the semantic web as a source of training data

C Bizer, A Primpeli, R Peeters - Datenbank-Spektrum, 2019‏ - Springer
Deep neural networks are increasingly used for tasks such as entity resolution, sentiment
analysis, and information extraction. As the methods are rather training data hungry, it is …

An incremental hierarchical clustering based system for record linkage in E-Commerce domain

F Gözükara, SA Özel - The Computer Journal, 2023‏ - academic.oup.com
In this study, a novel record linkage system for E-commerce products is presented. Our
system aims to cluster the same products that are crawled from different E-commerce …