Dual-objective fine-tuning of BERT for entity matching

R Peeters, C Bizer - Proceedings of the VLDB …, 2021 - madoc.bib.uni-mannheim.de
An increasing number of data providers have adopted shared numbering schemes such as
GTIN, ISBN, DUNS, or ORCID numbers for identifying entities in the respective domain. This …

Entity matching using large language models

R Peeters, A Steiner, C Bizer - arXiv preprint arXiv:2310.11244, 2023 - arxiv.org
Entity matching is the task of deciding whether two entity descriptions refer to the same real-
world entity. Entity matching is a central step in most data integration pipelines. Many state …

AIS-based intelligent vessel trajectory prediction using bi-LSTM

CH Yang, CH Wu, JC Shao, YC Wang… - IEEE Access, 2022 - ieeexplore.ieee.org
Accurate vessel trajectory prediction is essential for maritime traffic control and
management. In addition to collision avoidance, accurate vessel trajectory prediction can …

The WDC training dataset and gold standard for large-scale product matching

A Primpeli, R Peeters, C Bizer - … Proceedings of The 2019 World Wide …, 2019 - dl.acm.org
A current research question in the area of entity resolution (also called link discovery or
duplicate detection) is whether and in which cases embeddings and deep neural network …

WDC products: A multi-dimensional entity matching benchmark

R Peeters, RC Der, C Bizer - arXiv preprint arXiv:2301.09521, 2023 - arxiv.org
The difficulty of an entity matching task depends on a combination of multiple factors such as
the amount of corner-case pairs, the fraction of entities in the test set that have not been …

Intermediate training of BERT for product matching

R Peeters, C Bizer, G Glavaš - small, 2020 - ceur-ws.org
Product matching is the task of deciding if offers originating from different web-shops refer to
the same real-world product. This is a central task for e-commerce applications such as …

BERT-based similarity learning for product matching

J Tracz, PI Wójcik, K Jasinska-Kobus… - … of Workshop on …, 2020 - aclanthology.org
Product matching, i.e., being able to infer the product being sold for a merchant-created offer,
is crucial for any e-commerce marketplace, enabling product-based navigation, price …

Siamese networks for large-scale author identification

C Saedi, M Dras - Computer Speech & Language, 2021 - Elsevier
Authorship attribution is the process of identifying the author of a text. Approaches to tackling
it have been conventionally divided into classification-based ones, which work well for small …

Automated identification of libraries from vulnerability data

Y Chen, AE Santosa, A Sharma, D Lo - Proceedings of the ACM/IEEE …, 2020 - dl.acm.org
Software Composition Analysis (SCA) has gained traction in recent years with a number of
commercial offerings from various companies. SCA involves vulnerability curation process …

Using schema.org annotations for training and maintaining product matchers

R Peeters, A Primpeli, B Wichtlhuber… - Proceedings of the 10th …, 2020 - dl.acm.org
Product matching is a central task within e-commerce applications such as price comparison
portals and online market places. State-of-the-art product matching methods achieve F1 …