Deep learning approaches for similarity computation: A survey

P Yang, H Wang, J Yang, Z Qian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The requirement for appropriate ways to measure the similarity between data objects is a
common but vital task in various domains, such as data mining, machine learning and so on …

DILOF: Effective and memory efficient local outlier detection in data streams

GS Na, D Kim, H Yu - Proceedings of the 24th ACM SIGKDD …, 2018 - dl.acm.org
With precipitously growing demand to detect outliers in data streams, many studies have
been conducted aiming to develop extensions of well-known outlier detection algorithm …

Fast discrete distribution clustering using Wasserstein barycenter with sparse support

J Ye, P Wu, JZ Wang, J Li - IEEE Transactions on Signal …, 2017 - ieeexplore.ieee.org
In a variety of research areas, the weighted bag of vectors and the histogram are widely
used descriptors for complex objects. Both can be expressed as discrete distributions. D2 …

Combining quantitative and logical data cleaning

N Prokoshyna, J Szlichta, F Chiang, RJ Miller… - Proceedings of the …, 2015 - dl.acm.org
Quantitative data cleaning relies on the use of statistical methods to identify and repair data
quality problems while logical data cleaning tackles the same problems using various forms …

Quantifying differences between UGC and DMO's image content on Instagram using deep learning

Á Díaz-Pacheco, R Guerrero-Rodríguez… - … Technology & Tourism, 2024 - Springer
In the tourism industry, the implementation of effective strategies to promote destinations is
considered of utmost importance. Taking advantage of social media, Destination …

Where is the Soho of Rome? Measures and algorithms for finding similar neighborhoods in cities

G Le Falher, A Gionis, M Mathioudakis - Proceedings of the …, 2015 - ojs.aaai.org
Data generated on location-aware social media provide rich information about the places
(shopping malls, restaurants, cafés, etc) where citizens spend their time. That information …

Trajectory-based spatiotemporal entity linking

F Jin, W Hua, T Zhou, J Xu, M Francia… - … on Knowledge and …, 2020 - ieeexplore.ieee.org
Trajectory-based spatiotemporal entity linking is to match the same moving object in different
datasets based on their movement traces. It is a fundamental step to support spatiotemporal …

Fast dataset search with earth mover's distance

W Yang, S Wang, Y Sun, Z Peng - Proceedings of the VLDB Endowment, 2022 - dl.acm.org
The amount of spatial data in open data portals has increased rapidly, raising the demand
for spatial dataset search in large data repositories. In this paper, we tackle spatial dataset …

Moving object linking based on historical trace

F Jin, W Hua, J Xu, X Zhou - 2019 IEEE 35th International …, 2019 - ieeexplore.ieee.org
The prevalent adoption of GPS-enabled devices has witnessed an explosion of various
location-based services which produce a huge amount of trajectories monitoring an …

Optimizing bipartite matching in real-world applications by incremental cost computation

T Abeywickrama, V Liang, KL Tan - Proceedings of the VLDB …, 2021 - dl.acm.org
The Kuhn-Munkres (KM) algorithm is a classical combinatorial optimization algorithm that is
widely used for minimum cost bipartite matching in many real-world applications, such as …