ReCG: Bottom-up JSON Schema Discovery Using a Repetitive Cluster-and-Generalize Framework

J Yun, B Tak, WS Han - Proceedings of the VLDB Endowment, 2024 - dl.acm.org
The schemalessness, one of the major advantages of JSON representation format, comes
with high penalties in querying and operations by denying various critical functions such as …

Clustering raw sensor data in process logs to detect data streams

M Ehrendorfer, J Mangler, S Rinderle-Ma - International Conference on …, 2023 - Springer
The execution and analysis of processes is strongly influenced by sensor streams, eg,
temperature, that are measured in parallel to the process execution and stored in process …

Temporal JSON Keyword Search

C Dyreson, A Shatnawi, SS Bhowmick… - Proceedings of the ACM …, 2024 - dl.acm.org
JSON keyword search searches the current versions of documents in a collection. However,
JSON documents change over time due to edits. Some applications, such as data forensics …

FUDJ: Flexible User-Defined Distributed Joins

A Sevim, A Eldawy, EP Carman… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Join operations are crucial in data analysis, but can suffer inefficiency with large datasets
and complex non-equality-based conditions. Optimized join algorithms have gained traction …

Subtree Similarity Search Based on Structure and Text

T Mizokami, S Bou, T Amagasa - … Conference on Big Data Analytics and …, 2024 - Springer
Given a query tree, the subtree similarity search problem is finding all subtrees in a
document tree that are similar to the query tree. The previous scan-based method extracts …

[BOOK][B] Fundamentals of Information Systems Interoperability: Data, Services, and Processes

S Rinderle-Ma, J Mangler, D Ritter - 2024 - books.google.com
This book presents fundamental concepts and technologies to tackle interoperability
between information systems. It details interoperability at the data, service, and process …

Scalable Computation of Fuzzy Joins Over Large Collections of JSON Data

R Uhartegaray, L d'Orazio, M Damigos… - … Conference on Fuzzy …, 2023 - ieeexplore.ieee.org
Fuzzy joins are widely used in a variety of data analysis applications such as data
integration, data mining, and master data management. In the context of Big Data …

An adaptable JSON Diff Framework

A Sun - arxiv preprint arxiv:2305.05865, 2023 - arxiv.org
In this paper, we present an implementation of JSON-diff framework JYCM, extending the
existing framework by introducing the concept of" unordered" comparisons and allowing …

Transformation and Integration of Exchange Formats

S Rinderle-Ma, J Mangler, D Ritter - Fundamentals of Information Systems …, 2024 - Springer
This chapter presents concepts and technologies for transforming exchange formats, ie,
XPath, XSLT, XML Schema, and RelaxNG, into each other and hence furthering the …

Feedforward-Aided Course Designs for Similarity Search

T Hütter, D Kocher - Proceedings of the 2nd International Workshop on …, 2023 - dl.acm.org
In this paper, we present two feedforward-aided designs for a Master's level course on
similarity search based on different teaching methods: In project-based learning, the …