ReCG: Bottom-up JSON Schema Discovery Using a Repetitive Cluster-and-Generalize Framework
The schemalessness, one of the major advantages of JSON representation format, comes
with high penalties in querying and operations by denying various critical functions such as …
with high penalties in querying and operations by denying various critical functions such as …
Clustering raw sensor data in process logs to detect data streams
The execution and analysis of processes is strongly influenced by sensor streams, eg,
temperature, that are measured in parallel to the process execution and stored in process …
temperature, that are measured in parallel to the process execution and stored in process …
Temporal JSON Keyword Search
JSON keyword search searches the current versions of documents in a collection. However,
JSON documents change over time due to edits. Some applications, such as data forensics …
JSON documents change over time due to edits. Some applications, such as data forensics …
FUDJ: Flexible User-Defined Distributed Joins
Join operations are crucial in data analysis, but can suffer inefficiency with large datasets
and complex non-equality-based conditions. Optimized join algorithms have gained traction …
and complex non-equality-based conditions. Optimized join algorithms have gained traction …
Subtree Similarity Search Based on Structure and Text
Given a query tree, the subtree similarity search problem is finding all subtrees in a
document tree that are similar to the query tree. The previous scan-based method extracts …
document tree that are similar to the query tree. The previous scan-based method extracts …
[BOOK][B] Fundamentals of Information Systems Interoperability: Data, Services, and Processes
This book presents fundamental concepts and technologies to tackle interoperability
between information systems. It details interoperability at the data, service, and process …
between information systems. It details interoperability at the data, service, and process …
Scalable Computation of Fuzzy Joins Over Large Collections of JSON Data
Fuzzy joins are widely used in a variety of data analysis applications such as data
integration, data mining, and master data management. In the context of Big Data …
integration, data mining, and master data management. In the context of Big Data …
An adaptable JSON Diff Framework
A Sun - arxiv preprint arxiv:2305.05865, 2023 - arxiv.org
In this paper, we present an implementation of JSON-diff framework JYCM, extending the
existing framework by introducing the concept of" unordered" comparisons and allowing …
existing framework by introducing the concept of" unordered" comparisons and allowing …
Transformation and Integration of Exchange Formats
This chapter presents concepts and technologies for transforming exchange formats, ie,
XPath, XSLT, XML Schema, and RelaxNG, into each other and hence furthering the …
XPath, XSLT, XML Schema, and RelaxNG, into each other and hence furthering the …
Feedforward-Aided Course Designs for Similarity Search
In this paper, we present two feedforward-aided designs for a Master's level course on
similarity search based on different teaching methods: In project-based learning, the …
similarity search based on different teaching methods: In project-based learning, the …