Big data systems: A software engineering perspective

A Davoudian, M Liu - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Big Data Systems (BDSs) are an emerging class of scalable software technologies whereby
massive amounts of heterogeneous data are gathered from multiple sources, managed …

Toward data lakes as central building blocks for data management and analysis

P Wieder, H Nolte - Frontiers in big Data, 2022 - frontiersin.org
Data lakes are a fundamental building block for many industrial data analysis solutions and
becoming increasingly popular in research. Often associated with big data use cases, data …

Constance: An intelligent data lake system

R Hai, S Geisler, C Quix - … of the 2016 international conference on …, 2016 - dl.acm.org
As the challenge of our time, Big Data still has many research hassles, especially the variety
of data. The high diversity of data sources often results in information silos, a collection of …

Data lakes: A survey of functions and systems

R Hai, C Koutras, C Quix… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Data lakes are becoming increasingly prevalent for Big Data management and data
analytics. In contrast to traditional 'schema-on-write'approaches such as data warehouses …

[HTML][HTML] SemML: Facilitating development of ML models for condition monitoring with semantics

B Zhou, Y Svetashova, A Gusmao, A Soylu… - Journal of Web …, 2021 - Elsevier
Monitoring of the state, performance, quality of operations and other parameters of
equipment and production processes, which is typically referred to as condition monitoring …

Data ecosystems: Sovereign data exchange among organizations (Dagstuhl Seminar 19391)

C Cappiello, A Gal, M Jarke, J Rehof - Dagstuhl Reports, 2020 - drops.dagstuhl.de
This report documents the program and the outcomes of Dagstuhl Seminar 19391``Data
Ecosystems: Sovereign Data Exchange among Organizations''. The goal of the seminar was …

Ontology-enhanced machine learning: a Bosch use case of welding quality monitoring

Y Svetashova, B Zhou, T Pychynski, S Schmidt… - The Semantic Web …, 2020 - Springer
In the automotive industry, welding is a critical process of automated manufacturing and its
quality monitoring is important. IoT technologies behind automated factories enable …

Operationalizing and automating data governance

S Nadal, P Jovanovic, B Bilalli, O Romero - Journal of big data, 2022 - Springer
The ability to cross data from multiple sources represents a competitive advantage for
organizations. Yet, the governance of the data lifecycle, from the data sources into valuable …

Ontario: Federated query processing against a semantic data lake

KM Endris, PD Rohde, ME Vidal, S Auer - … 29, 2019, Proceedings, Part I 30, 2019 - Springer
Data lakes enable flexible knowledge discovery and reduce the overhead of materialized
data integration. Albeit effective for data storage, query execution over data lakes may be …

An integration-oriented ontology to govern evolution in big data ecosystems

S Nadal, O Romero, A Abelló, P Vassiliadis… - Information systems, 2019 - Elsevier
Big Data architectures allow to flexibly store and process heterogeneous data, from multiple
sources, in their original format. The structure of those data, commonly supplied by means of …