A survey on the evolution of stream processing systems

M Fragkoulis, P Carbone, V Kalavri, A Katsifodimos - The VLDB Journal, 2024 - Springer
Stream processing has been an active research field for more than 20 years, but it is now
witnessing its prime time due to recent successful efforts by the research community and …

Consistency and completeness: Rethinking distributed stream processing in apache kafka

G Wang, L Chen, A Dikshit, J Gustafson… - Proceedings of the …, 2021 - dl.acm.org
An increasingly important system requirement for distributed stream processing applications
is to provide strong correctness guarantees under unexpected failures and out-of-order data …

Survey of window types for aggregation in stream processing systems

J Verwiebe, PM Grulich, J Traub, V Markl - The VLDB Journal, 2023 - Springer
In this paper, we present the first comprehensive survey of window types for stream
processing systems which have been presented in research and commercial systems. We …

Analyzing efficient stream processing on modern hardware

S Zeuch, BD Monte, J Karimov, C Lutz, M Renz… - Proceedings of the …, 2019 - dl.acm.org
Modern Stream Processing Engines (SPEs) process large data volumes under tight latency
constraints. Many SPEs execute processing pipelines using message passing on shared …

The NebulaStream platform: data and application management for the internet of things

S Zeuch, A Chaudhary, B Del Monte… - arxiv preprint arxiv …, 2019 - arxiv.org
The Internet of Things (IoT) presents a novel computing architecture for data management: a
distributed, highly dynamic, and heterogeneous environment of massive scale. Applications …

Grizzly: Efficient stream processing through adaptive query compilation

PM Grulich, B Sebastian, S Zeuch, J Traub… - Proceedings of the …, 2020 - dl.acm.org
Stream Processing Engines (SPEs) execute long-running queries on unbounded data
streams. They follow an interpretation-based processing model and do not perform runtime …

{TVA}: A multi-party computation system for secure and expressive time series analytics

M Faisal, J Zhang, J Liagouris, V Kalavri… - 32nd USENIX Security …, 2023 - usenix.org
We present TVA, a multi-party computation (MPC) system for secure analytics on secret-
shared time series data. TVA achieves strong security guarantees in the semi-honest and …

Efficient placement of decomposable aggregation functions for stream processing over large geo-distributed topologies

X Chatziliadis, ET Zacharatou, A Eracar… - Proceedings of the …, 2024 - dl.acm.org
A recent trend in stream processing is offloading the computation of decomposable
aggregation functions (DAF) from cloud nodes to geo-distributed fog/edge devices to …

Materialization and reuse optimizations for production data science pipelines

B Derakhshan, A Rezaei Mahdiraji, Z Kaoudi… - Proceedings of the …, 2022 - dl.acm.org
Many companies and businesses train and deploy machine learning (ML) pipelines to
answer prediction queries. In many applications, new training data continuously becomes …

Real-time big data analytics for data stream challenges: an overview

AA Hassan, TM Hassan - European Journal of Information …, 2022 - ej-compute.org
The conventional approach of evaluating massive data is inappropriate for real-time
analysis; therefore, analyzing big data in a data stream remains a critical issue for numerous …