Recent advancements in event processing
Event processing (EP) is a data processing technology that conducts online processing of
event information. In this survey, we summarize the latest cutting-edge work done on EP …
event information. In this survey, we summarize the latest cutting-edge work done on EP …
[PDF][PDF] Apache flink: Stream and batch processing in a single engine
Apache Flink 1 is an open-source system for processing streaming and batch data. Flink is
built on the philosophy that many classes of data processing applications, including real …
built on the philosophy that many classes of data processing applications, including real …
Videoedge: Processing camera streams using hierarchical clusters
Organizations deploy a hierarchy of clusters-cameras, private clusters, public clouds-for
analyzing live video feeds from their cameras. Video analytics queries have many …
analyzing live video feeds from their cameras. Video analytics queries have many …
Live video analytics at scale with approximation and {Delay-Tolerance}
Video cameras are pervasively deployed for security and smart city scenarios, with millions
of them in large cities worldwide. Achieving the potential of these cameras requires …
of them in large cities worldwide. Achieving the potential of these cameras requires …
The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing
T Akidau, R Bradshaw, C Chambers… - Proceedings of the …, 2015 - dl.acm.org
Unbounded, unordered, global-scale datasets are increasingly common in day-to-day
business (eg Web logs, mobile usage statistics, and sensor networks). At the same time …
business (eg Web logs, mobile usage statistics, and sensor networks). At the same time …
In-memory big data management and processing: A survey
Growing main memory capacity has fueled the development of in-memory big data
management and processing. By eliminating disk I/O bottleneck, it is now possible to support …
management and processing. By eliminating disk I/O bottleneck, it is now possible to support …
Structured streaming: A declarative api for real-time applications in apache spark
With the ubiquity of real-time data, organizations need streaming systems that are scalable,
easy to use, and easy to integrate into business applications. Structured Streaming is a new …
easy to use, and easy to integrate into business applications. Structured Streaming is a new …
InferLine: latency-aware provisioning and scaling for prediction serving pipelines
Serving ML prediction pipelines spanning multiple models and hardware accelerators is a
key challenge in production machine learning. Optimally configuring these pipelines to meet …
key challenge in production machine learning. Optimally configuring these pipelines to meet …
Approximate query processing: No silver bullet
In this paper, we reflect on the state of the art of Approximate Query Processing. Although
much technical progress has been made in this area of research, we are yet to see its impact …
much technical progress has been made in this area of research, we are yet to see its impact …
Consistency and completeness: Rethinking distributed stream processing in apache kafka
G Wang, L Chen, A Dikshit, J Gustafson… - Proceedings of the …, 2021 - dl.acm.org
An increasingly important system requirement for distributed stream processing applications
is to provide strong correctness guarantees under unexpected failures and out-of-order data …
is to provide strong correctness guarantees under unexpected failures and out-of-order data …