Big data analytics for manufacturing internet of things: opportunities, challenges and enabling technologies

HN Dai, H Wang, G Xu, J Wan… - Enterprise Information …, 2020 - Taylor & Francis
Data analytics in massive manufacturing data can extract huge business values while can
also result in research challenges due to the heterogeneous data types, enormous volume …

Big data analytics for large-scale wireless networks: Challenges and opportunities

HN Dai, RCW Wong, H Wang, Z Zheng… - ACM Computing …, 2019 - dl.acm.org
The wide proliferation of various wireless communication systems and wireless devices has
led to the arrival of big data era in large-scale wireless networks. Big data of large-scale …

State management in Apache Flink®: consistent stateful distributed stream processing

P Carbone, S Ewen, G Fóra, S Haridi… - Proceedings of the …, 2017 - dl.acm.org
Stream processors are emerging in industry as an apparatus that drives analytical but also
mission critical services handling the core of persistent application logic. Thus, apart from …

The stratosphere platform for big data analytics

A Alexandrov, R Bergmann, S Ewen, JC Freytag… - The VLDB Journal, 2014 - Springer
We present Stratosphere, an open-source software stack for parallel data analysis.
Stratosphere brings together a unique set of features that allow the expressive, easy, and …

Parallel data processing with MapReduce: a survey

KH Lee, YJ Lee, H Choi, YD Chung, B Moon - AcM sIGMoD record, 2012 - dl.acm.org
A prominent parallel data processing tool MapReduce is gaining significant momentum from
both industry and academia as the volume of data to analyze grows rapidly. While …

Skewtune: mitigating skew in mapreduce applications

YC Kwon, M Balazinska, B Howe, J Rolia - Proceedings of the 2012 …, 2012 - dl.acm.org
We present an automatic skew mitigation approach for user-defined MapReduce programs
and present SkewTune, a system that implements this approach as a drop-in replacement …

Apache tez: A unifying framework for modeling and building data processing applications

B Saha, H Shah, S Seth, G Vijayaraghavan… - Proceedings of the …, 2015 - dl.acm.org
The broad success of Hadoop has led to a fast-evolving and diverse ecosystem of
application engines that are building upon the YARN resource management layer. The open …

Hyracks: A flexible and extensible foundation for data-intensive computing

V Borkar, M Carey, R Grover, N Onose… - 2011 IEEE 27th …, 2011 - ieeexplore.ieee.org
Hyracks is a new partitioned-parallel software platform designed to run data-intensive
computations on large shared-nothing clusters of computers. Hyracks allows users to …

[PDF][PDF] 大数据管理: 概念, 技术与挑战

孟小峰, 慈祥 - 2013 - idke.ruc.edu.cn
大数据管理:概念,技术与挑战 Page 1 大数据管理:概念,技术与挑战 孟小峰慈祥 (**人民大学信息
学院北京100872) Big Data Management: Concepts, Techniques and Challenges Meng …

Exploiting dynamic resource allocation for efficient parallel data processing in the cloud

D Warneke, O Kao - IEEE transactions on parallel and …, 2011 - ieeexplore.ieee.org
In recent years ad hoc parallel data processing has emerged to be one of the killer
applications for Infrastructure-as-a-Service (IaaS) clouds. Major Cloud computing …