Gaia:{Geo-Distributed} machine learning approaching {LAN} speeds

K Hsieh, A Harlap, N Vijaykumar, D Konomis… - … USENIX symposium on …, 2017 - usenix.org
Machine learning (ML) is widely used to derive useful information from large-scale data
(such as user activities, pictures, and videos) generated at increasingly rapid rates, all over …

Awstream: Adaptive wide-area streaming analytics

B Zhang, X **, S Ratnasamy, J Wawrzynek… - Proceedings of the 2018 …, 2018 - dl.acm.org
The emerging class of wide-area streaming analytics faces the challenge of scarce and
variable WAN bandwidth. Non-adaptive applications built with TCP or UDP suffer from …

Low latency geo-distributed data analytics

Q Pu, G Ananthanarayanan, P Bodik… - ACM SIGCOMM …, 2015 - dl.acm.org
Low latency analytics on geographically distributed datasets (across datacenters, edge
clusters) is an upcoming and increasingly important challenge. The dominant approach of …

Internet of Things and data mining: From applications to techniques and systems

MM Gaber, A Aneiba, S Basurra, O Batty… - … : Data Mining and …, 2019 - Wiley Online Library
The Internet of Things (IoT) is the result of the convergence of sensing, computing, and
networking technologies, allowing devices of varying sizes and computational capabilities …

Apache tez: A unifying framework for modeling and building data processing applications

B Saha, H Shah, S Seth, G Vijayaraghavan… - Proceedings of the …, 2015 - dl.acm.org
The broad success of Hadoop has led to a fast-evolving and diverse ecosystem of
application engines that are building upon the YARN resource management layer. The open …

Architecting and develo** big data-driven innovation (DDI) in the digital economy

S Sultana, S Akter, E Kyriazis… - Journal of Global …, 2021 - igi-global.com
To revamp with new creative age characterized by ongoing digital transformation, more and
more industries are capitalizing on digital innovation for their sustainable business growth …

Federated self-supervised learning of multisensor representations for embedded intelligence

A Saeed, FD Salim, T Ozcelebi… - IEEE Internet of Things …, 2020 - ieeexplore.ieee.org
Smartphones, wearables, and Internet-of-Things (IoT) devices produce a wealth of data that
cannot be accumulated in a centralized repository for learning supervised models due to …

Global analytics in the face of bandwidth and regulatory constraints

A Vulimiri, C Curino, PB Godfrey, T Jungblut… - … USENIX Symposium on …, 2015 - usenix.org
Global-scale organizations produce large volumes of data across geographically distributed
data centers. Querying and analyzing such data as a whole introduces new research issues …

Towards a computing continuum: Enabling edge-to-cloud integration for data-driven workflows

D Balouek-Thomert, EG Renart… - … Journal of High …, 2019 - journals.sagepub.com
Dramatic changes in the technology landscape marked by increasing scales and
pervasiveness of compute and data have resulted in the proliferation of edge applications …

Scheduling jobs across geo-distributed datacenters

CC Hung, L Golubchik, M Yu - Proceedings of the Sixth ACM Symposium …, 2015 - dl.acm.org
With growing data volumes generated and stored across geo-distributed datacenters, it is
becoming increasingly inefficient to aggregate all data required for computation at a single …