Big data analytics on Apache Spark

S Salloum, R Dautov, X Chen, PX Peng… - International Journal of …, 2016 - Springer
Apache Spark has emerged as the de facto framework for big data analytics with its
advanced in-memory programming model and upper-level libraries for scalable machine …

Data management in machine learning: Challenges, techniques, and systems

A Kumar, M Boehm, J Yang - Proceedings of the 2017 ACM International …, 2017 - dl.acm.org
Large-scale data analytics using statistical machine learning (ML), popularly called
advanced analytics, underpins many modern data-driven applications. The data …

Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms

E Cortez, A Bonde, A Muzio, M Russinovich… - Proceedings of the 26th …, 2017 - dl.acm.org
Cloud research to date has lacked data on the characteristics of the production virtual
machine (VM) workloads of large cloud providers. A thorough understanding of these …

Data validation for machine learning

N Polyzotis, M Zinkevich, S Roy… - … of machine learning …, 2019 - proceedings.mlsys.org
Abstract Machine learning is a powerful tool for gleaning knowledge from massive amounts
of data. While a great deal of machine learning research has focused on improving the …

{TensorFlow}: a system for {Large-Scale} machine learning

M Abadi, P Barham, J Chen, Z Chen, A Davis… - … USENIX symposium on …, 2016 - usenix.org
TensorFlow is a machine learning system that operates at large scale and in heterogeneous
environments. Tensor-Flow uses dataflow graphs to represent computation, shared state …

Clipper: A {Low-Latency} online prediction serving system

D Crankshaw, X Wang, G Zhou, MJ Franklin… - … USENIX Symposium on …, 2017 - usenix.org
Clipper: A Low-Latency Online Prediction Serving System Page 1 This paper is included in the
Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation …

Tensorflow-serving: Flexible, high-performance ml serving

C Olston, N Fiedel, K Gorovoy, J Harmsen… - arxiv preprint arxiv …, 2017 - arxiv.org
We describe TensorFlow-Serving, a system to serve machine learning models inside
Google which is also available in the cloud and via open-source. It is extremely flexible in …

Data lifecycle challenges in production machine learning: a survey

N Polyzotis, S Roy, SE Whang, M Zinkevich - ACM SIGMOD Record, 2018 - dl.acm.org
Machine learning has become an essential tool for gleaning knowledge from data and
tackling a diverse set of computationally hard tasks. However, the accuracy of a machine …

Serving deep learning models in a serverless platform

V Ishakian, V Muthusamy… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Serverless computing has emerged as a compelling paradigm for the development and
deployment of a wide range of event based cloud applications. At the same time, cloud …

Cocktail: A multidimensional optimization for model serving in cloud

JR Gunasekaran, CS Mishra, P Thinakaran… - … USENIX Symposium on …, 2022 - usenix.org
With a growing demand for adopting ML models for a variety of application services, it is vital
that the frameworks serving these models are capable of delivering highly accurate …