Fine-grained dynamic resource allocation for big-data applications

L Baresi, A Leva, G Quattrocchi - IEEE Transactions on …, 2019 - ieeexplore.ieee.org
Many big-data applications are batch applications that exploit dedicated frameworks to
perform massively parallel computations across clusters of machines. The time needed to …

Using formal verification to evaluate the execution time of Spark applications

L Baresi, MM Bersani, F Marconi, G Quattrocchi… - Formal Aspects of …, 2020 - Springer
Apache Spark is probably the most widely adopted framework for develo** big-data batch
applications and for executing them on a cluster of (virtual) machines. In general, the more …

Modeling big data processing programs

JB de Souza Neto, AM Moreira, G Vargas-Solar… - Brazilian Symposium on …, 2020 - Springer
We propose a new model for data processing programs. Our model generalizes the data
flow programming style implemented by systems such as Apache Spark, DryadLINQ …

A two-level formal model for Big Data processing programs

JB de Souza Neto, AM Moreira, G Vargas-Solar… - Science of Computer …, 2022 - Elsevier
This paper proposes a model for specifying data flow-based parallel data processing
programs agnostic of target Big Data processing frameworks. The paper focuses on the …

Symbolic execution-driven extraction of the parallel execution plans of spark applications

L Baresi, G Denaro, G Quattrocchi - Proceedings of the 2019 27th ACM …, 2019 - dl.acm.org
The execution of Spark applications is based on the execution order and parallelism of the
different jobs, given data and available resources. Spark reifies these dependencies in a …

Big-data applications as self-adaptive systems of systems

L Baresi, G Denaro… - 2019 IEEE international …, 2019 - ieeexplore.ieee.org
Virtualization technologies have enabled a new way of thinking of computing resources and
cloud computing frameworks offer many pay-per-use solutions for renting these resources …

An Abstract View of Big Data Processing Programs

JBS Neto, AM Moreira, G Vargas-Solar… - arxiv preprint arxiv …, 2021 - arxiv.org
This paper proposes a model for specifying data flow based parallel data processing
programs agnostic of target Big Data processing frameworks. The paper focuses on the …

[PDF][PDF] Modeling Big Data Processing Programs

G Vargas-Solar, MA Musicante - researchgate.net
We propose a new model for data processing programs. Our model generalizes the data
flow programming style implemented by systems such as Apache Spark, DryadLINQ …

[PDF][PDF] Final assessment report and impact analysis

V Papanikolaou - 2018 - wp.doc.ic.ac.uk
Executive summary DICE aims at defining a general-purpose methodology and toolset to
define data-intensive cloud applications. This means that applications defined with DICE …

Using symbolic execution to improve the runtime management of spark applications

D BERTOLOTTI - 2017 - politesi.polimi.it
The need to crunch a steadily growing amount of data generated by the modern applications
is driving an increasing demand of flexible computing power, often satisfied by cloud …