Programming languages for data-intensive HPC applications: A systematic mapping study

V Amaral, B Norberto, M Goulão, M Aldinucci… - Parallel Computing, 2020 - Elsevier
A major challenge in modelling and simulation is the need to combine expertise in both
software technologies and a given scientific domain. When High-Performance Computing …

Makeflow: A portable abstraction for data intensive computing on clusters, clouds, and grids

M Albrecht, P Donnelly, P Bui, D Thain - Proceedings of the 1st ACM …, 2012 - dl.acm.org
In recent years, there has been a renewed interest in languages and systems for large scale
distributed computing. Unfortunately, most systems available to the end user use a custom …

Pydron: Semi-Automatic Parallelization for Multi-Core and the Cloud

SC Müller, G Alonso, A Amara, A Csillaghy - 11th USENIX Symposium …, 2014 - usenix.org
The cloud, rack-scale computing, and multi-core are the basis for today's computing
platforms. Their intrinsic parallelism is a challenge for programmers, especially in areas …

Adapting bioinformatics applications for heterogeneous systems: a case study

I Lanc, P Bui, D Thain, S Emrich - Proceedings of the second …, 2011 - dl.acm.org
The advent of new sequencing technologies has generated extremely large amounts of
information. To successfully apply bioinformatics tools to such large datasets, they need to …

[BOOK] A compiler toolchain for distributed data intensive scientific workflows

P Bui - 2012 - search.proquest.com
With the growing amount of computational resources available to researchers today and the
explosion of scientific data in modern research, it is imperative that scientists be able to …

Balancing push and pull in Confuga, an active storage cluster file system for scientific workflows

P Donnelly, D Thain - Concurrency and Computation: Practice …, 2017 - Wiley Online Library
Most big‐data analysis systems require users to adopt restricted abstractions to achieve
scaling and system stability. While highly effective at establishing data locality and …

[BOOK] Data locality techniques in an active cluster file system designed for scientific workflows

PJ Donnelly - 2016 - search.proquest.com
The continued exponential growth of storage capacity has catalyzed the broad acquisition of
scientific data which must be processed. While today's large data analysis systems are …

[BOOK] Principles for the design and operation of elastic scientific applications on distributed systems

DR Pandiarajan - 2015 - search.proquest.com
Scientific applications often harness the concurrency in their workloads to partition and
operate them as independent tasks and achieve reasonable performance. To improve …

[PDF] Bernd Bischl, Michel Lang, Olaf Mersmann, Jörg Rahnenführer, Claus Weihs

J Rahnenführer - sfb876.tu-dortmund.de
Empirical analysis of statistical algorithms often demands time-consuming experiments
which are best performed on high performance computing clusters. We present two R …

[CITATION] Computing on high performance clusters with R: Packages BatchJobs and BatchExperiments