[BOOK][B] High performance visualization: enabling extreme-scale scientific insight

EW Bethel, H Childs, C Hansen - 2012 - books.google.com
Visualization and analysis tools, techniques, and algorithms have undergone a rapid
evolution in recent decades to accommodate explosive growth in data size and complexity …

Java in the high performance computing arena: Research, practice and experience

GL Taboada, S Ramos, RR Expósito, J Tourino… - Science of Computer …, 2013 - Elsevier
The rising interest in Java for High Performance Computing (HPC) is based on the
appealing features of this language for programming multi-core cluster architectures …

Computational strategies for scalable genomics analysis

L Shi, Z Wang - Genes, 2019 - mdpi.com
The revolution in next-generation DNA sequencing technologies is leading to explosive data
growth in genomics, posing a significant challenge to the computing infrastructure and …

Streamline integration using MPI-hybrid parallelism on a large multicore architecture

D Camp, C Garth, H Childs… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
Streamline computation in a very large vector field data set represents a significant
challenge due to the nonlocal and data-dependent nature of streamline integration. In this …

Hybrid parallelism for volume rendering on large-, multi-, and many-core systems

M Howison, EW Bethel, H Childs - IEEE Transactions on …, 2011 - ieeexplore.ieee.org
With the computing industry trending toward multi- and many-core processors, we study how
a standard visualization algorithm, raycasting volume rendering, can benefit from a hybrid …

MPI-hybrid parallelism for volume rendering on large, multi-core systems

M Howison - 2010 - escholarship.org
This work studies the performance and scalability characteristics of "hybrid" parallel
programming and execution as applied to raycasting volume rendering--a staple …

Optimizing sparse matrix assembly in finite element solvers with one-sided communication

N Jansson - … Conference on High Performance Computing for …, 2012 - Springer
In parallel finite element solvers, sparse matrix assembly is often a bottleneck. Implemented
using message passing, latency from message matching starts to limit performance as the …

Optimizing a parallel runtime system for multicore clusters: a case study

C Mei, G Zheng, F Gioachin, LV Kalé - Proceedings of the 2010 TeraGrid …, 2010 - dl.acm.org
Clusters of multicore nodes have become the most popular option for new HPC systems due
to their scalability and performance/cost ratio. The complexity of programming multicore …

A preliminary evaluation of the hardware acceleration of the Cray Gemini interconnect for PGAS languages and comparison with MPI

H Shan, NJ Wright, J Shalf, K Yelick… - ACM SIGMETRICS …, 2012 - dl.acm.org
The Gemini interconnect on the Cray XE6 platform provides for lightweight remote direct
memory access (RDMA) between nodes, which is useful for implementing partitioned global …

Task generation and compile-time scheduling for mixed data-control embedded software

J Cortadella, A Kondratyev, L Lavagno, M Massot… - 1999 - upcommons.upc.edu
A method for synthesizing code for the software component of a system is proposed. The
specification is given as a set of concurrent processes that communicate through channels …