[HTML][HTML] The landscape of parallel computing research: A view from berkeley

K Asanovic, R Bodik, B Catanzaro, J Gebis… - 2006 - escholarship.org
The recent switch to parallel microprocessors is a milestone in the history of computing.
Industry has laid out a roadmap for multicore designs that preserves the programming …

High performance RDMA-based MPI implementation over InfiniBand

J Liu, J Wu, SP Kini, P Wyckoff, DK Panda - Proceedings of the 17th …, 2003 - dl.acm.org
Although InfiniBand Architecture is relatively new in the high performance computing area, it
offers many features which help us to improve the performance of communication …

An overview of MPI characteristics of exascale proxy applications

B Klenk, H Fröning - … Computing: 32nd International Conference, ISC High …, 2017 - Springer
The scale of applications and computing systems is tremendously increasing and needs to
increase even more to realize exascale systems. As the number of nodes keeps growing …

Combining partial redundancy and checkpointing for HPC

J Elliott, K Kharbas, D Fiala, F Mueller… - 2012 IEEE 32nd …, 2012 - ieeexplore.ieee.org
Today's largest High Performance Computing (HPC) systems exceed one Petaflops (10^ 15)
floating point operations per second) and exascale systems are projected within seven …

Cross-platform performance prediction of parallel applications using partial execution

LT Yang, X Ma, F Mueller - SC'05: Proceedings of the 2005 …, 2005 - ieeexplore.ieee.org
Performance prediction across platforms is increasingly important as developers can choose
from a wide range of execution platforms. The main challenge remains to perform accurate …

Performance comparison of MPI implementations over InfiniBand, Myrinet and Quadrics

J Liu, B Chandrasekaran, J Wu, W Jiang… - Proceedings of the …, 2003 - dl.acm.org
In this paper, we present a comprehensive performance comparison of MPI implementations
over Infini-Band, Myrinet and Quadrics. Our performance evaluation consists of two major …

Workload modeling for performance evaluation

DG Feitelson - IFIP International Symposium on Computer …, 2002 - Springer
The performance of a computer system depends on the characteristics of the workload it
must serve: for example, if work is evenly distributed performance will be better than if it …

Design automation for application-specific on-chip interconnects: A survey

A Cilardo, E Fusella - Integration, 2016 - Elsevier
On-chip interconnects provide a vital facility for highly parallel MultiProcessor Systems-on-
Chip, particularly in data-intensive applications, where the choice of the underlying …

High performance interconnect network for Tianhe system

XK Liao, ZB Pang, KF Wang, YT Lu, M **e, J **a… - Journal of Computer …, 2015 - Springer
In this paper, we present the Tianhe-2 interconnect network and message passing services.
We describe the architecture of the router and network interface chips, and highlight a set of …

RDMA read based rendezvous protocol for MPI over InfiniBand: design alternatives and benefits

S Sur, HW **, L Chai, DK Panda - … on Principles and practice of parallel …, 2006 - dl.acm.org
Message Passing Interface (MPI) is a popular parallel programming model for scientific
applications. Most high-performance MPI implementations use Rendezvous Protocol for …