Implementation and evaluation of shared-memory communication and synchronization operations in MPICH2 using the Nemesis communication subsystem

D Buntinas, G Mercier, W Gropp - Parallel Computing, 2007 - Elsevier
This paper presents the implementation of MPICH2 over the Nemesis communication
subsystem and the evaluation of its shared-memory performance. We describe design …

[책][B] Algorithms for memory hierarchies: advanced lectures

U Meyer, P Sanders - 2003 - books.google.com
Algorithms that have to process large data sets have to take into account that the cost of
memory access depends on where the data is stored. Traditional algorithm design is based …

PM2: High performance communication middleware for heterogeneous network environments

T Takahashi, S Sumimoto, A Hori… - SC'00: Proceedings …, 2000 - ieeexplore.ieee.org
This paper introduces a high performance communication middle layer, called PM2, for
hetero-geneous network environments. PM2 currently supports Myrinet, Ethernet, and SMP …

[PDF][PDF] BIP-SMP: High performance message passing over a cluster of commodity SMPs

P Geoffray, L Prylli, B Tourancheau - Proceedings of the 1999 ACM/IEEE …, 1999 - dl.acm.org
As we approach the next century, parallel machines are gradually and incrementally being
replaced by clusters of commodity workstations. The price of such a cluster is a fraction of …

System management software for virtual environments

G Vallée, T Naughton, SL Scott - … of the 4th international conference on …, 2007 - dl.acm.org
Recently there has been an increased interest in the use of system-level virtualization using
mature solutions such as Xen, QEMU, or VMWare. These virtualization platforms are being …

Investigating the performance of two programming models for clusters of SMP PCs

F Cappello, O Richard… - … Symposium on High …, 2000 - ieeexplore.ieee.org
Multiprocessors and high performance networks allow building CLUsters of MultiProcessors
(CLUMPs). One distinctive feature over traditional parallel computers is their hybrid memory …

Optimizing collective communications on SMP clusters

MS Wu, RA Kendall, K Wright - 2005 International Conference …, 2005 - ieeexplore.ieee.org
We describe a generic programming model to design collective communications on SMP
clusters. The programming model utilizes shared memory for collective communications and …

Process migration based on gobelins distributed shared memory

G Vallee, C Morin, R Lottiaux… - 2nd IEEE/ACM …, 2002 - ieeexplore.ieee.org
Clusters are attractive for executing sequential and parallel applications. However, there is a
need to design a cluster distributed operating system to provide a Single System Image. A …

Understanding performance of SMP clusters running MPI programs

F Cappello, O Richard, D Etiemble - Future Generation Computer Systems, 2001 - Elsevier
Clusters of multiprocessors (CLUMPs) have an hybrid memory model, with message
passing between nodes and shared memory inside nodes. We examine the performance of …

Proposing a new task model towards many-core architecture

A Shimada, B Gerofi, A Hori, Y Ishikawa - Proceedings of the First …, 2013 - dl.acm.org
Many-core processors are gathering attention in the areas of embedded systems due to their
power-performance ratios. To utilize cores of a many-core processor in parallel …