Challenges in high-performance computing

POA Navaux, AF Lorenzon… - Journal of the Brazilian …, 2023 - sol.sbc.org.br
High-Performance Computing, HPC, has become one of the most active computer
science fields. Driven mainly by the need for high processing capabilities required by …

Rcmp: Reconstructing RDMA-Based Memory Disaggregation via CXL

Z Wang, Y Guo, K Lu, J Wan, D Wang, T Yao… - ACM Transactions on …, 2024 - dl.acm.org
Memory disaggregation is a promising architecture for modern datacenters that separates
compute and memory resources into independent pools connected by ultra-fast networks …

A quantitative approach for adopting disaggregated memory in HPC systems

J Wahlgren, G Schieffer, M Gokhale… - Proceedings of the …, 2023 - dl.acm.org
Memory disaggregation has recently been adopted in data centers to improve resource
utilization, motivated by cost and sustainability. Recent studies on large-scale HPC facilities …

Exploring Numba and CuPy for GPU-Accelerated Monte Carlo Radiation Transport

T Askar, A Yergaliyev, B Shukirgaliyev, E Abdikamalov - Computation, 2024 - mdpi.com
This paper examines the performance of two popular GPU programming platforms, Numba
and CuPy, for Monte Carlo radiation transport calculations. We conducted tests involving …

A survey of compute nodes with 100 TFLOPS and beyond for supercomputers

J Chang, K Lu, Y Guo, Y Wang, Z Zhao… - CCF Transactions on …, 2024 - Springer
With the Frontier supercomputer ranked first on the Top500 list, it marks the era of exascale
computing power for supercomputers, employing the compute nodes with double-precision …

Shifting Between Compute and Memory Bounds: A Compression-Enabled Roofline Model

R Naraparaju, T Zhao, Y Hu, D Zhao… - SC24-W: Workshops …, 2024 - ieeexplore.ieee.org
In the evolving landscape of high-performance computing, especially to fight the end of
Moore's Law and Dennard's Scaling, the ability to shift between compute-bound and …

ngAP: Non-blocking Large-scale Automata Processing on GPUs

T Ge, T Zhang, H Liu - Proceedings of the 29th ACM International …, 2024 - dl.acm.org
Finite automata serve as compute kernels for various applications that require high
throughput. However, despite the increasing compute power of GPUs, their potential in …

Exascale Quantum Mechanical Simulations: Navigating the Shifting Sands of Hardware and Software

R Shinde, C Filippi, A Scemama, W Jalby - arXiv preprint arXiv …, 2024 - arxiv.org
The era of exascale computing presents both exciting opportunities and unique challenges
for quantum mechanical simulations. While the transition from petaflops to exascale …

Data-Oriented Operating System for Big Data and Cloud.

SD Kessler, KW Ng, SC Haw - Intelligent Automation & Soft …, 2024 - search.ebscohost.com
Operating System (OS) is a critical piece of software that manages a computer's hardware
and resources, acting as the intermediary between the computer and the user. The existing …

CPU-GPU Tuning for Modern Scientific Applications using Node-Level Heterogeneity

M Thavappiragasam, V Kale - 2023 IEEE 30th International …, 2023 - ieeexplore.ieee.org
Scientific applications must be tuned for performance to run efficiently on supercomputers
having nodes with a CPU (or, a general-purpose host processor) and GPUs (or, accelerator …