FPGA-based near-memory acceleration of modern data-intensive applications

G Singh, M Alser, DS Cali, D Diamantopoulos… - IEEE Micro, 2021 - ieeexplore.ieee.org
Modern data-intensive applications demand high computational capabilities with strict
power constraints. Unfortunately, such applications suffer from a significant waste of both …

NERO: A near high-bandwidth memory stencil accelerator for weather prediction modeling

G Singh, D Diamantopoulos… - … Conference on Field …, 2020 - ieeexplore.ieee.org
Ongoing climate change calls for fast and accurate weather and climate modeling. However,
when solving large-scale weather prediction simulations, state-of-the-art CPU and GPU …

Accelerating weather prediction using near-memory reconfigurable fabric

G Singh, D Diamantopoulos, J Gómez-Luna… - ACM Transactions on …, 2022 - dl.acm.org
Ongoing climate change calls for fast and accurate weather and climate modeling. However,
when solving large-scale weather prediction simulations, state-of-the-art CPU and GPU …

Optimizing Cloud Computing Resource Usage for Hemodynamic Simulation

W Ladd, C Jensen, M Vardhan, J Ames… - 2023 IEEE …, 2023 - ieeexplore.ieee.org
Cloud computing resources are becoming an increasingly attractive option for simulation
workflows but require users to assess a wider variety of hardware options and associated …

Highly scalable parallel genetic algorithm on Sunway many-core processors

Z Xiao, X Liu, J Xu, Q Sun, L Gan - Future Generation Computer Systems, 2021 - Elsevier
As a heuristic method, the genetic algorithm provides promising solutions with impressive
performance benefits for large-scale problems. In this study, we propose a highly scalable …

NARMADA: Near-memory horizontal diffusion accelerator for scalable stencil computations

G Singh, D Diamantopoulos… - … Conference on Field …, 2019 - ieeexplore.ieee.org
Real-world weather forecasting applications consist of compound stencil kernels that do not
perform well on conventional architectures. This behavior is due to their complex data …

Enabling large-scale simulation of CAM on the Sunway TaihuLight supercomputer

Y Li, X Duan, L Gan, W Wan, Y Chen… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
The Community Atmosphere Model (CAM) has been ported, redesigned, and scaled to the
full system of the Sunway TaihuLight, and provides peta-scale climate modeling …

Low precision processing for high order stencil computations

G Singh, D Diamantopoulos, S Stuijk… - … , and Simulation: 19th …, 2019 - Springer
Modern scientific workloads have demonstrated the inefficiency of using high precision
formats. Moving to a lower bit format or even to a different number system can provide …

Designing, modeling, and optimizing data-intensive computing systems

G Singh - arXiv preprint arXiv:2208.08886, 2022 - arxiv.org
The cost of moving data between the memory units and the compute units is a major
contributor to the execution time and energy consumption of modern workloads in …

Million-core-scalable simulation of the elastic migration algorithm on Sunway TaihuLight supercomputer

L Gan, J Xu, X Wang, S Wu, X Duan, Y Li… - 2019 19th IEEE/ACM …, 2019 - ieeexplore.ieee.org
Migration algorithm is one of the most essential methods in seismic application to image the
underground geology, and to help scientists and researchers in geophysics exploration …