FPGA-based near-memory acceleration of modern data-intensive applications
Modern data-intensive applications demand high computational capabilities with strict
power constraints. Unfortunately, such applications suffer from a significant waste of both …
NERO: A near high-bandwidth memory stencil accelerator for weather prediction modeling
Ongoing climate change calls for fast and accurate weather and climate modeling. However,
when solving large-scale weather prediction simulations, state-of-the-art CPU and GPU …
Accelerating weather prediction using near-memory reconfigurable fabric
Ongoing climate change calls for fast and accurate weather and climate modeling. However,
when solving large-scale weather prediction simulations, state-of-the-art CPU and GPU …
Optimizing Cloud Computing Resource Usage for Hemodynamic Simulation
Cloud computing resources are becoming an increasingly attractive option for simulation
workflows but require users to assess a wider variety of hardware options and associated …
Highly scalable parallel genetic algorithm on Sunway many-core processors
Z **ao, X Liu, J Xu, Q Sun, L Gan - Future Generation Computer Systems, 2021 - Elsevier
As a heuristic method, the genetic algorithm provides promising solutions with impressive
performance benefits for large-scale problems. In this study, we propose a highly scalable …
NARMADA: Near-memory horizontal diffusion accelerator for scalable stencil computations
Real-world weather forecasting applications consist of compound stencil kernels that do not
perform well on conventional architectures. This behavior is due to their complex data …
Enabling large-scale simulation of CAM on the Sunway TaihuLight supercomputer
The Community Atmosphere Model (CAM) has been ported, redesigned, and scaled to the
full system of the Sunway TaihuLight, and provides peta-scale climate modeling …
Low precision processing for high order stencil computations
Modern scientific workloads have demonstrated the inefficiency of using high precision
formats. Moving to a lower bit format or even to a different number system can provide …
Designing, modeling, and optimizing data-intensive computing systems
G Singh - arXiv preprint arXiv:2208.08886, 2022 - arxiv.org
The cost of moving data between the memory units and the compute units is a major
contributor to the execution time and energy consumption of modern workloads in …
Million-core-scalable simulation of the elastic migration algorithm on Sunway TaihuLight supercomputer
Migration algorithm is one of the most essential methods in seismic application to image the
underground geology, and to help scientists and researchers in geophysics exploration …