A survey of performance optimization for mobile applications
To ensure user satisfaction and success of mobile applications, it is important to provide
highly performant applications. This is particularly important for resource-constrained …
highly performant applications. This is particularly important for resource-constrained …
Understanding GPU power: A survey of profiling, modeling, and simulation methods
RA Bridges, N Imam, TM Mintz - ACM Computing Surveys (CSUR), 2016 - dl.acm.org
Modern graphics processing units (GPUs) have complex architectures that admit
exceptional performance and energy efficiency for high-throughput applications. Although …
exceptional performance and energy efficiency for high-throughput applications. Although …
Accel-Sim: An extensible simulation framework for validated GPU modeling
In computer architecture, significant innovation frequently comes from industry. However, the
simulation tools used by industry are often not released for open use, and even when they …
simulation tools used by industry are often not released for open use, and even when they …
Nvbit: A dynamic binary instrumentation framework for nvidia gpus
Binary instrumentation frameworks are widely used to implement profilers, performance
evaluation, error checking, and bug detection tools. While dynamic binary instrumentation …
evaluation, error checking, and bug detection tools. While dynamic binary instrumentation …
Nvbitfi: Dynamic fault injection for gpus
GPUs have found wide acceptance in domains such as high-performance computing and
autonomous vehicles, which require fast processing of large amounts of data along with …
autonomous vehicles, which require fast processing of large amounts of data along with …
SASSIFI: An architecture-level fault injection tool for GPU application resilience evaluation
As GPUs become more pervasive in both scalable high-performance computing systems
and safety-critical embedded systems, evaluating and analyzing their resilience to soft errors …
and safety-critical embedded systems, evaluating and analyzing their resilience to soft errors …
Optimizing software-directed instruction replication for gpu error detection
Application execution on safety-critical and high-performance computer systems must be
resilient to transient errors. As GPUs become more pervasive in such systems, they must …
resilient to transient errors. As GPUs become more pervasive in such systems, they must …
Ten quick tips for bioinformatics analyses using an Apache Spark distributed computing environment
Some scientific studies involve huge amounts of bioinformatics data that cannot be analyzed
on personal computers usually employed by researchers for day-to-day activities but rather …
on personal computers usually employed by researchers for day-to-day activities but rather …
Hybrid, scalable, trace-driven performance modeling of GPGPUs
In this paper, we present PPT-GPU, a scalable performance prediction toolkit for GPUs. PPT-
GPU achieves scalability through a hybrid high-level modeling approach where some …
GPU achieves scalability through a hybrid high-level modeling approach where some …
A simple model for portable and fast prediction of execution time and power consumption of GPU kernels
Characterizing compute kernel execution behavior on GPUs for efficient task scheduling is a
non-trivial task. We address this with a simple model enabling portable and fast predictions …
non-trivial task. We address this with a simple model enabling portable and fast predictions …