Cloud computing landscape and research challenges regarding trust and reputation
Cloud Computing is an emerging computing paradigm. It shares massively scalable, elastic
resources (eg, data, calculations, and services) transparently among the users over a …
resources (eg, data, calculations, and services) transparently among the users over a …
Crono: A benchmark suite for multithreaded graph algorithms executing on futuristic multicores
Algorithms operating on a graph setting are known to be highly irregular and unstructured.
This leads to workload imbalance and data locality challenge when these algorithms are …
This leads to workload imbalance and data locality challenge when these algorithms are …
Choosing the best parallelization and implementation styles for graph analytics codes: Lessons learned from 1106 programs
Graph analytics has become a major workload in recent years. The underlying core
algorithms tend to be irregular and data dependent, making them challenging to parallelize …
algorithms tend to be irregular and data dependent, making them challenging to parallelize …
Architecting waferscale processors-a GPU case study
Increasing communication overheads are already threatening computer system scaling. One
approach to dramatically reduce communication overheads is waferscale processing …
approach to dramatically reduce communication overheads is waferscale processing …
Fast segmented sort on gpus
Segmented sort, as a generalization of classical sort, orders a batch of independent
segments in a whole array. Along with the wider adoption of manycore processors for HPC …
segments in a whole array. Along with the wider adoption of manycore processors for HPC …
Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus
GPUs have been proven effective for structured applications that map well to the rigid 1D-3D
grid of threads in modern bulk synchronous parallel (BSP) programming languages …
grid of threads in modern bulk synchronous parallel (BSP) programming languages …
Data-parallel query processing on non-uniform data
Graphics processing units (GPUs) promise spectacular performance advantages when used
as database coprocessors. Their massive compute capacity, however, is often hampered by …
as database coprocessors. Their massive compute capacity, however, is often hampered by …
Laperm: Locality aware scheduler for dynamic parallelism on gpus
Recent developments in GPU execution models and architectures have introduced dynamic
parallelism to facilitate the execution of irregular applications where control flow and …
parallelism to facilitate the execution of irregular applications where control flow and …
Free launch: optimizing GPU dynamic kernel launches through thread reuse
G Chen, X Shen - Proceedings of the 48th International Symposium on …, 2015 - dl.acm.org
Supporting dynamic parallelism is important for GPU to benefit a broad range of
applications. There are currently two fundamental ways for programs to exploit dynamic …
applications. There are currently two fundamental ways for programs to exploit dynamic …
Controlled kernel launch for dynamic parallelism in GPUs
Dynamic parallelism (DP) is a promising feature for GPUs, which allows on-demand
spawning of kernels on the GPU without any CPU intervention. However, this feature has …
spawning of kernels on the GPU without any CPU intervention. However, this feature has …