- Academic Search

SM Habib, S Ries, M Muhlhauser - 2010 7th International …, 2010 - ieeexplore.ieee.org

Cloud Computing is an emerging computing paradigm. It shares massively scalable, elastic
resources (eg, data, calculations, and services) transparently among the users over a …

Save Cite Cited by 193 Related articles All 10 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] psu.edu

Crono: A benchmark suite for multithreaded graph algorithms executing on futuristic multicores

M Ahmad, F Hijaz, Q Shi, O Khan - 2015 IEEE International …, 2015 - ieeexplore.ieee.org

Algorithms operating on a graph setting are known to be highly irregular and unstructured.
This leads to workload imbalance and data locality challenge when these algorithms are …

Save Cite Cited by 140 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Choosing the best parallelization and implementation styles for graph analytics codes: Lessons learned from 1106 programs

Y Liu, N Azami, A VanAusdal, M Burtscher - Proceedings of the …, 2023 - dl.acm.org

Graph analytics has become a major workload in recent years. The underlying core
algorithms tend to be irregular and data dependent, making them challenging to parallelize …

Save Cite Cited by 4 Related articles All 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] ucla.edu

Architecting waferscale processors-a GPU case study

S Pal, D Petrisko, M Tomei, P Gupta… - … Symposium on High …, 2019 - ieeexplore.ieee.org

Increasing communication overheads are already threatening computer system scaling. One
approach to dramatically reduce communication overheads is waferscale processing …

Save Cite Cited by 62 Related articles All 10 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Fast segmented sort on gpus

K Hou, W Liu, H Wang, W Feng - Proceedings of the International …, 2017 - dl.acm.org

Segmented sort, as a generalization of classical sort, orders a batch of independent
segments in a whole array. Along with the wider adoption of manycore processors for HPC …

Save Cite Cited by 72 Related articles All 5 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] danielwong.org

Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus

J Wang, N Rubin, A Sidelnik… - ACM SIGARCH Computer …, 2015 - dl.acm.org

GPUs have been proven effective for structured applications that map well to the rigid 1D-3D
grid of threads in modern bulk synchronous parallel (BSP) programming languages …

Save Cite Cited by 88 Related articles All 13 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] vldb.org

Data-parallel query processing on non-uniform data

H Funke, J Teubner - Proceedings of the VLDB Endowment, 2020 - dl.acm.org

Graphics processing units (GPUs) promise spectacular performance advantages when used
as database coprocessors. Their massive compute capacity, however, is often hampered by …

Save Cite Cited by 42 Related articles All 7 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] semanticscholar.org

Laperm: Locality aware scheduler for dynamic parallelism on gpus

J Wang, N Rubin, A Sidelnik… - ACM SIGARCH Computer …, 2016 - dl.acm.org

Recent developments in GPU execution models and architectures have introduced dynamic
parallelism to facilitate the execution of irregular applications where control flow and …

Save Cite Cited by 69 Related articles All 5 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Free launch: optimizing GPU dynamic kernel launches through thread reuse

G Chen, X Shen - Proceedings of the 48th International Symposium on …, 2015 - dl.acm.org

Supporting dynamic parallelism is important for GPU to benefit a broad range of
applications. There are currently two fundamental ways for programs to exploit dynamic …

Save Cite Cited by 73 Related articles All 6 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] nsf.gov

Controlled kernel launch for dynamic parallelism in GPUs

X Tang, A Pattnaik, H Jiang, O Kayiran… - … Symposium on High …, 2017 - ieeexplore.ieee.org

Dynamic parallelism (DP) is a promising feature for GPUs, which allows on-demand
spawning of kernels on the GPU without any CPU intervention. However, this feature has …

Save Cite Cited by 62 Related articles All 12 versions Free GPT-4 DeepSeek

Create alert

Cite

Advanced search

Saved to My library

Characterization and analysis of dynamic parallelism in unstructured GPU applications

Cloud computing landscape and research challenges regarding trust and reputation

Crono: A benchmark suite for multithreaded graph algorithms executing on futuristic multicores

Choosing the best parallelization and implementation styles for graph analytics codes: Lessons learned from 1106 programs

Architecting waferscale processors-a GPU case study

Fast segmented sort on gpus

Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus

Data-parallel query processing on non-uniform data

Laperm: Locality aware scheduler for dynamic parallelism on gpus

Free launch: optimizing GPU dynamic kernel launches through thread reuse

Controlled kernel launch for dynamic parallelism in GPUs