- Academic Search

G Bosilca, A Bouteiller, A Danalis, T Herault… - Parallel Computing, 2012 - Elsevier

The frenetic development of the current architectures places a strain on the current state-of-
the-art programming environments. Harnessing the full potential of such architectures is a …

บันทึก อ้างอิง อ้างโดย504 บทความที่เกี่ยวข้อง ทั้งหมด 35 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

A hybridization methodology for high-performance linear algebra software for GPUs

E Agullo, C Augonnet, J Dongarra, H Ltaief… - GPU Computing Gems …, 2012 - Elsevier

Publisher Summary This chapter presents a hybridization methodology for the development
of high-performance linear algebra software for graphics processing units (GPUs). The …

บันทึก อ้างอิง อ้างโดย168 บทความที่เกี่ยวข้อง ทั้งหมด 9 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] lbl.gov

Enabling in-situ execution of coupled scientific workflow on multi-core platform

F Zhang, C Docan, M Parashar, S Klasky… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org

Emerging scientific application workflows are composed of heterogeneous coupled
component applications that simulate different aspects of the physical phenomena being …

บันทึก อ้างอิง อ้างโดย114 บทความที่เกี่ยวข้อง ทั้งหมด 11 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

PLASMA: Parallel linear algebra software for multicore using OpenMP

J Dongarra, M Gates, A Haidar, J Kurzak… - ACM Transactions on …, 2019 - dl.acm.org

The recent version of the Parallel Linear Algebra Software for Multicore Architectures
(PLASMA) library is based on tasks with dependencies from the OpenMP standard. The …

บันทึก อ้างอิง อ้างโดย73 บทความที่เกี่ยวข้อง ทั้งหมด 6 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

Are static schedules so bad? a case study on cholesky factorization

E Agullo, O Beaumont… - 2016 IEEE …, 2016 - ieeexplore.ieee.org

Our goal is to provide an analysis and comparison of static and dynamic strategies for task
graph scheduling on platforms consisting of heterogeneous and unrelated resources, such …

บันทึก อ้างอิง อ้างโดย64 บทความที่เกี่ยวข้อง ทั้งหมด 11 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] tennessee.edu

Dynamic task execution on shared and distributed memory architectures

A YarKhan - 2012 - trace.tennessee.edu

Multicore architectures with high core counts have come to dominate the world of high
performance computing, from shared memory machines to the largest distributed memory …

บันทึก อ้างอิง อ้างโดย63 บทความที่เกี่ยวข้อง ทั้งหมด 5 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] idsa.prd.fr

High performance matrix inversion based on LU factorization for multicore architectures

J Dongarra, M Faverge, H Ltaief… - Proceedings of the 2011 …, 2011 - dl.acm.org

The goal of this paper is to present an efficient implementation of an explicit matrix inversion
of general square matrices on multicore computer architecture. The inversion procedure is …

บันทึก อ้างอิง อ้างโดย35 บทความที่เกี่ยวข้อง ทั้งหมด 22 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] academie-sciences.fr

Parallel hierarchical hybrid linear solvers for emerging computing platforms

E Agullo, L Giraud… - Comptes …, 2011 - comptes-rendus.academie-sciences …

La conception des plateformes d'échelle extrême qui devraient être disponibles dans la
décade à venir représenteront la convergence de tendances technologiques et définiront le …

บันทึก อ้างอิง อ้างโดย39 บทความที่เกี่ยวข้อง ทั้งหมด 7 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

Task-based sparse hybrid linear solver for distributed memory heterogeneous architectures

E Agullo, L Giraud, S Nakov - … Workshops, Grenoble, France, August 24-26 …, 2017 - Springer

Heterogeneity is emerging as one of the most challenging characteristics of today's parallel
environments. However, not many fully-featured advanced numerical, scientific libraries …

บันทึก อ้างอิง อ้างโดย14 บทความที่เกี่ยวข้อง ทั้งหมด 5 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] psu.edu

Flexible linear algebra development and scheduling with cholesky factorization

A Haidar, A YarKhan, C Cao… - 2015 IEEE 17th …, 2015 - ieeexplore.ieee.org

Modern high performance computing environments are composed of networks of compute
nodes that often contain a variety of heterogeneous compute resources, such as multicore …

บันทึก อ้างอิง อ้างโดย12 บทความที่เกี่ยวข้อง ทั้งหมด 9 ฉบับ

สร้างการแจ้งเตือน

อ้างอิง

การค้นหาขั้นสูง

บันทึกไปยังคลังของฉันแล้ว

Distributed-memory task execution and dependence tracking within DAGuE and the DPLASMA project

DAGuE: A generic distributed DAG engine for high performance computing

A hybridization methodology for high-performance linear algebra software for GPUs

Enabling in-situ execution of coupled scientific workflow on multi-core platform

PLASMA: Parallel linear algebra software for multicore using OpenMP

Are static schedules so bad? a case study on cholesky factorization

Dynamic task execution on shared and distributed memory architectures

High performance matrix inversion based on LU factorization for multicore architectures

Parallel hierarchical hybrid linear solvers for emerging computing platforms

Task-based sparse hybrid linear solver for distributed memory heterogeneous architectures

Flexible linear algebra development and scheduling with cholesky factorization