- Academic Search

J Liu, J Wickerson, S Bayliss… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org

Loop pipelining is one of the most important optimization methods in high-level synthesis
(HLS) for increasing loop parallelism. There has been considerable work on improving loop …

Zapisz Cytuj Cytowane przez 44 Powiązane artykuły Wszystkie wersje 5

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Generating piecewise-regular code from irregular structures

T Augustine, J Sarma, LN Pouchet… - Proceedings of the 40th …, 2019 - dl.acm.org

Irregular data structures, as exemplified with sparse matrices, have proved to be essential in
modern computing. Numerous sparse formats have been investigated to improve the overall …

Zapisz Cytuj Cytowane przez 37 Powiązane artykuły Wszystkie wersje 2

Full runtime polyhedral optimizing loop transformations with the generation, instantiation, and scheduling of code‐bones

JM Martinez Caamaño, M Selva… - Concurrency and …, 2017 - Wiley Online Library

In this paper, we present a new runtime code generation technique for speculative loop
optimization and parallelization. The main benefit of this technique, compared to previous …

Zapisz Cytuj Cytowane przez 36 Powiązane artykuły Wszystkie wersje 6

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Automatically harnessing sparse acceleration

P Ginsbach, B Collie, MFP O'Boyle - Proceedings of the 29th …, 2020 - dl.acm.org

Sparse linear algebra is central to many scientific programs, yet compilers fail to optimize it
well. High-performance libraries are available, but adoption costs are significant. Moreover …

Zapisz Cytuj Cytowane przez 18 Powiązane artykuły Wszystkie wersje 10

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

APOLLO: Automatic speculative polyhedral loop optimizer

JMM Caamaño, A Sukumaran-Rajam… - IMPACT 2017-7th …, 2017 - inria.hal.science

A few weeks ago, we were glad to announce the first release of Apollo, the Automatic
speculative POLyhedral Loop Opti-mizer. Apollo applies polyhedral optimizations on-the-fly …

Zapisz Cytuj Cytowane przez 24 Powiązane artykuły Wszystkie wersje 2 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

Runtime multi-versioning and specialization inside a memoized speculative loop optimizer

R Lazcano, D Madroñal, E Juarez… - Proceedings of the 29th …, 2020 - dl.acm.org

In this paper, we propose a runtime framework that implements code multi-versioning and
specialization to optimize and parallelize loop kernels that are invoked many times with …

Zapisz Cytuj Cytowane przez 12 Powiązane artykuły Wszystkie wersje 4

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Fast data-dependence profiling through prior static analysis

M Norouzi, N Morew, Q Ilias, L Rothenberger… - Parallel Computing, 2024 - Elsevier

Data-dependence profiling is a program-analysis technique for detecting parallelism
opportunities in sequential programs. It captures data dependences that actually occur …

Zapisz Cytuj Powiązane artykuły Wszystkie wersje 2

[Free GPT-4]
[DeepSeek]

[PDF] upc.edu

AutoParallel: Automatic parallelisation and distributed execution of affine loop nests in Python

C Ramon-Cortes, R Amela, J Ejarque… - … Journal of High …, 2020 - journals.sagepub.com

The last improvements in programming languages and models have focused on simplicity
and abstraction; leading Python to the top of the list of the programming languages …

Zapisz Cytuj Cytowane przez 5 Powiązane artykuły Wszystkie wersje 5

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Affine modeling of program traces

G Rodríguez, MT Kandemir… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org

A formal, high-level representation of programs is typically needed for static and dynamic
analyses performed by compilers. However, the source code of target applications is not …

Zapisz Cytuj Cytowane przez 9 Powiązane artykuły Wszystkie wersje 6

[Free GPT-4]
[DeepSeek]

[PDF] acm.org Full View

Autovesk: Automatic vectorized code generation from unstructured static kernels using graph transformations

H Tayeb, L Paillat, B Bramas - ACM Transactions on Architecture and …, 2023 - dl.acm.org

Leveraging the SIMD capability of modern CPU architectures is mandatory to take full
advantage of their increased performance. To exploit this capability, binary executables …

Zapisz Cytuj Cytowane przez 3 Powiązane artykuły Wszystkie wersje 9

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

The polyhedral model of nonlinear loops

Polyhedral-based dynamic loop pipelining for high-level synthesis

Generating piecewise-regular code from irregular structures

Full runtime polyhedral optimizing loop transformations with the generation, instantiation, and scheduling of code‐bones

Automatically harnessing sparse acceleration

APOLLO: Automatic speculative polyhedral loop optimizer

Runtime multi-versioning and specialization inside a memoized speculative loop optimizer

[HTML][HTML] Fast data-dependence profiling through prior static analysis

AutoParallel: Automatic parallelisation and distributed execution of affine loop nests in Python

Affine modeling of program traces

Autovesk: Automatic vectorized code generation from unstructured static kernels using graph transformations