Google Académico

RCO Rocha, P Petoumenos, Z Wang, M Cole… - Proceedings of the 41st …, 2020 - dl.acm.org

Function merging is an important optimization for reducing code size. This technique
eliminates redundant code across functions by merging them into a single function. While …

Guardar Citar Citado por 34 Artigos relacionados Todas as 15 versões

[Free GPT-4]
[DeepSeek]

[PDF] whiterose.ac.uk

Function merging by sequence alignment

RCO Rocha, P Petoumenos, Z Wang… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org

Resource-constrained devices for embedded systems are becoming increasingly important.
In such systems, memory is highly restrictive, making code size in most cases even more …

Guardar Citar Citado por 41 Artigos relacionados Todas as 19 versões

[Free GPT-4]
[DeepSeek]

[PDF] whiterose.ac.uk

HyFM: Function merging for free

RCO Rocha, P Petoumenos, Z Wang, M Cole… - Proceedings of the …, 2021 - dl.acm.org

Function merging is an important optimization for reducing code size. It merges multiple
functions into a single one, eliminating duplicate code among them. The existing state-of-the …

Guardar Citar Citado por 23 Artigos relacionados Todas as 13 versões

[Free GPT-4]
[DeepSeek]

[PDF] whiterose.ac.uk

Vectorization-aware loop unrolling with seed forwarding

RCO Rocha, V Porpodas, P Petoumenos… - Proceedings of the 29th …, 2020 - dl.acm.org

Loop unrolling is a widely adopted loop transformation, commonly used for enabling
subsequent optimizations. Straight-line-code vectorization (SLP) is an optimization that …

Guardar Citar Citado por 27 Artigos relacionados Todas as 14 versões

[Free GPT-4]
[DeepSeek]

[PDF] ed.ac.uk

Loop rolling for code size reduction

RCO Rocha, P Petoumenos, B Franke… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org

Code size is critical for resource-constrained devices, where memory and storage are
limited. Compilers, therefore, should offer optimizations aimed at code reduction. One such …

Guardar Citar Citado por 15 Artigos relacionados Todas as 10 versões

[Free GPT-4]
[DeepSeek]

[PDF] vporpo.me

VW-SLP: auto-vectorization with adaptive vector width

V Porpodas, RCO Rocha, LFW Góes - Proceedings of the 27th …, 2018 - dl.acm.org

Auto-vectorization techniques allow the compiler to automatically generate SIMD vector
code out of scalar code. SLP is a commonly-used algorithm for converting straight-line code …

Guardar Citar Citado por 27 Artigos relacionados Todas as 5 versões

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

Super-Node SLP: Optimized vectorization for code sequences containing operators and their inverse elements

V Porpodas, RCO Rocha, E Brevnov… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org

SLP Auto-vectorization converts straight-line code into vector code. It scans input code for
groups of instructions that can be combined into vectors and replaces them with their …

Guardar Citar Citado por 21 Artigos relacionados Todas as 8 versões

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Custom High-Performance Vector Code Generation for Data-Specific Sparse Computations

M Horro, LN Pouchet, G Rodríguez… - Proceedings of the …, 2022 - dl.acm.org

Sparse computations, such as sparse matrix-dense vector multiplication, are notoriously
hard to optimize due to their irregularity and memory-boundedness. Solutions to improve the …

Guardar Citar Citado por 8 Artigos relacionados Todas as 5 versões

[Free GPT-4]
[DeepSeek]

[PDF] supercomputing.org

Function/kernel vectorization via loop vectorizer

M Masten, E Tyurin, K Mitropoulou… - 2018 IEEE/ACM 5th …, 2018 - ieeexplore.ieee.org

Currently, there are three vectorizers in the LLVM trunk: Loop Vectorizer, SLP Vectorizer,
and Load-Store Vectorizer. There is a need for vectorizing functions/kernels: 1) Function …

Guardar Citar Citado por 8 Artigos relacionados Todas as 3 versões

[Free GPT-4]
[DeepSeek]

[PDF] acm.org Full View

Autovesk: Automatic vectorized code generation from unstructured static kernels using graph transformations

H Tayeb, L Paillat, B Bramas - ACM Transactions on Architecture and …, 2023 - dl.acm.org

Leveraging the SIMD capability of modern CPU architectures is mandatory to take full
advantage of their increased performance. To exploit this capability, binary executables …

Guardar Citar Citado por 3 Artigos relacionados Todas as 9 versões

Criar alerta

Citar

Pesquisa avançada

Guardado em A minha biblioteca

Look-ahead SLP: Auto-vectorization in the presence of commutative operations

Effective function merging in the ssa form

Function merging by sequence alignment

HyFM: Function merging for free

Vectorization-aware loop unrolling with seed forwarding

Loop rolling for code size reduction

VW-SLP: auto-vectorization with adaptive vector width

Super-Node SLP: Optimized vectorization for code sequences containing operators and their inverse elements

Custom High-Performance Vector Code Generation for Data-Specific Sparse Computations

Function/kernel vectorization via loop vectorizer

Autovesk: Automatic vectorized code generation from unstructured static kernels using graph transformations