Decoupling algorithms from schedules for easy optimization of image processing pipelines

J Ragan-Kelley, A Adams, S Paris, M Levoy… - ACM Transactions on …, 2012 - dl.acm.org
Using existing programming tools, writing high-performance image processing code
requires sacrificing readability, portability, and modularity. We argue that this is a …

Voodoo-a vector algebra for portable database performance on modern hardware

H Pirk, O Moll, M Zaharia, S Madden - Proceedings of the VLDB …, 2016 - dl.acm.org
In-memory databases require careful tuning and many engineering tricks to achieve good
performance. Such database performance engineering is hard: a plethora of data and …

HIPAcc: A Domain-Specific Language and Compiler for Image Processing

R Membarth, O Reiche, F Hannig… - … on Parallel and …, 2015 - ieeexplore.ieee.org
Domain-specific languages (DSLs) provide high-level and domain-specific abstractions that
allow expressive and concise algorithm descriptions. Since the description in a DSL hides …

Terra: a multi-stage language for high-performance computing

Z DeVito, J Hegarty, A Aiken, P Hanrahan… - Proceedings of the 34th …, 2013 - dl.acm.org
High-performance computing applications, such as auto-tuners and domain-specific
languages, rely on generative programming techniques to achieve high performance and …

A generic parallel pattern interface for stream and data processing

D del Rio Astorga, MF Dolz… - Concurrency and …, 2017 - Wiley Online Library
Current parallel programming frameworks aid developers to a great extent in implementing
applications that exploit parallel hardware resources. Nevertheless, developers require …

[PDF][PDF] A survey of high-level parallel programming models

E Belikov, P Deligiannis, P Totoo, M Aljabri… - Heriot-Watt University …, 2013 - macs.hw.ac.uk
Increasingly heterogeneous and hierarchical parallel architectures are now mainstream,
however, most of the traditional programming models are lowlevel and explicit, limiting …

Compiling a high-level language for GPUs: (via language support for architectures and compilers)

C Dubach, P Cheng, R Rabbah, DF Bacon… - ACM SIGPLAN …, 2012 - dl.acm.org
Languages such as OpenCL and CUDA offer a standard interface for general-purpose
programming of GPUs. However, with these languages, programmers must explicitly …

AnyDSL: a partial evaluation framework for programming high-performance libraries

R Leißa, K Boesche, S Hack, A Pérard-Gayot… - Proceedings of the …, 2018 - dl.acm.org
This paper advocates programming high-performance code using partial evaluation. We
present a clean-slate programming system with a simple, annotation-based, online partial …

Improving performance of OpenCL on CPUs

R Karrenberg, S Hack - … : 21st International Conference, CC 2012, Held as …, 2012 - Springer
Data-parallel languages like OpenCL and CUDA are an important means to exploit the
computational power of today's computing devices. In this paper, we deal with two aspects of …

BuildIt: A type-based multi-stage programming framework for code generation in C++

A Brahmakshatriya… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
The simplest implementation of a domain-specific language is to embed it in an existing
language using operator overloading. This way, the DSL can inherit parsing, syntax and …