Deep bilateral learning for real-time image enhancement
Performance is a critical challenge in mobile image processing. Given a reference imaging
pipeline, or even human-adjusted pairs of images, we seek to reproduce the enhancements …
Taichi: a language for high-performance computation on spatially sparse data structures
3D visual computing data are often spatially sparse. To exploit such sparsity, people have
developed hierarchical sparse data structures, such as multi-level sparse voxel grids …
Dr.Jit: A just-in-time compiler for differentiable rendering
Dr.Jit is a new just-in-time compiler for physically based rendering and its derivative.
Dr.Jit expedites research on these topics in two ways: first, it traces high-level simulation code …
Parallel programming models for heterogeneous many-cores: a comprehensive survey
Heterogeneous many-cores are now an integral part of modern computing systems ranging
from embedded systems to supercomputers. While heterogeneous many-core design offers …
Fast image processing with fully-convolutional networks
We present an approach to accelerating a wide variety of image processing operators. Our
approach uses a fully-convolutional network that is trained on input-output pairs that …
Tiramisu: A polyhedral compiler for expressing fast and portable code
R Baghdadi, J Ray, MB Romdhane… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org
This paper introduces Tiramisu, a polyhedral framework designed to generate high
performance code for multiple platforms including multicores, GPUs, and distributed …
Burst photography for high dynamic range and low-light imaging on mobile cameras
Cell phone cameras have small apertures, which limits the number of photons they can
gather, leading to noisy images in low light. They also have small sensor pixels, which limits …
Learning to optimize Halide with tree search and random programs
We present a new algorithm to automatically schedule Halide programs for high-
performance image processing and deep learning. We significantly improve upon the …
Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines
Image processing pipelines combine the challenges of stencil computations and stream
programs. They are composed of large graphs of different stencil stages, as well as complex …
OpenTuner: An extensible framework for program autotuning
Program autotuning has been shown to achieve better or more portable performance in a
number of domains. However, autotuners themselves are rarely portable between projects …