Deep learning in electron microscopy

JM Ede - Machine Learning: Science and Technology, 2021 - iopscience.iop.org
Deep learning is transforming most areas of science and technology, including electron
microscopy. This review paper offers a practical perspective aimed at developers with …

Medical image processing on the GPU–Past, present and future

A Eklund, P Dufort, D Forsberg, SM LaConte - Medical image analysis, 2013 - Elsevier
Graphics processing units (GPUs) are used today in a wide range of applications, mainly
because they can dramatically accelerate parallel computing, are affordable and energy …

A survey of general‐purpose computation on graphics hardware

JD Owens, D Luebke, N Govindaraju… - Computer graphics …, 2007 - Wiley Online Library
The rapid increase in the performance of graphics hardware, coupled with recent
improvements in its programmability, have made graphics hardware a compelling platform …

Brook for GPUs: stream computing on graphics hardware

I Buck, T Foley, D Horn, J Sugerman… - ACM transactions on …, 2004 - dl.acm.org
In this paper, we present Brook for GPUs, a system for general-purpose computation on
programmable graphics hardware. Brook extends C to include simple data-parallel …

Understanding the efficiency of GPU algorithms for matrix-matrix multiplication

K Fatahalian, J Sugerman, P Hanrahan - Proceedings of the ACM …, 2004 - dl.acm.org
Utilizing graphics hardware for general purpose numerical computations has become a
topic of considerable interest. The implementation of streaming algorithms, typified by highly …

High performance discrete Fourier transforms on graphics processors

NK Govindaraju, B Lloyd, Y Dotsenko… - SC'08: Proceedings …, 2008 - ieeexplore.ieee.org
We present novel algorithms for computing discrete Fourier transforms with high
performance on GPUs. We present hierarchical, mixed radix FFT algorithms for both power …

Uberflow: a gpu-based particle engine

P Kipfer, M Segal, R Westermann - Proceedings of the ACM SIGGRAPH …, 2004 - dl.acm.org
We present a system for real-time animation and rendering of large particle sets using GPU
computation and memory objects in OpenGL. Memory objects can be used both as …

Mint: realizing CUDA performance in 3D stencil methods with annotated C

D Unat, X Cai, SB Baden - Proceedings of the international conference …, 2011 - dl.acm.org
We present Mint, a programming model that enables the non-expert to enjoy the
performance benefits of hand coded CUDA without becoming entangled in the details. Mint …

ePlace: Electrostatics-based placement using fast fourier transform and Nesterov's method

J Lu, P Chen, CC Chang, L Sha, DJH Huang… - ACM Transactions on …, 2015 - dl.acm.org
We develop a flat, analytic, and nonlinear placement algorithm, ePlace, which is more
effective, generalized, simpler, and faster than previous works. Based on the analogy …

GPUTreeShap: massively parallel exact calculation of SHAP scores for tree ensembles

R Mitchell, E Frank, G Holmes - PeerJ Computer Science, 2022 - peerj.com
Abstract SHapley Additive exPlanation (SHAP) values (Lundberg & Lee, 2017) provide a
game theoretic interpretation of the predictions of machine learning models based on …