Automatic heterogeneous quantization of deep neural networks for low-latency inference on the edge for particle detectors

CN Coelho, A Kuusela, S Li, H Zhuang… - Nature Machine …, 2021 - nature.com
Although the quest for more accurate solutions is pushing deep learning research towards
larger and more complex algorithms, edge devices demand efficient inference and therefore …

Enabling fast differentially private sgd via just-in-time compilation and vectorization

P Subramani, N Vadivelu… - Advances in Neural …, 2021 - proceedings.neurips.cc
A common pain point in differentially private machine learning is the significant runtime
overhead incurred when executing Differentially Private Stochastic Gradient Descent …

Beyond human-level accuracy: Computational challenges in deep learning

J Hestness, N Ardalani, G Diamos - … of the 24th symposium on principles …, 2019 - dl.acm.org
Deep learning (DL) research yields accuracy and product improvements from both model
architecture changes and scale: larger data sets and models, and more computation. For …

Recognition of building group patterns using graph convolutional network

R Zhao, T Ai, W Yu, Y He, Y Shen - Cartography and Geographic …, 2020 - Taylor & Francis
Recognition of building group patterns is of great significance for understanding and
modeling the urban space. However, many current methods cannot fully utilize spatial …

Is deeper always better? Evaluating deep learning models for yield forecasting with small data

F Sabo, M Meroni, F Waldner, F Rembold - Environmental Monitoring and …, 2023 - Springer
Predicting crop yields, and especially anomalously low yields, is of special importance for
food insecure countries. In this study, we investigate a flexible deep learning approach to …

Classification of broad absorption line quasars with a convolutional neural network

Z Guo, P Martini - The Astrophysical Journal, 2019 - iopscience.iop.org
Quasars that exhibit blueshifted, broad absorption lines (BAL QSOs) are an important probe
of black hole feedback on galaxy evolution. Yet the presence of BALs is also a complication …

Artificial neural network-based sequential approximate optimization of metal sheet architecture and forming process

SS Han, HK Kim - Journal of Computational Design and …, 2024 - academic.oup.com
This paper introduces a sequential approximate optimization method that combines the finite
element method (FEM), dynamic differential evolution (DDE), and artificial neural network …

The Next 700 ML-Enabled Compiler Optimizations

S VenkataKeerthy, S Jain, U Kalvakuntla… - Proceedings of the 33rd …, 2024 - dl.acm.org
There is a growing interest in enhancing compiler optimizations with ML models, yet
interactions between compilers and ML frameworks remain challenging. Some optimizations …

APPy: Annotated Parallelism for Python on GPUs

T Zhou, J Shirako, V Sarkar - Proceedings of the 33rd ACM SIGPLAN …, 2024 - dl.acm.org
GPUs are increasingly being used used to speed up Python applications in the scientific
computing and machine learning domains. Currently, the two common approaches to …

imageseg: An R package for deep learning‐based image segmentation

J Niedballa, J Axtner, TF Döbert, A Tilker… - Methods in Ecology …, 2022 - Wiley Online Library
Convolutional neural networks (CNNs) and deep learning are powerful and robust tools for
ecological applications, and are particularly suited for image data. Image segmentation (the …