Array programming with NumPy

CR Harris, KJ Millman, SJ Van Der Walt, R Gommers… - Nature, 2020 - nature.com
Array programming provides a powerful, compact and expressive syntax for accessing,
manipulating and operating on data in vectors, matrices and higher-dimensional arrays …

The Chronus quantum software package

DB Williams‐Young, A Petrone, S Sun… - Wiley …, 2020 - Wiley Online Library
Abstract The Chronus Quantum (ChronusQ) software package is an open source (under the
GNU General Public License v2) software infrastructure which targets the solution of …

FastSpar: rapid and scalable correlation estimation for compositional data

SC Watts, SC Ritchie, M Inouye, KE Holt - Bioinformatics, 2019 - academic.oup.com
A common goal of microbiome studies is the elucidation of community composition and
member interactions using counts of taxonomic units extracted from sequence data …

AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs

Q Wang, X Zhang, Y Zhang, Q Yi - Proceedings of the international …, 2013 - dl.acm.org
Basic Liner algebra subprograms (BLAS) is a fundamental library in scientific computing. In
this paper, we present a template-based optimization framework, AUGEM, which can …

Tabla: A unified template-based framework for accelerating statistical machine learning

D Mahajan, J Park, E Amaro, H Sharma… - … Symposium on High …, 2016 - ieeexplore.ieee.org
A growing number of commercial and enterprise systems increasingly rely on compute-
intensive Machine Learning (ML) algorithms. While the demand for these compute-intensive …

Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations

K Kanellopoulos, N Vijaykumar, C Giannoula… - Proceedings of the …, 2019 - dl.acm.org
Important workloads, such as machine learning and graph analytics applications, heavily
involve sparse linear algebra operations. These operations use sparse matrix compression …

A hybrid gene selection approach to create the S1500+ targeted gene sets for use in high-throughput transcriptomics

D Mav, RR Shah, BE Howard, SS Auerbach… - PloS one, 2018 - journals.plos.org
Changes in gene expression can help reveal the mechanisms of disease processes and the
mode of action for toxicities and adverse effects on cellular responses induced by exposures …

Caffeinated FPGAs: FPGA framework for convolutional neural networks

R DiCecco, G Lacey, J Vasiljevic… - … Conference on Field …, 2016 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) have gained significant traction in the field of
machine learning, particularly due to their high accuracy in visual recognition. Recent works …

Anatomy of high-performance many-threaded matrix multiplication

TM Smith, R Van De Geijn… - 2014 IEEE 28th …, 2014 - ieeexplore.ieee.org
BLIS is a new framework for rapid instantiation of the BLAS. We describe how BLIS extends
the" GotoBLAS approach" to implementing matrix multiplication (GEMM). While GEMM was …