Implementing sparse matrix-vector multiplication on throughput-oriented processors

N Bell, M Garland - Proceedings of the conference on high performance …, 2009 - dl.acm.org
Sparse matrix-vector multiplication (SpMV) is of singular importance in sparse linear
algebra. In contrast to the uniform regularity of dense linear algebra, sparse operations …

Composable memory transactions

T Harris, S Marlow, S Peyton-Jones… - Proceedings of the tenth …, 2005 - dl.acm.org
Writing concurrent programs is notoriously difficult, and is of increasing practical importance.
A particular source of concern is that even correctly-implemented concurrency abstractions …

Programming parallel algorithms

GE Blelloch - Communications of the ACM, 1996 - dl.acm.org
parallel algorithftls. Researchers have developed efficient parallel algorithms to solve most
problems for which efficient sequential solutions are known. Although some ofthese …

Futhark: purely functional GPU-programming with nested parallelism and in-place array updates

T Henriksen, NGW Serup, M Elsman… - Proceedings of the 38th …, 2017 - dl.acm.org
Futhark is a purely functional data-parallel array language that offers a machine-neutral
programming model and an optimising compiler that generates OpenCL code for GPUs …

Libcint: An efficient general integral library for g aussian basis functions

Q Sun - Journal of computational chemistry, 2015 - Wiley Online Library
An efficient integral library Libcint was designed to automatically implement general
integrals for Gaussian‐type scalar and spinor basis functions. The library is able to evaluate …

[HTML][HTML] The role of recipient characteristics in health video communication outcomes: sco** review

DA Lungu, J Røislien, S Wiig, MT Shortt, F Ferrè… - Journal of Medical …, 2021 - jmir.org
Background The importance of effective communication during public health emergencies
has been highlighted by the World Health Organization, and it has published guidelines for …

Research directions in parallel functional programming

K Hammond, G Michaelson - 2012 - books.google.com
Programming is hard. Building a large program is like constructing a steam locomotive
through a hole the size of a postage stamp. An artefact that is the fruit of hundreds of person …

Copperhead: compiling an embedded data parallel language

B Catanzaro, M Garland, K Keutzer - … of the 16th ACM symposium on …, 2011 - dl.acm.org
Modern parallel microprocessors deliver high performance on applications that expose
substantial fine-grained data parallelism. Although data parallelism is widely available in …

[BOOK][B] Efficient parallel scan algorithms for many-core gpus

S Sengupta, MJ Harris, M Garland, JD Owens - 2011 - api.taylorfrancis.com
We have witnessed a phenomenal increase in computational resources for graphics
processors units (GPU) over the last few years. The highest performing graphics processors …

Data Parallel Haskell: a status report

MMT Chakravarty, R Leshchinskiy… - Proceedings of the …, 2007 - dl.acm.org
We describe the design and current status of our effort to implement the programming model
of nested data parallelism into the Glasgow Haskell Compiler. We extended the original …