- Academic Search

P Hijma, S Heldens, A Sclocco… - ACM Computing …, 2023 - dl.acm.org

In the past decade, Graphics Processing Units have played an important role in the field of
high-performance computing and they still advance new fields such as IoT, autonomous …

Save Cite Cited by 67 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Deep neural network approximation for custom hardware: Where we've been, where we're going

E Wang, JJ Davis, R Zhao, HC Ng, X Niu… - ACM Computing …, 2019 - dl.acm.org

Deep neural networks have proven to be particularly effective in visual and audio
recognition tasks. Existing models tend to be computationally expensive and memory …

Save Cite Cited by 247 Related articles All 5 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

EIE: Efficient inference engine on compressed deep neural network

S Han, X Liu, H Mao, J Pu, A Pedram… - ACM SIGARCH …, 2016 - dl.acm.org

State-of-the-art deep neural networks (DNNs) have hundreds of millions of connections and
are both computationally and memory intensive, making them difficult to deploy on …

Save Cite Cited by 3345 Related articles All 29 versions Free GPT-4

[Free GPT-4]

[PDF] psu.edu

Multicore bundle adjustment

C Wu, S Agarwal, B Curless, SM Seitz - CVPR 2011, 2011 - ieeexplore.ieee.org

We present the design and implementation of new inexact Newton type Bundle Adjustment
algorithms that exploit hardware parallelism for efficiently solving large scale 3D scene …

Save Cite Cited by 1137 Related articles All 15 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing

T Geng, A Li, R Shi, C Wu, T Wang, Y Li… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org

Deep learning systems have been successfully applied to Euclidean data such as images,
video, and audio. In many applications, however, information and their relationships are …

Save Cite Cited by 305 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] acm.org

Memory coherence in shared virtual memory systems

K Li, P Hudak - ACM Transactions on Computer Systems (TOCS), 1989 - dl.acm.org

The memory coherence problem in designing and implementing a shared virtual memory on
loosely coupled multiprocessors is studied in depth. Two classes of algorithms, centralized …

Save Cite Cited by 2256 Related articles All 83 versions Free GPT-4

[Free GPT-4]

[PDF] unc.edu

Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU

VW Lee, C Kim, J Chhugani, M Deisher, D Kim… - Proceedings of the 37th …, 2010 - dl.acm.org

Recent advances in computing have led to an explosion in the amount of data being
generated. Processing the ever-growing data in a timely manner has made throughput …

Save Cite Cited by 1228 Related articles All 25 versions Free GPT-4

SparTen: A sparse tensor accelerator for convolutional neural networks

A Gondimalla, N Chesnut, M Thottethodi… - Proceedings of the …, 2019 - dl.acm.org

Convolutional neural networks (CNNs) are emerging as powerful tools for image
processing. Recent machine learning work has reduced CNNs' compute and data volumes …

Save Cite Cited by 296 Related articles

[Free GPT-4]

[PDF] iitkgp.ac.in

[PDF][PDF] The Chinese Wall Security Policy.

DFC Brewer, MJ Nash - S&P, 1989 - facweb.iitkgp.ac.in

Everyone who has seen the movie Wall Street will have seen a commercial security policy in
action. The recent work of Clark and Wilson and the WIPCIS initiative (the Workshop on …

Save Cite Cited by 1527 Related articles All 24 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] cam.ac.uk

Scalable GPU graph traversal

D Merrill, M Garland, A Grimshaw - ACM Sigplan Notices, 2012 - dl.acm.org

Breadth-first search (BFS) is a core primitive for graph traversal and a basis for many higher-
level graph analysis algorithms. It is also representative of a class of parallel computations …

Save Cite Cited by 729 Related articles All 17 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

Implementing sparse matrix-vector multiplication on throughput-oriented processors

Optimization techniques for GPU programming

Deep neural network approximation for custom hardware: Where we've been, where we're going

EIE: Efficient inference engine on compressed deep neural network

Multicore bundle adjustment

AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing

Memory coherence in shared virtual memory systems

Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU

SparTen: A sparse tensor accelerator for convolutional neural networks

[PDF][PDF] The Chinese Wall Security Policy.

Scalable GPU graph traversal