- Academic Search

Memristor-based binarized spiking neural networks: Challenges and applications

JK Eshraghian, X Wang, WD Lu - IEEE Nanotechnology …, 2022 - ieeexplore.ieee.org

Memristive arrays are a natural fit to implement spiking neural network (SNN) acceleration.
Representing information as digital spiking events can improve noise margins and tolerance …

Cited by 72 - 2 versions

Stochastic rounding: implementation, error analysis and applications

M Croci, M Fasi, NJ Higham… - Royal Society Open …, 2022 - royalsocietypublishing.org

Stochastic rounding (SR) randomly maps a real number x to one of the two nearest values in
a finite precision number system. The probability of choosing either of these two numbers is …

Cited by 79 - 18 versions

PLAM: A posit logarithm-approximate multiplier

R Murillo, AA Del Barrio, G Botella… - … on Emerging Topics …, 2021 - ieeexplore.ieee.org

The Posit™ Number System was introduced in 2017 as a replacement for floating-point
numbers. Since then, the community has explored its application in several areas, such as …

Cited by 42 - 6 versions

A block minifloat representation for training deep neural networks

S Fox, S Rasoulinezhad, J Faraone… - … Conference on Learning …, 2020 - openreview.net

Training Deep Neural Networks (DNN) with high efficiency can be difficult to achieve with
native floating-point representations and commercially available hardware. Specialized …

Cited by 51 - 2 versions

SALO: an efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences

G Shen, J Zhao, Q Chen, J Leng, C Li… - Proceedings of the 59th …, 2022 - dl.acm.org

The attention mechanisms of transformers effectively extract pertinent information from the
input sequence. However, the quadratic complexity of self-attention wrt the sequence length …

Cited by 27 - 3 versions

Low-precision stochastic gradient Langevin dynamics

R Zhang, AG Wilson, C De Sa - International Conference on …, 2022 - proceedings.mlr.press

While low-precision optimization has been widely used to accelerate deep learning,
low-precision sampling remains largely unexplored. As a consequence, sampling is simply …

Cited by 17 - 8 versions

PyGim: An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures

C Giannoula, P Yang, I Fernandez, J Yang… - Proceedings of the …, 2024 - dl.acm.org

Graph Neural Networks (GNNs) are emerging models to analyze graph-structure data. GNN
execution involves both compute-intensive and memory-intensive kernels. The latter kernels …

Cited by 1 - 2 versions

Accelerating Graph Neural Networks on Real Processing-In-Memory Systems

C Giannoula, P Yang, I Fernandez Vega… - arxiv e …, 2024 - ui.adsabs.harvard.edu

Graph Neural Networks (GNNs) are emerging ML models to analyze graph-structure data.
Graph Neural Network (GNN) execution involves both compute-intensive and …

Cited by 7

Optimization of block-scaled integer GeMMs for efficient DNN deployment on scalable in-order vector processors

NS Murthy, F Catthoor, M Verhelst - Journal of Systems Architecture, 2024 - Elsevier

A continuing rise in DNN usage in distributed and embedded use cases has demanded
more efficient hardware execution in the field. Low-precision GeMMs with optimized data …

Cited by 2 - 2 versions

HDReason: Algorithm-Hardware Codesign for Hyperdimensional Knowledge Graph Reasoning

H Chen, Y Ni, A Zakeri, Z Zou, S Yun, F Wen… - arxiv preprint arxiv …, 2024 - arxiv.org

In recent times, a plethora of hardware accelerators have been put forth for graph learning
applications such as vertex classification and graph classification. However, previous works …

Cited by 5 - 2 versions

Qpytorch: A low-precision arithmetic simulation framework

Memristor-based binarized spiking neural networks: Challenges and applications
