Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Memristor-based binarized spiking neural networks: Challenges and applications
Memristive arrays are a natural fit to implement spiking neural network (SNN) acceleration.
Representing information as digital spiking events can improve noise margins and tolerance …
Representing information as digital spiking events can improve noise margins and tolerance …
Stochastic rounding: implementation, error analysis and applications
Stochastic rounding (SR) randomly maps a real number x to one of the two nearest values in
a finite precision number system. The probability of choosing either of these two numbers is …
a finite precision number system. The probability of choosing either of these two numbers is …
PLAM: A posit logarithm-approximate multiplier
The Posit™ Number System was introduced in 2017 as a replacement for floating-point
numbers. Since then, the community has explored its application in several areas, such as …
numbers. Since then, the community has explored its application in several areas, such as …
A block minifloat representation for training deep neural networks
Training Deep Neural Networks (DNN) with high efficiency can be difficult to achieve with
native floating-point representations and commercially available hardware. Specialized …
native floating-point representations and commercially available hardware. Specialized …
SALO: an efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences
The attention mechanisms of transformers effectively extract pertinent information from the
input sequence. However, the quadratic complexity of self-attention wrt the sequence length …
input sequence. However, the quadratic complexity of self-attention wrt the sequence length …
Low-precision stochastic gradient Langevin dynamics
While low-precision optimization has been widely used to accelerate deep learning, low-
precision sampling remains largely unexplored. As a consequence, sampling is simply …
precision sampling remains largely unexplored. As a consequence, sampling is simply …
PyGim: An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures
C Giannoula, P Yang, I Fernandez, J Yang… - Proceedings of the …, 2024 - dl.acm.org
Graph Neural Networks (GNNs) are emerging models to analyze graph-structure data. GNN
execution involves both compute-intensive and memory-intensive kernels. The latter kernels …
execution involves both compute-intensive and memory-intensive kernels. The latter kernels …
Accelerating Graph Neural Networks on Real Processing-In-Memory Systems
C Giannoula, P Yang, I Fernandez Vega… - arxiv e …, 2024 - ui.adsabs.harvard.edu
Abstract Graph Neural Networks (GNNs) are emerging ML models to analyze graph-
structure data. Graph Neural Network (GNN) execution involves both compute-intensive and …
structure data. Graph Neural Network (GNN) execution involves both compute-intensive and …
Optimization of block-scaled integer GeMMs for efficient DNN deployment on scalable in-order vector processors
NS Murthy, F Catthoor, M Verhelst - Journal of Systems Architecture, 2024 - Elsevier
A continuing rise in DNN usage in distributed and embedded use cases has demanded
more efficient hardware execution in the field. Low-precision GeMMs with optimized data …
more efficient hardware execution in the field. Low-precision GeMMs with optimized data …
HDReason: Algorithm-Hardware Codesign for Hyperdimensional Knowledge Graph Reasoning
In recent times, a plethora of hardware accelerators have been put forth for graph learning
applications such as vertex classification and graph classification. However, previous works …
applications such as vertex classification and graph classification. However, previous works …