- Academic Search

HIDA: A Hierarchical Dataflow Compiler for High-Level Synthesis

H Ye, H Jun, D Chen - Proceedings of the 29th ACM International …, 2024 - dl.acm.org

Dataflow architectures are growing in popularity due to their potential to mitigate the
challenges posed by the memory wall inherent to the Von Neumann architecture. At the …

Cited by 12 · All 6 versions

[PDF] sfu.ca

PASTA: Programming and Automation Support for Scalable Task-Parallel HLS Programs on Modern Multi-Die FPGAs

M Khatti, X Tian, Y Chi, L Guo, J Cong… - 2023 IEEE 31st …, 2023 - ieeexplore.ieee.org

In recent years, there has been increasing adoption of FPGAs in datacenters as hardware
accelerators, where a large population of end users are software developers. While high …

Cited by 8 · All 5 versions

[PDF] acm.org

Callipepla: Stream centric instruction set and mixed precision for accelerating conjugate gradient solver

L Song, L Guo, S Basalama, Y Chi, RF Lucas… - Proceedings of the 2023 …, 2023 - dl.acm.org

The continued growth in the processing power of FPGAs coupled with high bandwidth
memories (HBM), makes systems like the Xilinx U280 credible platforms for linear solvers …

Cited by 6 · All 5 versions

[PDF] acm.org

SpectraFlux: Harnessing the Flow of Multi-FPGA in Mass Spectrometry Clustering

T Zhang, N Prakriya, S Xie, J Cong… - Proceedings of the 61st …, 2024 - dl.acm.org

The identification and quantification of proteins through mass spectrometry (MS) are
foundational to proteomics, offering insights into biological systems and disease states …

Cited by 1

Lightning Talk: The Next Wave of High-level Synthesis

D Chen - 2023 60th ACM/IEEE Design Automation Conference …, 2023 - ieeexplore.ieee.org

Recent works established new High-Level Synthesis (HLS) solutions translating AI models
described in PyTorch to customized AI accelerators automatically. By adopting PyTorch as …

Cited by 1 · All 2 versions

[PDF] acm.org

LevelST: Stream-based Accelerator for Sparse Triangular Solver

Z He, L Song, RF Lucas, J Cong - Proceedings of the 2024 ACM/SIGDA …, 2024 - dl.acm.org

Over the past decade, much progress has been made to advance the acceleration of sparse
linear operators such as SpMM and SpMV on FPGAs. Nevertheless, few works have …

Cited by 2

[PDF] arxiv.org

HMT: Hierarchical Memory Transformer for Long Context Language Processing

Z He, Z Qin, N Prakriya, Y Sun, J Cong - arXiv preprint arXiv:2405.06067, 2024 - arxiv.org

Transformer-based large language models (LLM) have been widely used in language
processing applications. However, most of them restrict the context window that permits the …

Cited by 4 · All 2 versions

HiHiSpMV: Sparse matrix vector multiplication with hierarchical row reductions on FPGAs with high bandwidth memory

AR Tareen, M Meyer, C Plessl… - 2024 IEEE 32nd Annual …, 2024 - ieeexplore.ieee.org

The multiplication of a sparse matrix with a dense vector is a vital operation in linear algebra,
with applications in numerous contexts. After earlier research on FPGA acceleration of this …

Cited by 1 · All 2 versions

Partitioning Large-Scale, Multi-FPGA Applications for the Data Center

M Mazraeli, Y Gao, P Chow - 2023 33rd International …, 2023 - ieeexplore.ieee.org

With the deployment of FPGAs in a data center, there is the opportunity to build large multi-
FPGA applications. In this paper, we design a partitioner to address the problem of efficiently …

Cited by 7 · All 2 versions

[PDF] github.io

[PDF] Latency Insensitivity Testing for Dataflow HLS Designs

J Cheng, L Wang, Z Jiang, Y Bao… - The 33rd ACM/SIGDA …, 2024 - jianyicheng.github.io

Dataflow high-level synthesis (HLS) tools automatically map a high-level software program
to a dataflow hardware design. When testing the design, the HLS tool takes a testing …

All 2 versions
