TCDM Burst Access: Breaking the Bandwidth Barrier in Shared-L1 RVV Clusters Beyond 1000 FPUs

D Shen, Y Zhang, M Bertuletti, L Benini - arXiv preprint arXiv:2501.14370, 2025 - arxiv.org
As computing demand and memory footprint of deep learning applications accelerate,
clusters of cores sharing local (L1) multi-banked memory are widely used as key building …