ARQUIN: architectures for multinode superconducting quantum computers

J Ang, G Carini, Y Chen, I Chuang, M Demarco… - ACM Transactions on …, 2024 - dl.acm.org
Many proposals to scale quantum technology rely on modular or distributed designs
wherein individual quantum processors, called nodes, are linked together to form one large …

Dalorex: A data-local program execution and architecture for memory-bound applications

M Orenes-Vera, E Tureci, D Wentzlaff… - … Symposium on High …, 2023 - ieeexplore.ieee.org
Applications with low data reuse and frequent irregular memory accesses, such as graph or
sparse linear algebra workloads, fail to scale well due to memory bottlenecks and poor core …

Muchisim: A simulation framework for design exploration of multi-chip manycore systems

M Orenes-Vera, E Tureci, M Martonosi… - … Analysis of Systems …, 2024 - ieeexplore.ieee.org
The design space exploration of scaled-out manycores for communication-intensive
applications (eg, graph analytics and sparse linear algebra) is hampered due to either lack …

AutoCC: Automatic Discovery of Covert Channels in Time-Shared Hardware

M Orenes-Vera, H Yun, N Wistoff, G Heiser… - Proceedings of the 56th …, 2023 - dl.acm.org
Covert channels enable information leakage between security domains that should be
isolated by observing execution differences in shared hardware. These channels can …

HotTiles: Accelerating SpMM with Heterogeneous Accelerator Architectures

G Gerogiannis, S Aananthakrishnan… - … Symposium on High …, 2024 - ieeexplore.ieee.org
Sparse Matrix Dense Matrix Multiplication (SpMM) is an important kernel with application
across a wide range of domains, including machine learning and linear algebra solvers. In …

Cohort: Software-oriented acceleration for heterogeneous socs

T Wei, N Turtayeva, M Orenes-Vera, O Lonkar… - Proceedings of the 28th …, 2023 - dl.acm.org
Philosophically, our approaches to acceleration focus on the extreme. We must optimise
accelerators to the maximum, leaving software to fix any hardware-software mismatches …

SMAPPIC: Scalable multi-FPGA architecture prototype platform in the cloud

G Chirkov, D Wentzlaff - Proceedings of the 28th ACM International …, 2023 - dl.acm.org
Traditionally, architecture prototypes are built on top of FPGA infrastructure, with two
associated problems. First, very large FPGAs are prohibitively expensive for most people …

Seizing the bandwidth scaling of on-package interconnect in a post-Moore's law world

G Chirkov, D Wentzlaff - … of the 37th International Conference on …, 2023 - dl.acm.org
The slowing and forecasted end of Moore's Law have forced designers to look beyond
simply adding transistors, encouraging them to employ other unused resources as a manner …

Massive data-centric parallelism in the chiplet era

M Orenes-Vera, E Tureci, D Wentzlaff… - arxiv preprint arxiv …, 2023 - arxiv.org
Recent works have introduced task-based parallelization schemes to accelerate graph
search and sparse data-structure traversal, where some solutions scale up to thousands of …

An architecture interface and offload model for low-overhead, near-data, distributed accelerators

S Baskaran, MT Kandemir… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
The performance and energy costs of coordinating and performing data movement have led
to proposals adding compute units and/or specialized access units to the memory hierarchy …