Blockmaestro: Enabling programmer-transparent task-based execution in gpu systems

AA Abdolrashidi, HA Esfeden… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
As modern GPU workloads grow in size and complexity, there is an ever-increasing demand
for GPU computational power. Emerging workloads contain hundreds or thousands of GPU …

Equinox: Training (for free) on a custom inference accelerator

M Drumond, L Coulon, A Pourhabibi… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
DNN inference accelerators executing online services exhibit low average loads because of
service demand variability, leading to poor resource utilization. Unfortunately, reclaiming …

Mozart: Taming taxes and composing accelerators with shared-memory

V Suresh, B Mishra, Y **g, Z Zhu, N **… - Proceedings of the …, 2024 - dl.acm.org
Resource-constrained system-on-chips (SoCs) are increasingly heterogeneous with
specialized accelerators for various tasks. Acceleration taxes due to control and data …

Processing in storage class memory

J Nider, C Mustard, A Zoltan, A Fedorova - … on Hot Topics in Storage and …, 2020 - usenix.org
Storage and memory technologies are experiencing unprecedented transformation. Storage-
class memory (SCM) delivers near-DRAM performance in non-volatile storage media and …

ACE-HoT: A ccelerating an Extreme Amount of Symmetric C ipher E valuations for (H igh-o rder) Avalanche T ests

E Bellini, J Grados, M Rachidi, N Satpute… - … on Cryptology and …, 2023 - Springer
In this work, we tackle the problem of estimating the security of iterated symmetric ciphers in
an efficient manner, with tests that do not require a deep analysis of the internal structure of …

SLePaaS: An Embedded Platform-as-a-Service Facilitating Research on Thermal Management of Embedded Platforms

R Kumar, A Sachan, B Ghoshal - IEEE Access, 2022 - ieeexplore.ieee.org
In this paper, we present to the embedded research community an Embedded Platform as a
Service facility named SLePaaS that allows researchers remote access to experimental …

Equinox: Training (for Free) on a Custom Inference Accelerator

MP Drumond Lages De Oliveira… - Proceedings of the …, 2021 - infoscience.epfl.ch
DNN infrastructure has observed an explosion in investment due to the increasing popularity
of DNNs in online services [16, 18, 22]. Unfortunately, a significant fraction of this investment …

[書籍][B] Improving Data-Dependent Parallelism in GPUs Through Programmer-Transparent Architectural Support

AA Abdolrashidi - 2021 - search.proquest.com
As modern GPU workloads become larger and more complex, there is an ever-increasing
demand for GPU computational power. Traditionally, GPUs have lacked generalized data …

Accelerating network function virtualization

M Lubeznov - 2021 - open.library.ubc.ca
Abstract Network function virtualization (NFV)[50] is increasingly used to implement network
operations traditionally implemented in customized ASICs. NFV employs commodity …

ACE-HoT: Accelerating an Extreme Amount of Symmetric Cipher Evaluations for (High-order) Avalanche Tests

J Daemen, S El Hirch - Progress in Cryptology–LATINCRYPT 2023 - Springer
In this work, we tackle the problem of estimating the security of iterated symmetric ciphers in
an efficient manner, with tests that do not require a deep analysis of the internal structure of …