Blockmaestro: Enabling programmer-transparent task-based execution in gpu systems
As modern GPU workloads grow in size and complexity, there is an ever-increasing demand
for GPU computational power. Emerging workloads contain hundreds or thousands of GPU …
for GPU computational power. Emerging workloads contain hundreds or thousands of GPU …
Equinox: Training (for free) on a custom inference accelerator
DNN inference accelerators executing online services exhibit low average loads because of
service demand variability, leading to poor resource utilization. Unfortunately, reclaiming …
service demand variability, leading to poor resource utilization. Unfortunately, reclaiming …
Mozart: Taming taxes and composing accelerators with shared-memory
Resource-constrained system-on-chips (SoCs) are increasingly heterogeneous with
specialized accelerators for various tasks. Acceleration taxes due to control and data …
specialized accelerators for various tasks. Acceleration taxes due to control and data …
Processing in storage class memory
Storage and memory technologies are experiencing unprecedented transformation. Storage-
class memory (SCM) delivers near-DRAM performance in non-volatile storage media and …
class memory (SCM) delivers near-DRAM performance in non-volatile storage media and …
ACE-HoT: A ccelerating an Extreme Amount of Symmetric C ipher E valuations for (H igh-o rder) Avalanche T ests
In this work, we tackle the problem of estimating the security of iterated symmetric ciphers in
an efficient manner, with tests that do not require a deep analysis of the internal structure of …
an efficient manner, with tests that do not require a deep analysis of the internal structure of …
SLePaaS: An Embedded Platform-as-a-Service Facilitating Research on Thermal Management of Embedded Platforms
In this paper, we present to the embedded research community an Embedded Platform as a
Service facility named SLePaaS that allows researchers remote access to experimental …
Service facility named SLePaaS that allows researchers remote access to experimental …
Equinox: Training (for Free) on a Custom Inference Accelerator
MP Drumond Lages De Oliveira… - Proceedings of the …, 2021 - infoscience.epfl.ch
DNN infrastructure has observed an explosion in investment due to the increasing popularity
of DNNs in online services [16, 18, 22]. Unfortunately, a significant fraction of this investment …
of DNNs in online services [16, 18, 22]. Unfortunately, a significant fraction of this investment …
[書籍][B] Improving Data-Dependent Parallelism in GPUs Through Programmer-Transparent Architectural Support
AA Abdolrashidi - 2021 - search.proquest.com
As modern GPU workloads become larger and more complex, there is an ever-increasing
demand for GPU computational power. Traditionally, GPUs have lacked generalized data …
demand for GPU computational power. Traditionally, GPUs have lacked generalized data …
ACE-HoT: Accelerating an Extreme Amount of Symmetric Cipher Evaluations for (High-order) Avalanche Tests
J Daemen, S El Hirch - Progress in Cryptology–LATINCRYPT 2023 - Springer
In this work, we tackle the problem of estimating the security of iterated symmetric ciphers in
an efficient manner, with tests that do not require a deep analysis of the internal structure of …
an efficient manner, with tests that do not require a deep analysis of the internal structure of …