Architectural implications of function-as-a-service computing
Serverless computing is a rapidly growing cloud application model, popularized by
Amazon's Lambda platform. Serverless cloud services provide fine-grained provisioning of …
Amazon's Lambda platform. Serverless cloud services provide fine-grained provisioning of …
Llmcompass: Enabling efficient hardware design for large language model inference
The past year has witnessed the increasing popularity of Large Language Models (LLMs).
Their unprecedented scale and associated high hardware cost have impeded their broader …
Their unprecedented scale and associated high hardware cost have impeded their broader …
Optimus prime: Accelerating data transformation in servers
Modern online services are shifting away from monolithic applications to loosely-coupled
microservices because of their improved scalability, reliability, programmability and …
microservices because of their improved scalability, reliability, programmability and …
BYOC: a" bring your own core" framework for heterogeneous-ISA research
Heterogeneous architectures and heterogeneous-ISA designs are growing areas of
computer architecture and system software research. Unfortunately, this line of research is …
computer architecture and system software research. Unfortunately, this line of research is …
Dalorex: A data-local program execution and architecture for memory-bound applications
Applications with low data reuse and frequent irregular memory accesses, such as graph or
sparse linear algebra workloads, fail to scale well due to memory bottlenecks and poor core …
sparse linear algebra workloads, fail to scale well due to memory bottlenecks and poor core …
A deep reinforcement learning framework for architectural exploration: A routerless NoC case study
Machine learning applied to architecture design presents a promising opportunity with broad
applications. Recent deep reinforcement learning (DRL) techniques, in particular, enable …
applications. Recent deep reinforcement learning (DRL) techniques, in particular, enable …
[PDF][PDF] OpenPiton+ Ariane: The first open-source, SMP Linux-booting RISC-V system scaling from one to many cores
ABSTRACT This paper introduces OpenPiton+ Ariane, a permissively-licensed open-source
framework designed to enable scalable architecture research prototypes. With the recent …
framework designed to enable scalable architecture research prototypes. With the recent …
Simmani: Runtime power modeling for arbitrary RTL with automatic signal selection
This paper presents a novel runtime power modeling methodology which automatically
identifies key signals for power dissipation of any RTL design. The toggle-pattern matrix is …
identifies key signals for power dissipation of any RTL design. The toggle-pattern matrix is …
[HTML][HTML] Energy efficiency of inference algorithms for clinical laboratory data sets: Green artificial intelligence study
Background The use of artificial intelligence (AI) in the medical domain has attracted
considerable research interest. Inference applications in the medical domain require energy …
considerable research interest. Inference applications in the medical domain require energy …
FlooNoC: A 645-Gb/s/link 0.15-pJ/B/hop Open-Source NoC With Wide Physical Links and End-to-End AXI4 Parallel Multistream Support
The new generation of domain-specific AI accelerators is characterized by rapidly increasing
demands for bulk data transfers, as opposed to small, latency-critical cache line transfers …
demands for bulk data transfers, as opposed to small, latency-critical cache line transfers …