Edge learning using a fully integrated neuro-inspired memristor chip
Learning is highly important for edge intelligence devices to adapt to different application
scenes and owners. Current technologies for training neural networks require moving …
scenes and owners. Current technologies for training neural networks require moving …
Simba: Scaling deep-learning inference with multi-chip-module-based architecture
Package-level integration using multi-chip-modules (MCMs) is a promising approach for
building large-scale systems. Compared to a large monolithic die, an MCM combines many …
building large-scale systems. Compared to a large monolithic die, an MCM combines many …
Architecture of computing system based on chiplet
Computing systems are widely used in medical diagnosis, climate prediction, autonomous
vehicles, etc. As the key part of electronics, the performance of computing systems is crucial …
vehicles, etc. As the key part of electronics, the performance of computing systems is crucial …
Tensordimm: A practical near-memory processing architecture for embeddings and tensor operations in deep learning
Recent studies from several hyperscalars pinpoint to embedding layers as the most memory-
intensive deep learning (DL) algorithm being deployed in today's datacenters. This paper …
intensive deep learning (DL) algorithm being deployed in today's datacenters. This paper …
Non-deep networks
Latency is of utmost importance in safety-critical systems. In neural networks, lowest
theoretical latency is dependent on the depth of the network. This begs the question--is it …
theoretical latency is dependent on the depth of the network. This begs the question--is it …
Using Chiplet Encapsulation Technology to Achieve Processing-in-Memory Functions
W Tian, B Li, Z Li, H Cui, J Shi, Y Wang, J Zhao - Micromachines, 2022 - mdpi.com
With the rapid development of 5G, artificial intelligence (AI), and high-performance
computing (HPC), there is a huge increase in the data exchanged between the processor …
computing (HPC), there is a huge increase in the data exchanged between the processor …
A survey on deep learning hardware accelerators for heterogeneous hpc platforms
Recent trends in deep learning (DL) imposed hardware accelerators as the most viable
solution for several classes of high-performance computing (HPC) applications such as …
solution for several classes of high-performance computing (HPC) applications such as …
Summarizing CPU and GPU design trends with product data
Moore's Law and Dennard Scaling have guided the semiconductor industry for the past few
decades. Recently, both laws have faced validity challenges as transistor sizes approach …
decades. Recently, both laws have faced validity challenges as transistor sizes approach …
Centaur: A chiplet-based, hybrid sparse-dense accelerator for personalized recommendations
Personalized recommendations are the backbone machine learning (ML) algorithm that
powers several important application domains (eg, ads, e-commerce, etc) serviced from …
powers several important application domains (eg, ads, e-commerce, etc) serviced from …
Energy-efficient artificial intelligence of things with intelligent edge
Artificial Intelligence of Things (AIoT) is an emerging area of future Internet of Things (IoT) to
support intelligent IoT applications. In AIoT, intelligent edge computing technologies …
support intelligent IoT applications. In AIoT, intelligent edge computing technologies …