A survey on hardware accelerators: Taxonomy, trends, challenges, and perspectives
In recent years, the limits of the multicore approach emerged in the so-called “dark silicon”
issue and diminishing returns of an ever-increasing core count. Hardware manufacturers …
Gemmini: Enabling systematic deep-learning architecture evaluation via full-stack integration
DNN accelerators are often developed and evaluated in isolation without considering the
cross-stack, system-level effects in real-world environments. This makes it difficult to …
Processors, methods, and systems with a configurable spatial accelerator
KE Fleming, KD Glossop, SC Steely Jr, J Tang… - US Patent …, 2020 - Google Patents
2017-08-09 Assigned to INTEL CORPORATION reassignment INTEL CORPORATION
ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors …
Mosaic: a GPU memory manager with application-transparent support for multiple page sizes
Contemporary discrete GPUs support rich memory management features such as virtual
memory and demand paging. These features simplify GPU programming by providing a …
Telekine: Secure computing with cloud GPUs
GPUs have become ubiquitous in the cloud due to the dramatic performance gains they
enable in domains such as machine learning and computer vision. However, offloading …
Thunderclap: Exploring vulnerabilities in operating system IOMMU protection via DMA from untrustworthy peripherals
AT Markettos, C Rothwell, BF Gutstein, A Pearce… - 2019 - repository.cam.ac.uk
Thunderclap: Exploring Vulnerabilities in Operating System IOMMU Protection via DMA
from Untrustworthy Peripherals Page 1 Thunderclap: Exploring Vulnerabilities in Operating …
Batch-aware unified memory management in GPUs for irregular workloads
While unified virtual memory and demand paging in modern GPUs provide convenient
abstractions to programmers for working with large-scale applications, they come at a …
G10: Enabling an efficient unified GPU memory and storage architecture with smart tensor migrations
To break the GPU memory wall for scaling deep learning workloads, a variety of architecture
and system techniques have been proposed recently. Their typical approaches include …
A framework for memory oversubscription management in graphics processing units
Modern discrete GPUs support unified memory and demand paging. Automatic
management of data movement between CPU memory and GPU memory dramatically …
Mask: Redesigning the GPU memory hierarchy to support multi-application concurrency
Graphics Processing Units (GPUs) exploit large amounts of thread-level parallelism to
provide high instruction throughput and to efficiently hide long-latency stalls. The resulting …