Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Oblivm: A programming framework for secure computation
We design and develop ObliVM, a programming framework for secure computation. ObliVM
offers a domain specific language designed for compilation of programs into efficient …
offers a domain specific language designed for compilation of programs into efficient …
Mgpusim: Enabling multi-gpu performance modeling and optimization
The rapidly growing popularity and scale of data-parallel workloads demand a
corresponding increase in raw computational power of Graphics Processing Units (GPUs) …
corresponding increase in raw computational power of Graphics Processing Units (GPUs) …
Beyond the socket: NUMA-aware GPUs
GPUs achieve high throughput and power efficiency by employing many small single
instruction multiple thread (SIMT) cores. To minimize scheduling logic and performance …
instruction multiple thread (SIMT) cores. To minimize scheduling logic and performance …
The locality descriptor: A holistic cross-layer abstraction to express data locality in GPUs
Exploiting data locality in GPUs is critical to making more efficient use of the existing caches
and the NUMA-based memory hierarchy expected in future GPUs. While modern GPU …
and the NUMA-based memory hierarchy expected in future GPUs. While modern GPU …
Griffin: Hardware-software support for efficient page migration in multi-gpu systems
As transistor scaling becomes increasingly more difficult to achieve, scaling the core count
on a single GPU chip has also become extremely challenging. As the volume of data to …
on a single GPU chip has also become extremely challenging. As the volume of data to …
EAIS: Energy-aware adaptive scheduling for CNN inference on high-performance GPUs
C Yao, W Liu, W Tang, S Hu - Future Generation Computer Systems, 2022 - Elsevier
Recently, a large number of convolutional neural network (CNN) inference services have
emerged on high-performance Graphic Processing Units (GPUs). However, GPUs are high …
emerged on high-performance Graphic Processing Units (GPUs). However, GPUs are high …
vSoC: Efficient Virtual System-on-Chip on Heterogeneous Hardware
Emerging mobile apps such as UHD video and AR/VR access diverse high-throughput
hardware devices, eg, video codecs, cameras, and image processors. However, today's …
hardware devices, eg, video codecs, cameras, and image processors. However, today's …
DRAGON: breaking GPU memory capacity limits with direct NVM access
Heterogeneous computing with accelerators is growing in importance in high performance
computing (HPC). Recently, application datasets have expanded beyond the memory …
computing (HPC). Recently, application datasets have expanded beyond the memory …
Coda: Enabling co-location of computation and data for multiple gpu systems
To exploit parallelism and scalability of multiple GPUs in a system, it is critical to place
compute and data together. However, two key techniques that have been used to hide …
compute and data together. However, two key techniques that have been used to hide …
Exploiting adaptive data compression to improve performance and energy-efficiency of compute workloads in multi-GPU systems
Graphics Processing Unit (GPU) performance has relied heavily on our ability to scale of
number of transistors on chip, in order to satisfy the ever-increasing demands for more …
number of transistors on chip, in order to satisfy the ever-increasing demands for more …