GPGPU performance and power estimation using machine learning
G Wu, JL Greathouse, A Lyashevsky… - 2015 IEEE 21st …, 2015 - ieeexplore.ieee.org
Graphics Processing Units (GPUs) have numerous configuration and design options,
including core frequency, number of parallel compute units (CUs), and available memory …
A survey on techniques for cooperative CPU-GPU computing
Graphical Processing Unit provides massive parallelism due to the presence of
hundreds of cores. Usage of GPUs for general purpose computation (GPGPU) has resulted …
Understanding co-running behaviors on integrated CPU/GPU architectures
Architecture designers tend to integrate both CPUs and GPUs on the same chip to deliver
energy-efficient designs. It is still an open problem to effectively leverage the advantages of …
Harmonia: Balancing compute and memory power in high-performance gpus
In this paper, we address the problem of efficiently managing the relative power demands of
a high-performance GPU and its memory subsystem. We develop a management approach …
A hybrid B+-tree as solution for in-memory indexing on CPU-GPU heterogeneous computing platforms
An in-memory indexing tree is a critical component of many databases. Modern many-core
processors, such as GPUs, are offering tremendous amounts of computing power making …
Exploring query processing on cpu-gpu integrated edge device
Huge amounts of data have been generated on edge devices every day, which requires
efficient data analytics and management. However, due to the limited computing capacity of …
Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications
Heterogeneous systems are ubiquitous in the field of High-Performance Computing (HPC).
Graphics processing units (GPUs) are widely used as accelerators for their enormous …
Harmonia: a high throughput B+ tree for GPUs
Z Yan, Y Lin, L Peng, W Zhang - Proceedings of the 24th symposium on …, 2019 - dl.acm.org
B+ tree is one of the most important data structures and has been widely used in different
fields. With the increase of concurrent queries and data-scale in storage, designing an …
Hybrid CPU/GPU/APU accelerated query, insert, update and erase operations in hash tables with string keys
Modern computer systems can use different types of hardware acceleration to achieve
massive performance improvements. Some accelerators like FPGA and dedicated GPU …
Parallel construction of Random Forest on GPU
There is tremendous growth of data generated from different industries, i.e., health,
agriculture, engineering, etc. Consequently, there is demand for more processing power …