[HTML][HTML] GPU-based embedded intelligence architectures and applications

LM Ang, KP Seng - Electronics, 2021 - mdpi.com
This paper present contributions to the state-of-the art for graphics processing unit (GPU-
based) embedded intelligence (EI) research for architectures and applications. This paper …

Oversubscribing gpu unified virtual memory: Implications and suggestions

C Shao, J Guo, P Wang, J Wang, C Li… - … of the 2022 ACM/SPEC on …, 2022 - dl.acm.org
Recent GPU architectures support unified virtual memory (UVM), which offers great
opportunities to solve larger problems by memory oversubscription. Although some studies …

Hpc ontology: Towards a unified ontology for managing training datasets and ai models for high-performance computing

C Liao, PH Lin, G Verma… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Machine learning (ML) techniques have been widely studied to address various challenges
of productively and efficiently running large-scale scientific applications on heterogeneous …

Hpcfair: Enabling fair ai for hpc applications

G Verma, M Emani, C Liao, PH Lin… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Artificial Intelligence (AI) is being adopted in different domains at an unprecedented scale. A
significant interest in the scientific community also involves leveraging machine learning …

Xunified: a framework for guiding optimal use of GPU Unified Memory

H Xu, PH Lin, M Emani, L Hu, C Liao - IEEE Access, 2022 - ieeexplore.ieee.org
Unified Memory is a single memory address space that is accessible by any processor
(GPUs or CPUs) in a system. NVIDIA's unified memory creates a pool of managed memory …

Co-concurrency mechanism for multi-GPUs in distributed heterogeneous environments

X Zhang, Z Tang, X Zhang, K Li - IEEE Transactions on Parallel …, 2022 - ieeexplore.ieee.org
The high concurrency and high throughput characteristics of graphics processing units
(GPUs) have made researchers continue to use it to optimize distributed parallel computing …

An incremental iterative acceleration architecture in distributed heterogeneous environments with GPUs for deep learning

X Zhang, Z Tang, L Du, L Yang - IEEE Transactions on Parallel …, 2021 - ieeexplore.ieee.org
The parallel computing capabilities of GPUs have a significant impact on computationally
intensive iterative tasks. Offloading part or all of the deep learning tasks from the CPU to the …

Making Machine Learning Datasets and Models FAIR for HPC: A Methodology and Case Study

PH Lin, C Liao, W Chen… - 2022 Fourth …, 2022 - ieeexplore.ieee.org
The FAIR Guiding Principles aim to improve the findability, accessibility, interoperability, and
reusability of digital content by making them both human and machine actionable. However …

Survey of shared register file design for unified shader array in GPUs

T Ze, Z Jun, R **anglong, F Feihu… - 2022 IEEE 9th …, 2022 - ieeexplore.ieee.org
Unified Shader Array (USA) is the computing core of the Unified Shader Array Graphic
Processing Unit (GPU), and the shader cores is the basic shader Unit of the Unified Shader …

[PDF][PDF] Heterogeneous computing with graphical processing unit: improvised back-propagation algorithm for water level prediction

N Singh, SP Panda - International Journal of Electrical and Computer …, 2022 - academia.edu
A multitude of research has been rising for predicting the behavior of different real-world
problems through machine learning models. An erratic nature occurs due to the augmented …