Distributed artificial intelligence empowered by end-edge-cloud computing: A survey

S Duan, D Wang, J Ren, F Lyu, Y Zhang… - … Surveys & Tutorials, 2022 - ieeexplore.ieee.org
As the computing paradigm shifts from cloud computing to end-edge-cloud computing, it
also supports artificial intelligence evolving from a centralized manner to a distributed one …

Performance enhancement of artificial intelligence: A survey

M Krichen, MS Abdalzaher - Journal of Network and Computer Applications, 2024 - Elsevier
The advent of machine learning (ML) and Artificial intelligence (AI) has brought about a
significant transformation across multiple industries, as it has facilitated the automation of …

DAPPLE: A pipelined data parallel approach for training large models

S Fan, Y Rong, C Meng, Z Cao, S Wang… - Proceedings of the 26th …, 2021 - dl.acm.org
It is a challenging task to train large DNN models on sophisticated GPU platforms with
diversified interconnect capabilities. Recently, pipelined training has been proposed as an …

Contrastive machine learning reveals the structure of neuroanatomical variation within autism

A Aglinskas, JK Hartshorne, S Anzellotti - Science, 2022 - science.org
Autism spectrum disorder (ASD) is highly heterogeneous. Identifying systematic individual
differences in neuroanatomy could inform diagnosis and personalized interventions. The …

Optimus-cc: Efficient large nlp model training with 3d parallelism aware communication compression

J Song, J Yim, J Jung, H Jang, HJ Kim, Y Kim… - Proceedings of the 28th …, 2023 - dl.acm.org
In training of modern large natural language processing (NLP) models, it has become a
common practice to split models using 3D parallelism to multiple GPUs. Such technique …

Energy-and area-efficient CMOS synapse and neuron for spiking neural networks with STDP learning

B Joo, JW Han, BS Kong - … on Circuits and Systems I: Regular …, 2022 - ieeexplore.ieee.org
This paper proposes CMOS synapse and neuron for use in spiking neural networks to
perform cognitive functions in a bio-inspired manner. The proposed synapse can trace the …

Multi-GPU parallel acceleration scheme for meshfree peridynamic simulations

X Wang, S Li, W Dong, B An, H Huang, Q He… - Theoretical and Applied …, 2024 - Elsevier
Peridynamics (PD) using integral equations has unique advantages in modeling structural
damage evolution, but the expensive computational and memory costs associated with the …

A hybrid fuzzy convolutional neural network based mechanism for photovoltaic cell defect detection with electroluminescence images

C Ge, Z Liu, L Fang, H Ling, A Zhang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
In the intelligent manufacturing process of solar photovoltaic (PV) cells, the automatic defect
detection system using the Industrial Internet of Things (IIoT) smart cameras and sensors …

XPipe: Efficient pipeline model parallelism for multi-GPU DNN training

L Guan, W Yin, D Li, X Lu - arxiv preprint arxiv:1911.04610, 2019 - arxiv.org
We propose XPipe, an efficient asynchronous pipeline model parallelism approach for multi-
GPU DNN training. XPipe is designed to use multiple GPUs to concurrently and …