Orchestrating the development lifecycle of machine learning-based IoT applications: A taxonomy and survey
Machine Learning (ML) and Internet of Things (IoT) are complementary advances: ML
techniques unlock the potential of IoT with intelligence, and IoT applications increasingly …
techniques unlock the potential of IoT with intelligence, and IoT applications increasingly …
A survey on scheduling techniques in computing and network convergence
S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …
computing power. This trend results in the urgent need for higher-level computing resource …
[HTML][HTML] Pre-trained models: Past, present and future
Large-scale pre-trained models (PTMs) such as BERT and GPT have recently achieved
great success and become a milestone in the field of artificial intelligence (AI). Owing to …
great success and become a milestone in the field of artificial intelligence (AI). Owing to …
Zero-infinity: Breaking the gpu memory wall for extreme scale deep learning
In the last three years, the largest dense deep learning models have grown over 1000x to
reach hundreds of billions of parameters, while the GPU memory has only grown by 5x (16 …
reach hundreds of billions of parameters, while the GPU memory has only grown by 5x (16 …
Zero: Memory optimizations toward training trillion parameter models
Large deep learning models offer significant accuracy gains, but training billions to trillions
of parameters is challenging. Existing solutions such as data and model parallelisms exhibit …
of parameters is challenging. Existing solutions such as data and model parallelisms exhibit …
PanGu-: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation
Large-scale Pretrained Language Models (PLMs) have become the new paradigm for
Natural Language Processing (NLP). PLMs with hundreds of billions parameters such as …
Natural Language Processing (NLP). PLMs with hundreds of billions parameters such as …
Wireless network intelligence at the edge
Fueled by the availability of more data and computing power, recent breakthroughs in cloud-
based machine learning (ML) have transformed every aspect of our lives from face …
based machine learning (ML) have transformed every aspect of our lives from face …
DAPPLE: A pipelined data parallel approach for training large models
It is a challenging task to train large DNN models on sophisticated GPU platforms with
diversified interconnect capabilities. Recently, pipelined training has been proposed as an …
diversified interconnect capabilities. Recently, pipelined training has been proposed as an …
A generic communication scheduler for distributed DNN training acceleration
We present ByteScheduler, a generic communication scheduler for distributed DNN training
acceleration. ByteScheduler is based on our principled analysis that partitioning and …
acceleration. ByteScheduler is based on our principled analysis that partitioning and …
P3: Distributed deep graph learning at scale
Graph Neural Networks (GNNs) have gained significant attention in the recent past, and
become one of the fastest growing subareas in deep learning. While several new GNN …
become one of the fastest growing subareas in deep learning. While several new GNN …