Performance enhancement of artificial intelligence: A survey

M Krichen, MS Abdalzaher - Journal of Network and Computer Applications, 2024 - Elsevier
The advent of machine learning (ML) and artificial intelligence (AI) has brought about a
significant transformation across multiple industries, as it has facilitated the automation of …

A survey of resource-efficient llm and multimodal foundation models

M Xu, W Yin, D Cai, R Yi, D Xu, Q Wang, B Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large foundation models, including large language models (LLMs), vision transformers
(ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine …

Efficient large-scale language model training on gpu clusters using megatron-lm

D Narayanan, M Shoeybi, J Casper… - Proceedings of the …, 2021 - dl.acm.org
Large language models have led to state-of-the-art accuracies across several tasks.
However, training these models efficiently is challenging because: a) GPU memory capacity …
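Megatron-LM's answer to the memory limit is to split individual layers across GPUs (tensor model parallelism), in addition to pipeline and data parallelism. The following is a minimal PyTorch sketch of the column-parallel linear layer idea only; the class name and structure are invented for illustration and are not Megatron-LM's API.

    # Column-parallel linear layer in the Megatron-LM style: the weight
    # matrix is split column-wise so each rank stores and multiplies
    # only its shard. Illustrative sketch, not Megatron-LM's actual code.
    import torch

    class ColumnParallelLinear(torch.nn.Module):
        def __init__(self, in_features, out_features, world_size):
            super().__init__()
            assert out_features % world_size == 0
            # Each rank keeps out_features / world_size output columns.
            self.weight = torch.nn.Parameter(
                torch.empty(out_features // world_size, in_features))
            torch.nn.init.xavier_uniform_(self.weight)

        def forward(self, x):
            # The local matmul yields this rank's shard of the output;
            # an all-gather (omitted here) would reassemble the full tensor.
            return torch.nn.functional.linear(x, self.weight)

    layer = ColumnParallelLinear(in_features=1024, out_features=4096,
                                 world_size=8)
    shard = layer(torch.randn(2, 1024))  # per-rank output shape: (2, 512)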

PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation

W Zeng, X Ren, T Su, H Wang, Y Liao, Z Wang… - arXiv preprint arXiv …, 2021 - arxiv.org
Large-scale Pretrained Language Models (PLMs) have become the new paradigm for
Natural Language Processing (NLP). PLMs with hundreds of billions of parameters such as …

Resource-efficient algorithms and systems of foundation models: A survey

M Xu, D Cai, W Yin, S Wang, X Jin, X Liu - ACM Computing Surveys, 2025 - dl.acm.org
Large foundation models, including large language models, vision transformers, diffusion,
and large language model based multimodal models, are revolutionizing the entire machine …

Decentralized training of foundation models in heterogeneous environments

B Yuan, Y He, J Davis, T Zhang… - Advances in …, 2022 - proceedings.neurips.cc
Training foundation models, such as GPT-3 and PaLM, can be extremely expensive, often
involving tens of thousands of GPUs running continuously for months. These models are …

GNNLab: a factored system for sample-based GNN training over GPUs

J Yang, D Tang, X Song, L Wang, Q Yin… - Proceedings of the …, 2022 - dl.acm.org
We propose GNNLab, a sample-based GNN training system in a single machine multi-GPU
setup. GNNLab adopts a factored design for multiple GPUs, where each GPU is dedicated to …
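To make the factored design concrete, here is a toy Python sketch in which dedicated sampler workers produce mini-batches and trainer workers consume them through a shared queue; the role names and the queue handoff are assumptions for illustration, not GNNLab's implementation.

    # Toy factored design: one worker per GPU role. "Samplers" produce
    # mini-batch seeds (standing in for GPU-side neighbor sampling) and
    # "trainers" consume them (standing in for forward/backward passes).
    import queue
    import random
    import threading

    batch_queue = queue.Queue(maxsize=8)

    def sampler(gpu_id, num_batches):
        for i in range(num_batches):
            seeds = [random.randrange(1000) for _ in range(32)]
            batch_queue.put((gpu_id, i, seeds))

    def trainer(gpu_id, num_batches):
        for _ in range(num_batches):
            src, i, seeds = batch_queue.get()
            print(f"trainer GPU {gpu_id}: batch {i} from sampler GPU {src}")

    workers = [threading.Thread(target=sampler, args=(0, 4)),
               threading.Thread(target=trainer, args=(1, 4))]
    for w in workers:
        w.start()
    for w in workers:
        w.join()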

P3: Distributed deep graph learning at scale

S Gandhi, AP Iyer - 15th USENIX Symposium on Operating Systems …, 2021 - usenix.org
Graph Neural Networks (GNNs) have gained significant attention in the recent past, and
become one of the fastest growing subareas in deep learning. While several new GNN …

Oobleck: Resilient distributed training of large models using pipeline templates

I Jang, Z Yang, Z Zhang, X Jin… - Proceedings of the 29th …, 2023 - dl.acm.org
Oobleck enables resilient distributed training of large DNN models with guaranteed fault
tolerance. It takes a planning-execution co-design approach, where it first generates a set of …
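As a rough illustration of the planning step, assume a pipeline template is just a precomputed partition of layers into stages for each node count that might survive a failure; recovery then becomes a table lookup instead of a full restart. The template structure below is invented for the sketch.

    # Toy pipeline templates: for every possible surviving node count,
    # precompute an even partition of layers into pipeline stages.
    def make_templates(num_layers, min_nodes, max_nodes):
        templates = {}
        for n in range(min_nodes, max_nodes + 1):
            base, extra = divmod(num_layers, n)
            stages, start = [], 0
            for s in range(n):
                size = base + (1 if s < extra else 0)
                stages.append(list(range(start, start + size)))
                start += size
            templates[n] = stages
        return templates

    templates = make_templates(num_layers=24, min_nodes=4, max_nodes=8)
    # After a node failure, reinstantiate from the matching template.
    print(templates[7])  # stage layout for 7 surviving nodes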

Lyra: Elastic scheduling for deep learning clusters

J Li, H Xu, Y Zhu, Z Liu, C Guo, C Wang - Proceedings of the Eighteenth …, 2023 - dl.acm.org
Organizations often build separate training and inference clusters for deep learning, and use
separate schedulers to manage them. This leads to problems for both: inference clusters …
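A toy sketch of the capacity-loaning idea behind elastic scheduling: when the inference cluster has idle GPUs, loan them to training, and reclaim them as inference load rises. The greedy loan/reclaim policy below is an assumption for illustration, not Lyra's actual scheduler.

    # Greedy capacity loaning between an inference cluster and training.
    def schedule(load, total_gpus, loaned):
        idle = total_gpus - load - loaned
        if idle > 0:
            loaned += idle                 # loan spare GPUs to training
        elif idle < 0:
            loaned -= min(loaned, -idle)   # preempt training for inference
        return loaned

    loaned = 0
    for load in [2, 3, 6, 8]:              # inference demand over time
        loaned = schedule(load, total_gpus=8, loaned=loaned)
        print(f"inference load={load}, GPUs loaned to training={loaned}")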