Enabling resource-efficient AIoT system with cross-level optimization: A survey
The emerging field of artificial intelligence of things (AIoT, AI + IoT) is driven by the
widespread use of intelligent infrastructures and the impressive success of deep learning …
A survey on spatio-temporal big data analytics ecosystem: Resource management, processing platform, and applications
With the rapid evolution of the Internet, Internet of Things (IoT), and geographic information
systems (GIS), spatio-temporal Big Data (STBD) is experiencing exponential growth …
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training
DNNs of increasing computational complexity have achieved unprecedented successes in
various areas such as machine vision and natural language processing (NLP), e.g., the …
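For context, a minimal GPipe-style micro-batching sketch of what pipeline-parallel training means (a generic illustration, not vPipe's system; in practice the two stages sit on different GPUs so that micro-batches let their work overlap):

```python
# Generic micro-batched pipeline sketch (illustration only, not vPipe).
# In a real pipeline, stage1 and stage2 live on different GPUs; splitting
# the mini-batch into micro-batches is what lets the stages overlap.
import torch
import torch.nn as nn

stage1 = nn.Linear(16, 32)        # hypothetical first pipeline stage
stage2 = nn.Linear(32, 4)         # hypothetical second pipeline stage
batch = torch.randn(8, 16)

micro_batches = batch.chunk(4)    # 4 micro-batches of 2 samples each
outputs = [stage2(stage1(mb)) for mb in micro_batches]
y = torch.cat(outputs)            # reassemble the full mini-batch output
print(y.shape)                    # torch.Size([8, 4])
```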
Fine-tuning giant neural networks on commodity hardware with automatic pipeline model parallelism
Fine-tuning is an increasingly common technique that leverages transfer learning to
dramatically expedite the training of huge, high-quality models. Critically, fine-tuning holds …
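As a minimal sketch of the fine-tuning pattern the snippet refers to (the backbone, head, and data below are placeholders, not this paper's pipeline-parallel setup): freeze the pretrained weights and train only a small task head.

```python
# Minimal fine-tuning sketch: freeze a pretrained backbone, train a new head.
# The modules and data are placeholders, not the paper's actual setup.
import torch
import torch.nn as nn
import torch.nn.functional as F

backbone = nn.Sequential(nn.Linear(128, 64), nn.ReLU())  # stands in for a pretrained model
head = nn.Linear(64, 10)                                 # new task-specific layer

for p in backbone.parameters():
    p.requires_grad = False       # transfer learning: keep pretrained weights fixed

opt = torch.optim.Adam(head.parameters(), lr=1e-3)
x, y = torch.randn(32, 128), torch.randint(0, 10, (32,))
loss = F.cross_entropy(head(backbone(x)), y)
loss.backward()                   # gradients flow only into the head
opt.step()
```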
Tsplit: Fine-grained GPU memory management for efficient DNN training via tensor splitting
As Deep Neural Networks (DNNs) grow deeper and larger, training them on
existing accelerators (e.g., GPUs) is challenging due to their limited device memory capacity …
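Tsplit's fine-grained memory manager is not reproduced here; as a toy illustration of the tensor-splitting idea named in the title, a large tensor can be processed one sub-tensor at a time so the largest temporary allocated is a fraction of the full size:

```python
# Toy illustration of tensor splitting (not Tsplit's actual mechanism):
# compute per-chunk so each intermediate is 1/8 of the full tensor.
import torch

x = torch.randn(4096, 1024)                   # a "large" tensor
row_norms = torch.cat([chunk.square().sum(dim=1).sqrt()  # small per-chunk temporary
                       for chunk in x.chunk(8, dim=0)])
assert torch.allclose(row_norms, x.norm(dim=1))
```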
Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers
S Singh, P Singhania, A Ranjan… - … Conference for High …, 2024 - ieeexplore.ieee.org
Training and fine-tuning large language models (LLMs) with hundreds of billions to trillions
of parameters requires tens of thousands of GPUs and a highly scalable software stack. In …
MegTaiChi: Dynamic tensor-based memory management optimization for DNN training
Z Hu, J Xiao, Z Deng, M Li, K Zhang, X Zhang… - Proceedings of the 36th …, 2022 - dl.acm.org
In real applications, it is common to train deep neural networks (DNNs) on modest clusters.
With the continuous increase of model size and batch size, the training of DNNs becomes …
FedDCT: Federated learning of large convolutional neural networks on resource-constrained devices using divide and collaborative training
In Federated Learning (FL), the size of local models matters. On the one hand, it is logical to
use large-capacity neural networks in pursuit of high performance. On the other hand, deep …
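FedDCT's divide-and-collaborative scheme itself is not sketched here; for context, the baseline it departs from is FedAvg-style weight averaging, where every client trains the full model and the server averages the results:

```python
# Plain FedAvg-style aggregation (context only; FedDCT instead has clients
# collaboratively train sub-models rather than one full large model each).
from typing import Dict, List
import torch
import torch.nn as nn

def average_states(states: List[Dict[str, torch.Tensor]]) -> Dict[str, torch.Tensor]:
    """Equal-weight average of client model state_dicts."""
    return {k: torch.stack([s[k].float() for s in states]).mean(dim=0)
            for k in states[0]}

# Usage with two toy "clients":
clients = [nn.Linear(4, 2).state_dict() for _ in range(2)]
global_state = average_states(clients)
```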
An oracle for guiding large-scale model/hybrid parallel training of convolutional neural networks
Deep Neural Network (DNN) frameworks use distributed training to enable faster time to
convergence and alleviate memory capacity limitations when training large models and/or …
PERKS: a locality-optimized execution model for iterative memory-bound GPU applications
Iterative memory-bound solvers commonly occur in HPC codes. Typical GPU
implementations have a loop on the host side that invokes the GPU kernel as many times as …
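The snippet describes the baseline pattern PERKS targets: a host loop that relaunches the same kernel every iteration, paying launch overhead and refetching data from device memory each time. A minimal sketch of that baseline (assuming CuPy and a CUDA device are available; PERKS itself instead uses a persistent kernel that keeps data resident on-chip across iterations):

```python
# Baseline pattern from the abstract: a host-side loop relaunching the same
# GPU kernel each iteration. PERKS replaces this with a persistent kernel.
import cupy as cp

jacobi_step = cp.RawKernel(r'''
extern "C" __global__
void jacobi_step(const float* src, float* dst, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i > 0 && i < n - 1)
        dst[i] = 0.5f * (src[i - 1] + src[i + 1]);  // 1D Jacobi stencil
}''', 'jacobi_step')

n = 1 << 20
a = cp.random.rand(n, dtype=cp.float32)
b = cp.zeros_like(a)
threads = 256
blocks = (n + threads - 1) // threads
for _ in range(100):                              # host loop: one launch per iteration
    jacobi_step((blocks,), (threads,), (a, b, cp.int32(n)))
    a, b = b, a                                   # swap buffers between launches
```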