DataPerf: Benchmarks for data-centric AI development
Machine learning research has long focused on models rather than datasets, and
prominent datasets are used for common ML tasks without regard to the breadth, difficulty …
A survey of on-device machine learning: An algorithms and learning theory perspective
The predominant paradigm for using machine learning models on a device is to train a
model in the cloud and perform inference using the trained model on the device. However …
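As a rough illustration of that cloud-train, device-infer split, a generic PyTorch sketch (not code from the survey; the tiny model and the model.pt path are placeholders) might look like this:

```python
# Sketch only: train/export off-device, run inference on-device.
import torch

# --- "cloud" side: train the model (training loop omitted) and export it ---
model = torch.nn.Sequential(torch.nn.Linear(16, 8), torch.nn.ReLU(), torch.nn.Linear(8, 2))
scripted = torch.jit.script(model)      # serialize to a self-contained artifact
scripted.save("model.pt")               # placeholder path

# --- device side: load the artifact and run inference only ---
device_model = torch.jit.load("model.pt")
device_model.eval()
with torch.no_grad():
    prediction = device_model(torch.randn(1, 16))
```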
Priority-based parameter propagation for distributed DNN training
Data parallel training is widely used for scaling distributed deep neural network (DNN)
training. However, the performance benefits are often limited by the communication-heavy …
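For context, a baseline data-parallel loop of the kind such systems optimize might look like the following generic PyTorch DistributedDataParallel sketch (not the paper's system; assumes a `torchrun --nproc_per_node=N` launch so the process group can read RANK/WORLD_SIZE from the environment):

```python
# Minimal data-parallel training sketch: each process holds a model replica and
# gradients are all-reduced across replicas during backward().
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    dist.init_process_group(backend="nccl")        # one process per GPU
    device = torch.device("cuda", dist.get_rank() % torch.cuda.device_count())

    model = DDP(torch.nn.Linear(1024, 10).to(device), device_ids=[device.index])
    optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

    for _ in range(100):                           # dummy data stands in for a real loader
        x = torch.randn(32, 1024, device=device)
        y = torch.randint(0, 10, (32,), device=device)
        loss = torch.nn.functional.cross_entropy(model(x), y)
        optimizer.zero_grad()
        loss.backward()                            # gradient communication happens here
        optimizer.step()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```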
Gist: Efficient data encoding for deep neural network training
Modern deep neural network (DNN) training typically relies on GPUs to train complex
hundred-layer deep networks. A significant problem facing both researchers and industry …
Analysis of DAWNBench, a time-to-accuracy machine learning performance benchmark
Researchers have proposed hardware, software, and algorithmic optimizations to improve
the computational performance of deep learning. While some of these optimizations perform …
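The time-to-accuracy metric at the heart of DAWNBench is straightforward to express; a hedged sketch (not the benchmark's own harness; `train_one_epoch` and `evaluate` are hypothetical callables supplied by the user):

```python
# Train until a target validation accuracy is reached; report elapsed wall-clock time.
import time

def time_to_accuracy(train_one_epoch, evaluate, target=0.93, max_epochs=100):
    start = time.perf_counter()
    for epoch in range(max_epochs):
        train_one_epoch()
        accuracy = evaluate()                 # validation accuracy in [0, 1]
        if accuracy >= target:
            return time.perf_counter() - start, epoch + 1
    return None, max_epochs                   # target accuracy never reached
```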
Parameter Hub: a rack-scale parameter server for distributed deep neural network training
Distributed deep neural network (DDNN) training constitutes an increasingly important
workload that frequently runs in the cloud. Larger DNN models and faster compute engines …
An overview of the data-loader landscape: Comparative performance analysis
The efficiency of Deep Learning (DL) training jobs is critically dependent on dataloaders,
which facilitate the transfer of data from storage to DL-accelerated hardware during training …
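The storage-to-accelerator path these loaders implement is roughly the following (a generic PyTorch DataLoader sketch, not any specific loader from the comparison; `RandomImages` is a stand-in for a dataset that would actually read from storage):

```python
# Worker processes read and decode samples; pinned-memory batches are copied to the GPU.
import torch
from torch.utils.data import DataLoader, Dataset

class RandomImages(Dataset):
    """Stand-in dataset; a real one would load and decode files from storage."""
    def __len__(self):
        return 10_000
    def __getitem__(self, idx):
        return torch.randn(3, 224, 224), idx % 10

loader = DataLoader(RandomImages(), batch_size=64, num_workers=4,
                    pin_memory=True, shuffle=True)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
for images, labels in loader:
    images = images.to(device, non_blocking=True)   # async host-to-device copy
    labels = labels.to(device, non_blocking=True)
    # ... forward/backward pass would go here ...
```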
DLBench: a comprehensive experimental evaluation of deep learning frameworks
Deep Learning (DL) has achieved remarkable progress over the last decade on various
tasks such as image recognition, speech recognition, and natural language processing. In …
A modular benchmarking infrastructure for high-performance and reproducible deep learning
We introduce Deep500: the first customizable benchmarking infrastructure that enables fair
comparison of the plethora of deep learning frameworks, algorithms, libraries, and …
Effect of neural network structure in accelerating performance and accuracy of a convolutional neural network with GPU/TPU for image analytics
In deep learning, the most significant breakthrough in the fields of image
recognition, object detection, and language processing was made by Convolutional Neural …