Sustainable AI: Environmental implications, challenges and opportunities
This paper explores the environmental impact of the super-linear growth trends for AI from a
holistic perspective, spanning Data, Algorithms, and System Hardware. We characterize the …
Communication-efficient distributed deep learning: A comprehensive survey
Distributed deep learning (DL) has become prevalent in recent years to reduce training time
by leveraging multiple computing devices (e.g., GPUs/TPUs) due to larger models and …
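The survey's subject is the communication step of synchronous data-parallel training. The toy sketch below simulates that step with NumPy, averaging per-worker gradients before each update (the all-reduce that communication-efficient methods try to compress or overlap); the model, synthetic data, and learning rate are illustrative assumptions, not anything taken from the survey.

    # Illustrative sketch: synchronous data-parallel SGD where each step
    # averages the workers' local gradients (the "all-reduce" step).
    import numpy as np

    rng = np.random.default_rng(0)
    n_workers, dim = 4, 8
    w = np.zeros(dim)                                 # shared model parameters
    data = rng.normal(size=(n_workers, 64, dim))      # one data shard per worker
    true_w = rng.normal(size=dim)
    targets = data @ true_w                           # synthetic regression targets

    def local_gradient(w, X, y):
        # Gradient of mean squared error on this worker's shard.
        return 2.0 * X.T @ (X @ w - y) / len(y)

    for step in range(200):
        grads = [local_gradient(w, data[k], targets[k]) for k in range(n_workers)]
        g = np.mean(grads, axis=0)                    # communication: average gradients
        w -= 0.01 * g                                 # identical update on every worker

    print("final error:", float(np.mean((data @ w - targets) ** 2)))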
LIRA: Learnable, imperceptible and robust backdoor attacks
Recently, machine learning models have been shown to be vulnerable to backdoor
attacks, primarily due to the lack of transparency in black-box models such as deep neural …
ZeRO-Infinity: Breaking the GPU memory wall for extreme scale deep learning
In the last three years, the largest dense deep learning models have grown over 1000x to
reach hundreds of billions of parameters, while the GPU memory has only grown by 5x (16 …
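The 1000x-versus-5x gap the abstract refers to is easiest to see with a quick back-of-the-envelope calculation; the model size, bytes-per-parameter estimate, and GPU capacity below are illustrative assumptions, not figures taken from the paper.

    # Rough memory-wall arithmetic (assumed numbers): model state for
    # mixed-precision Adam is often estimated at ~16 bytes per parameter
    # (fp16 weights + fp16 grads + fp32 weights, momentum, and variance).
    params = 100e9                 # hypothetical 100B-parameter dense model
    bytes_per_param = 16           # rough mixed-precision Adam state
    gpu_hbm_gb = 80                # e.g., one 80 GB HBM GPU

    state_gb = params * bytes_per_param / 1e9
    print(f"model state: {state_gb:.0f} GB")                        # ~1600 GB
    print(f"GPUs needed for state alone: {state_gb / gpu_hbm_gb:.0f}")  # ~20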
RecSSD: near data processing for solid state drive based recommendation inference
Neural personalized recommendation models are used across a wide variety of datacenter
applications including search, social media, and entertainment. State-of-the-art models …
Understanding training efficiency of deep learning recommendation models at scale
The use of GPUs has proliferated for machine learning workflows and is now considered
mainstream for many deep learning models. Meanwhile, when training state-of-the-art …
RecShard: statistical feature-based memory optimization for industry-scale neural recommendation
We propose RecShard, a fine-grained embedding table (EMB) partitioning and placement
technique for deep learning recommendation models (DLRMs). RecShard is designed …
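The snippet only hints at how the partitioning works, so the following is a hedged illustration of the general idea of statistics-driven placement (hot embedding rows in a fast-memory budget, the cold tail in slower memory); the thresholding heuristic and the Zipf-distributed access counts are assumptions for illustration, not RecShard's actual algorithm.

    # Illustrative sketch: split an embedding table by per-row access counts
    # so the hottest rows fit a fast-memory budget and the tail goes elsewhere.
    import numpy as np

    def split_by_access_stats(access_counts, row_bytes, fast_budget_bytes):
        order = np.argsort(access_counts)[::-1]        # most-accessed rows first
        n_fast = int(fast_budget_bytes // row_bytes)
        hot_rows = order[:n_fast]                      # place in the fast tier
        cold_rows = order[n_fast:]                     # place in the slower tier
        covered = access_counts[hot_rows].sum() / access_counts.sum()
        return hot_rows, cold_rows, covered

    # Skewed (Zipf-like) access counts are typical for sparse categorical features.
    counts = np.random.default_rng(0).zipf(1.2, size=1_000_000).astype(float)
    hot, cold, hit_rate = split_by_access_stats(counts, row_bytes=256,
                                                fast_budget_bytes=64 * 2**20)
    print(f"{len(hot)} hot rows cover {hit_rate:.1%} of lookups")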
A comprehensive survey on trustworthy recommender systems
As one of the most successful AI-powered applications, recommender systems aim to help
people make appropriate decisions in an effective and efficient way, by providing …
DreamShard: Generalizable embedding table placement for recommender systems
We study embedding table placement for distributed recommender systems, which aims to
partition and place the tables on multiple hardware devices (e.g., GPUs) to balance the …
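As a rough intuition for the placement problem described above, the sketch below assigns whole tables to the least-loaded device with a greedy largest-cost-first heuristic; the per-table costs are hypothetical and the heuristic is only a common balancing baseline, not DreamShard's learned approach.

    # Baseline sketch of embedding table placement: put each table (largest
    # cost first) on the device with the smallest accumulated cost so far.
    import heapq

    def greedy_placement(table_costs, n_devices):
        heap = [(0.0, d) for d in range(n_devices)]    # (current load, device id)
        heapq.heapify(heap)
        placement = {}
        for table, cost in sorted(table_costs.items(), key=lambda kv: -kv[1]):
            load, dev = heapq.heappop(heap)
            placement[table] = dev
            heapq.heappush(heap, (load + cost, dev))
        return placement

    # Hypothetical per-table costs (e.g., lookup + communication time in ms).
    costs = {"emb_user": 4.0, "emb_item": 3.5, "emb_ad": 2.0,
             "emb_geo": 0.7, "emb_page": 1.2, "emb_device": 0.4}
    print(greedy_placement(costs, n_devices=2))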
HET: scaling out huge embedding model training via cache-enabled distributed framework
Embedding models have been an effective learning paradigm for high-dimensional data.
However, one open issue of embedding models is that their representations (latent factors) …