A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Recent advances on neural network pruning at initialization

H Wang, C Qin, Y Bai, Y Zhang, Y Fu - arXiv preprint arXiv:2103.06460, 2021 - arxiv.org
Neural network pruning typically removes connections or neurons from a pretrained,
converged model, whereas a new pruning paradigm, pruning at initialization (PaI), attempts to …

Reproducible scaling laws for contrastive language-image learning

M Cherti, R Beaumont, R Wightman… - Proceedings of the …, 2023 - openaccess.thecvf.com
Scaling up neural networks has led to remarkable performance across a wide range of
tasks. Moreover, performance often follows reliable scaling laws as a function of training set …

Model sparsity can simplify machine unlearning

J Jia, J Liu, P Ram, Y Yao, G Liu, Y Liu… - Advances in …, 2023 - proceedings.neurips.cc
In response to recent data regulation requirements, machine unlearning (MU) has emerged
as a critical process to remove the influence of specific examples from a given model …

Pre-trained image processing transformer

H Chen, Y Wang, T Guo, C Xu… - Proceedings of the …, 2021 - openaccess.thecvf.com
As the computing power of modern hardware increases rapidly, pre-trained deep
learning models (e.g., BERT, GPT-3) learned on large-scale datasets have shown their …

Chasing sparsity in vision transformers: An end-to-end exploration

T Chen, Y Cheng, Z Gan, L Yuan… - Advances in Neural …, 2021 - proceedings.neurips.cc
Vision transformers (ViTs) have recently received explosive popularity, but their enormous
model sizes and training costs remain daunting. Conventional post-training pruning often …

A unified lottery ticket hypothesis for graph neural networks

T Chen, Y Sui, X Chen, A Zhang… - … conference on machine …, 2021 - proceedings.mlr.press
With graphs rapidly growing in size and deeper graph neural networks (GNNs) emerging,
the training and inference of GNNs become increasingly expensive. Existing network weight …

Sparse training via boosting pruning plasticity with neuroregeneration

S Liu, T Chen, X Chen, Z Atashgahi… - Advances in …, 2021 - proceedings.neurips.cc
Works on the lottery ticket hypothesis (LTH) and single-shot network pruning (SNIP) have
recently drawn much attention to post-training pruning (iterative magnitude pruning) and before …

Federated dynamic sparse training: Computing less, communicating less, yet learning better

S Bibikar, H Vikalo, Z Wang, X Chen - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Federated learning (FL) enables distribution of machine learning workloads from the cloud
to resource-limited edge devices. Unfortunately, current deep networks remain not only too …

Advancing model pruning via bi-level optimization

Y Zhang, Y Yao, P Ram, P Zhao… - Advances in …, 2022 - proceedings.neurips.cc
The deployment constraints in practical applications necessitate the pruning of large-scale
deep learning models, i.e., promoting their weight sparsity. As illustrated by the Lottery Ticket …