A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Recent advances on neural network pruning at initialization

H Wang, C Qin, Y Bai, Y Zhang, Y Fu - arXiv preprint arXiv:2103.06460, 2021 - arxiv.org
Neural network pruning typically removes connections or neurons from a pretrained,
converged model, while a new pruning paradigm, pruning at initialization (PaI), attempts to …

Exploring lottery ticket hypothesis in spiking neural networks

Y Kim, Y Li, H Park, Y Venkatesha, R Yin… - European Conference on …, 2022 - Springer
Spiking Neural Networks (SNNs) have recently emerged as a new generation of
low-power deep neural networks, which are suitable for implementation on low-power …

Searching lottery tickets in graph neural networks: A dual perspective

K Wang, Y Liang, P Wang, X Wang, P Gu… - The Eleventh …, 2022 - openreview.net
Graph Neural Networks (GNNs) have shown great promise in various graph learning tasks.
However, the computational overheads of fitting GNNs to large-scale graphs grow rapidly …

The snowflake hypothesis: Training deep GNN with one node one receptive field

K Wang, G Li, S Wang, G Zhang, K Wang, Y You… - arXiv preprint arXiv …, 2023 - arxiv.org
Although Graph Neural Networks have demonstrated considerable promise in graph
representation learning tasks, GNNs predominantly face significant issues with over-fitting …

Parameter-efficient masking networks

Y Bai, H Wang, X Ma, Y Zhang… - Advances in Neural …, 2022 - proceedings.neurips.cc
A deeper network structure generally captures more complex non-linearity and performs
more competitively. Nowadays, advanced network designs often contain a large number of …

Dimensionality reduced training by pruning and freezing parts of a deep neural network: a survey

P Wimmer, J Mehnert, AP Condurache - Artificial Intelligence Review, 2023 - Springer
State-of-the-art deep learning models have a parameter count that reaches into the billions.
Training, storing, and transferring such models is energy- and time-consuming, and thus costly. A …

Enhanced sparsification via stimulative training

S Tang, W Lin, H Ye, P Ye, C Yu, B Li… - European Conference on …, 2024 - Springer
Sparsification-based pruning has been an important category in model compression.
Existing methods commonly set sparsity-inducing penalty terms to suppress the importance …

Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning

W Huang, J Liang, Z Shi, D Zhu, G Wan, H Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) have demonstrated strong generalization
capabilities across diverse distributions and tasks, largely due to extensive pre-training …

Distributionally robust ensemble of lottery tickets towards calibrated sparse network training

H Sapkota, D Wang, Z Tao… - Advances in Neural …, 2024 - proceedings.neurips.cc
Recently developed sparse network training methods, such as the Lottery Ticket Hypothesis
(LTH) and its variants, have shown impressive learning capacity by finding sparse sub …