Neural architecture search: Insights from 1000 papers

C White, M Safari, R Sukthanker, B Ru, T Elsken… - arXiv preprint arXiv …, 2023 - arxiv.org
In the past decade, advances in deep learning have resulted in breakthroughs in a variety of
areas, including computer vision, natural language understanding, speech recognition, and …

Neural architecture search for transformers: A survey

KT Chitty-Venkata, M Emani, V Vishwanath… - IEEE …, 2022 - ieeexplore.ieee.org
Transformer-based Deep Neural Network architectures have gained tremendous interest
due to their effectiveness in various applications across Natural Language Processing (NLP) …

Evolutionary neural architecture search for transformer in knowledge tracing

S Yang, X Yu, Y Tian, X Yan, H Ma… - Advances in Neural …, 2023 - proceedings.neurips.cc
Knowledge tracing (KT) aims to trace students' knowledge states by predicting
whether students answer correctly on exercises. Despite the excellent performance of …
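
The snippet defines the knowledge-tracing task: predict whether a student answers the next exercise correctly from their interaction history. As a toy illustration of that prediction setup (a DKT-style recurrent baseline, not the evolved transformer the paper searches for), one might write:

import torch
import torch.nn as nn

class TinyKTModel(nn.Module):
    # Toy knowledge-tracing predictor: encode the (exercise id, correctness)
    # history with an LSTM and predict P(correct) for every exercise.
    def __init__(self, num_exercises, hidden_dim=64):
        super().__init__()
        self.embed = nn.Embedding(2 * num_exercises, hidden_dim)  # one token per (exercise, correct) pair
        self.rnn = nn.LSTM(hidden_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, num_exercises)

    def forward(self, exercise_ids, correct):
        tokens = exercise_ids * 2 + correct        # fold correctness into the token id
        h, _ = self.rnn(self.embed(tokens))
        return torch.sigmoid(self.out(h))          # (batch, time, num_exercises) probabilities

model = TinyKTModel(num_exercises=100)
ex = torch.randint(0, 100, (2, 20))                # 2 students, 20 past interactions
ok = torch.randint(0, 2, (2, 20))
probs = model(ex, ok)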

ZiCo: Zero-shot NAS via inverse coefficient of variation on gradients

G Li, Y Yang, K Bhardwaj, R Marculescu - arXiv preprint arXiv:2301.11300, 2023 - arxiv.org
Neural Architecture Search (NAS) is widely used to automatically obtain the neural network
with the best performance among a large number of candidate architectures. To reduce the …
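
The snippet cuts off before the method itself, but the title points to a zero-cost proxy built from gradient statistics. The sketch below only illustrates that general idea in PyTorch; the exact ZiCo score and its normalization are not given in the snippet, so the scoring formula here (per-layer log of summed |mean|/std of gradients over a few minibatches) is an assumption:

import torch

def gradient_statistics_score(model, loss_fn, batches):
    # Accumulate per-parameter gradients over a few minibatches (use at least two),
    # then score each parameter group by |mean| / std of its gradients and sum the logs.
    grads = {name: [] for name, p in model.named_parameters() if p.requires_grad}
    for inputs, targets in batches:
        model.zero_grad()
        loss_fn(model(inputs), targets).backward()
        for name, p in model.named_parameters():
            if p.grad is not None:
                grads[name].append(p.grad.detach().flatten().clone())
    score = 0.0
    for g in grads.values():
        if len(g) < 2:
            continue
        g = torch.stack(g)                          # (num_batches, num_params)
        ratio = g.mean(dim=0).abs() / (g.std(dim=0) + 1e-8)
        score += torch.log(ratio.sum() + 1e-8).item()
    return score                                    # higher is assumed to rank architectures better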

ElasticViT: Conflict-aware supernet training for deploying fast vision transformer on diverse mobile devices

C Tang, LL Zhang, H Jiang, J Xu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Neural Architecture Search (NAS) has shown promising performance in the
automatic design of vision transformers (ViT) exceeding 1G FLOPs. However, designing …

Boosting Order-Preserving and Transferability for Neural Architecture Search: a Joint Architecture Refined Search and Fine-tuning Approach

B Zhang, X Wang, X Qin, J Yan - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Supernet is a core component in many recent Neural Architecture Search (NAS) methods. It
not only helps embody the search space but also provides a (relative) estimation of the final …
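
Since the snippet frames the supernet as a relative estimator, what matters is whether it preserves the ranking of candidate architectures. A hypothetical way to quantify that (the numbers below are made up for illustration) is the rank correlation between accuracies estimated with inherited supernet weights and accuracies from stand-alone training:

from scipy.stats import kendalltau

supernet_estimates  = [0.62, 0.58, 0.71, 0.65, 0.60]   # accuracy with inherited supernet weights
standalone_accuracy = [0.74, 0.70, 0.80, 0.76, 0.71]   # accuracy after training from scratch
tau, _ = kendalltau(supernet_estimates, standalone_accuracy)
print(f"Kendall tau = {tau:.2f}")                       # closer to 1.0 means better order preservation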

EMT-NAS: Transferring architectural knowledge between tasks from different datasets

P Liao, Y Jin, W Du - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
The success of multi-task learning (MTL) can largely be attributed to the shared
representation of related tasks, allowing the models to better generalise. In deep learning …
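
The premise cited in the snippet, a representation shared across related tasks, is easiest to see in a hard-parameter-sharing model: one trunk, one small head per task. The sketch below illustrates only that premise, not EMT-NAS's cross-dataset architecture transfer:

import torch
import torch.nn as nn

class SharedTrunkMTL(nn.Module):
    # Minimal hard parameter sharing: a shared trunk produces one representation,
    # and each task reads it through its own head.
    def __init__(self, in_dim, hidden_dim, task_out_dims):
        super().__init__()
        self.trunk = nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
        self.heads = nn.ModuleList([nn.Linear(hidden_dim, d) for d in task_out_dims])

    def forward(self, x):
        h = self.trunk(x)                            # shared representation
        return [head(h) for head in self.heads]      # one prediction per task

model = SharedTrunkMTL(in_dim=32, hidden_dim=64, task_out_dims=[10, 5])
outputs = model(torch.randn(4, 32))                  # two task outputs: (4, 10) and (4, 5)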

PA&DA: Jointly sampling path and data for consistent NAS

S Lu, Y Hu, L Yang, Z Sun, J Mei… - Proceedings of the …, 2023 - openaccess.thecvf.com
Based on the weight-sharing mechanism, one-shot NAS methods train a supernet and then
inherit the pre-trained weights to evaluate sub-models, largely reducing the search cost …
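
The weight-sharing mechanism the snippet describes can be reduced to a single layer: every sub-model reads a slice of one shared weight matrix, so candidates are evaluated with inherited weights instead of being retrained. A minimal sketch (a toy slimmable layer, not the paper's supernet or its path/data sampling):

import torch
import torch.nn as nn
import torch.nn.functional as F

class SlimmableLinear(nn.Module):
    # One shared weight matrix; a sampled sub-model picks how many output
    # units (a slice of the supernet weights) it uses.
    def __init__(self, in_features, max_out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(max_out_features, in_features) * 0.01)
        self.bias = nn.Parameter(torch.zeros(max_out_features))

    def forward(self, x, out_features):
        return F.linear(x, self.weight[:out_features], self.bias[:out_features])

layer = SlimmableLinear(in_features=16, max_out_features=64)
x = torch.randn(8, 16)
for width in (16, 32, 64):                           # three candidate sub-models share the same weights
    print(width, layer(x, width).shape)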

FemtoDet: An object detection baseline for energy versus performance tradeoffs

P Tu, X Xie, G Ai, Y Li, Y Huang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Efficient detectors for edge devices are often optimized for parameter count or speed metrics,
which correlate only weakly with the detectors' energy consumption. However, some …

SKDBERT: compressing BERT via stochastic knowledge distillation

Z Ding, G Jiang, S Zhang, L Guo, W Lin - Proceedings of the AAAI …, 2023 - ojs.aaai.org
In this paper, we propose Stochastic Knowledge Distillation (SKD) to obtain a compact BERT-
style language model dubbed SKDBERT. In each distillation iteration, SKD samples a …
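
The snippet is truncated right after "SKD samples a …"; assuming it samples one teacher per distillation iteration from an ensemble (the sampling distribution below is an assumption), a single distillation step could look like this:

import torch
import torch.nn.functional as F

def skd_step(student, teachers, probs, batch, temperature=2.0):
    # Sample one teacher for this iteration, then match the student's softened
    # logits to the sampled teacher's with a KL-divergence loss.
    idx = torch.multinomial(torch.tensor(probs), 1).item()
    with torch.no_grad():
        t_logits = teachers[idx](batch)
    s_logits = student(batch)
    return F.kl_div(
        F.log_softmax(s_logits / temperature, dim=-1),
        F.softmax(t_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2

# Toy usage with linear layers standing in for BERT-style teachers and student.
student = torch.nn.Linear(8, 4)
teachers = [torch.nn.Linear(8, 4) for _ in range(3)]
loss = skd_step(student, teachers, probs=[0.2, 0.3, 0.5], batch=torch.randn(16, 8))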