Neural machine translation: A review

F Stahlberg - Journal of Artificial Intelligence Research, 2020 - jair.org
The field of machine translation (MT), the automatic translation of written text from one
natural language into another, has experienced a major paradigm shift in recent years …

An overview of neural network compression

JO Neill - arXiv preprint arXiv:2006.03669, 2020 - arxiv.org
Overparameterized networks trained to convergence have shown impressive performance
in domains such as computer vision and natural language processing. Pushing state of the …

Knowledge distillation: A survey

J Gou, B Yu, SJ Maybank, D Tao - International Journal of Computer Vision, 2021 - Springer
In recent years, deep neural networks have been successful in both industry and academia,
especially for computer vision tasks. The great success of deep learning is mainly due to its …

Towards understanding ensemble, knowledge distillation and self-distillation in deep learning

Z Allen-Zhu, Y Li - arXiv preprint arXiv:2012.09816, 2020 - arxiv.org
We formally study how an ensemble of deep learning models can improve test accuracy, and
how the superior performance of the ensemble can be distilled into a single model using …

Multilingual neural machine translation with knowledge distillation

X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu - arXiv preprint arXiv …, 2019 - arxiv.org
Multilingual machine translation, which translates multiple languages with a single model,
has attracted much attention due to its efficiency in offline training and online serving …

ALP-KD: Attention-based layer projection for knowledge distillation

P Passban, Y Wu, M Rezagholizadeh… - Proceedings of the AAAI …, 2021 - ojs.aaai.org
Knowledge distillation is considered a training and compression strategy in
which two neural networks, namely a teacher and a student, are coupled together during …
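The teacher–student coupling mentioned in this snippet usually refers to the standard distillation objective: the student is trained to match the teacher's temperature-softened output distribution. As a minimal illustration only (none of these abstracts give the exact formulation; the KL-divergence form with Hinton-style T² scaling and the function names below are assumptions), the core loss can be sketched in plain Python:

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax: higher T yields a softer distribution.
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    # KL divergence between the teacher's and student's softened
    # distributions, scaled by T^2 so gradients keep comparable
    # magnitude across temperatures (a common convention).
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q)) * temperature ** 2
```

In practice this term is combined with the ordinary cross-entropy against gold labels; layer-projection methods such as ALP-KD add further losses matching intermediate representations, which this sketch does not cover.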

Compression of deep learning models for text: A survey

M Gupta, P Agrawal - ACM Transactions on Knowledge Discovery from …, 2022 - dl.acm.org
In recent years, the fields of natural language processing (NLP) and information retrieval (IR)
have made tremendous progress thanks to deep learning models like Recurrent Neural …

Domain adaptation and multi-domain adaptation for neural machine translation: A survey

D Saunders - Journal of Artificial Intelligence Research, 2022 - jair.org
The development of deep learning techniques has allowed Neural Machine Translation
(NMT) models to become extremely powerful, given sufficient training data and training time …

End-to-end speech translation with knowledge distillation

Y Liu, H Xiong, Z He, J Zhang, H Wu, H Wang… - arXiv preprint arXiv …, 2019 - arxiv.org
End-to-end speech translation (ST), which directly translates from source language speech
into target language text, has attracted intensive attention in recent years. Compared to …

Data diversification: A simple strategy for neural machine translation

XP Nguyen, S Joty, K Wu… - Advances in Neural …, 2020 - proceedings.neurips.cc
We introduce Data Diversification: a simple but effective strategy to boost neural
machine translation (NMT) performance. It diversifies the training data by using the …