Neural machine translation: A review of methods, resources, and tools

Z Tan, S Wang, Z Yang, G Chen, X Huang, M Sun… - AI Open, 2020 - Elsevier
Abstract Machine translation (MT) is an important sub-field of natural language processing
that aims to translate natural languages using computers. In recent years, end-to-end neural …

Are sixteen heads really better than one?

P Michel, O Levy, G Neubig - Advances in neural …, 2019 - proceedings.neurips.cc
Multi-headed attention is a driving force behind recent state-of-the-art NLP models. By
applying multiple attention mechanisms in parallel, it can express sophisticated functions …
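
A minimal NumPy sketch of the mechanism at stake here: multi-head attention with a per-head binary mask, so that zeroing an entry of `head_mask` ablates a head in the spirit of the paper's pruning experiments. The mask is an illustrative device of ours, not the paper's code.

```python
# Multi-head attention with an optional per-head ablation mask (a sketch).
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(x, Wq, Wk, Wv, Wo, n_heads, head_mask=None):
    """x: (seq, d_model); W*: (d_model, d_model); head_mask: (n_heads,) of 0/1."""
    seq, d_model = x.shape
    d_head = d_model // n_heads
    # Project, then split into heads: (n_heads, seq, d_head)
    q = (x @ Wq).reshape(seq, n_heads, d_head).transpose(1, 0, 2)
    k = (x @ Wk).reshape(seq, n_heads, d_head).transpose(1, 0, 2)
    v = (x @ Wv).reshape(seq, n_heads, d_head).transpose(1, 0, 2)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)  # (n_heads, seq, seq)
    out = softmax(scores) @ v                            # (n_heads, seq, d_head)
    if head_mask is not None:                            # zero a head to ablate it
        out = out * head_mask[:, None, None]
    out = out.transpose(1, 0, 2).reshape(seq, d_model)   # concatenate heads
    return out @ Wo
```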

Mask-predict: Parallel decoding of conditional masked language models

M Ghazvininejad, O Levy, Y Liu… - arXiv preprint arXiv …, 2019 - arxiv.org
Most machine translation systems generate text autoregressively from left to right. We,
instead, use a masked language modeling objective to train a model to predict any subset of …
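
A hedged sketch of the decoding loop this abstract describes, assuming a hypothetical `model(tokens)` that returns per-position predicted ids and confidences for a conditional masked LM, and a linear mask-decay schedule of the kind the paper uses.

```python
# Iterative mask-predict decoding (a sketch; `model` is hypothetical).
import numpy as np

def mask_predict(model, length, mask_id, iterations=10):
    tokens = np.full(length, mask_id)                # start fully masked
    probs = np.zeros(length)
    for t in range(1, iterations + 1):
        pred_ids, pred_probs = model(tokens)         # predict all positions in parallel
        masked = tokens == mask_id
        tokens[masked] = pred_ids[masked]            # fill the masked slots
        probs[masked] = pred_probs[masked]
        n_mask = int(length * (1 - t / iterations))  # linear mask-decay schedule
        if n_mask == 0:
            break
        lowest = np.argsort(probs)[:n_mask]          # re-mask least confident tokens
        tokens[lowest] = mask_id
    return tokens
```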

Non-autoregressive machine translation with latent alignments

C Saharia, W Chan, S Saxena, M Norouzi - arXiv preprint arXiv …, 2020 - arxiv.org
This paper presents two strong methods, CTC and Imputer, for non-autoregressive machine
translation that model latent alignments with dynamic programming. We revisit CTC for …
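
For context, the CTC collapse function that makes these latent alignments tractable: adjacent repeats merge, then blanks drop, so many alignments map to one output string. This is the standard definition, sketched with `None` as the blank symbol.

```python
# The CTC collapse function: merge adjacent repeats, then remove blanks.
from itertools import groupby

def ctc_collapse(alignment, blank=None):
    """Map an alignment over {tokens, blank} to its output sequence."""
    merged = [tok for tok, _ in groupby(alignment)]  # merge adjacent repeats
    return [tok for tok in merged if tok != blank]   # drop blank symbols

# e.g. ctc_collapse(['a', 'a', None, 'b', 'b', None, 'b']) == ['a', 'b', 'b']
```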

Beyond BLEU: training neural machine translation with semantic similarity

J Wieting, T Berg-Kirkpatrick, K Gimpel… - arXiv preprint arXiv …, 2019 - arxiv.org
While most neural machine translation (NMT) systems are still trained using maximum
likelihood estimation, recent work has demonstrated that optimizing systems to directly …
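
A PyTorch sketch of the general recipe, not this paper's exact objective: minimum-risk-style training in which sampled translations are weighted by a semantic-similarity reward rather than scored by likelihood alone. `sample` and `sim` are hypothetical helpers standing in for a real sampler and similarity model.

```python
# Risk-based training with a semantic-similarity reward (a sketch).
import torch

def similarity_risk_loss(model, src, ref, n_samples=8):
    # `sample` (hypothetical) draws candidates with their log-probabilities.
    hyps, log_probs = sample(model, src, n=n_samples)
    # `sim` (hypothetical) scores semantic similarity to the reference.
    rewards = torch.tensor([sim(h, ref) for h in hyps])
    weights = torch.softmax(log_probs, dim=0)   # renormalize over the samples
    return -(weights * rewards).sum()           # negative expected reward
```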

Unsupervised multimodal machine translation for low-resource distant language pairs

T Tayir, L Li - ACM Transactions on Asian and Low-Resource …, 2024 - dl.acm.org
Unsupervised machine translation (UMT) has recently attracted more attention from
researchers, enabling models to translate when languages lack parallel corpora. However …
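
Most unsupervised MT systems rest on iterative back-translation; a minimal sketch follows, with hypothetical `translate` and `train_step` helpers standing in for a real training loop, and two monolingual corpora in place of parallel data.

```python
# One round of iterative back-translation for unsupervised MT (a sketch).
def backtranslation_round(model_s2t, model_t2s, mono_src, mono_tgt):
    # Synthesize pseudo-parallel data by translating monolingual text.
    pseudo_tgt = [translate(model_s2t, s) for s in mono_src]
    pseudo_src = [translate(model_t2s, t) for t in mono_tgt]
    # Train each direction on the other direction's synthetic output.
    train_step(model_s2t, inputs=pseudo_src, targets=mono_tgt)
    train_step(model_t2s, inputs=pseudo_tgt, targets=mono_src)
```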

Very deep transformers for neural machine translation

X Liu, K Duh, L Liu, J Gao - arXiv preprint arXiv:2008.07772, 2020 - arxiv.org
We explore the application of very deep Transformer models for Neural Machine Translation
(NMT). Using a simple yet effective initialization technique that stabilizes training, we show …
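
The paper's stabilizer is the ADMIN initialization; as a generic illustration of the same idea (damping residual updates as the stack gets deeper), not the paper's exact recipe, here is a depth-scaled residual block in PyTorch.

```python
# A residual block whose update is shrunk by 1/sqrt(2N) for an N-layer stack
# (a generic depth-scaling illustration, not the ADMIN procedure itself).
import torch
import torch.nn as nn

class ScaledResidualBlock(nn.Module):
    def __init__(self, d_model, n_layers):
        super().__init__()
        self.ff = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.ReLU(),
                                nn.Linear(4 * d_model, d_model))
        self.norm = nn.LayerNorm(d_model)
        self.scale = (2 * n_layers) ** -0.5  # smaller updates as depth grows

    def forward(self, x):
        return self.norm(x + self.scale * self.ff(x))
```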

Aligned cross entropy for non-autoregressive machine translation

M Ghazvininejad, V Karpukhin… - International …, 2020 - proceedings.mlr.press
Non-autoregressive machine translation models significantly speed up decoding by
allowing for parallel prediction of the entire target sequence. However, modeling word order …
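
A simplified sketch of the alignment idea (the published AXE recurrence differs in its details): a monotonic dynamic program pays, for each target token, the cross entropy of the cheapest compatible prediction position, with a fixed penalty for predictions left unaligned.

```python
# Monotonic-alignment DP over a per-position negative-log-likelihood matrix.
import numpy as np

def aligned_xent(nll, skip=4.0):
    """nll: (n_preds, m_targets) matrix of -log P_i(y_j); requires n >= m."""
    n, m = nll.shape
    dp = np.full((n + 1, m + 1), np.inf)
    dp[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(m + 1):
            # Option 1: prediction i is left unaligned (pay the skip penalty).
            dp[i, j] = dp[i - 1, j] + skip
            # Option 2: align prediction i to target j.
            if j > 0:
                dp[i, j] = min(dp[i, j], dp[i - 1, j - 1] + nll[i - 1, j - 1])
    return dp[n, m]
```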

Fixed encoder self-attention patterns in transformer-based machine translation

A Raganato, Y Scherrer, J Tiedemann - arXiv preprint arXiv:2002.10260, 2020 - arxiv.org
Transformer-based models have brought a radical change to neural machine translation. A
key feature of the Transformer architecture is the so-called multi-head attention mechanism …
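
A minimal sketch of what a fixed (non-learned) attention head looks like in this line of work: a constant matrix that sends each position's attention to a fixed relative offset, such as the previous, current, or next token.

```python
# Fixed positional attention patterns replacing learned heads (a sketch).
import numpy as np

def fixed_pattern(seq_len, offset):
    """Attention matrix where position i attends to position i + offset."""
    A = np.zeros((seq_len, seq_len))
    for i in range(seq_len):
        j = min(max(i + offset, 0), seq_len - 1)  # clamp at the boundaries
        A[i, j] = 1.0
    return A

def fixed_heads_attention(values, offsets=(-1, 0, 1)):
    """values: (seq, d); one output per fixed head, concatenated."""
    return np.concatenate(
        [fixed_pattern(len(values), o) @ values for o in offsets], axis=-1)
```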

Dinoiser: Diffused conditional sequence learning by manipulating noises

J Ye, Z Zheng, Y Bao, L Qian, M Wang - arXiv preprint arXiv:2302.10025, 2023 - arxiv.org
While diffusion models have achieved great success in generating continuous signals such
as images and audio, it remains elusive for diffusion models in learning discrete sequence …
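
For orientation, the standard continuous forward-noising step that sequence diffusion models apply to token embeddings; the noise-scale schedule `alpha_bar`, which this line of work manipulates, is left abstract here as an assumed input.

```python
# DDPM-style forward corruption of token embeddings (a sketch).
import torch

def forward_diffuse(emb, t, alpha_bar):
    """emb: (seq, d) token embeddings; alpha_bar: (T,) cumulative schedule."""
    noise = torch.randn_like(emb)
    a = alpha_bar[t]
    # x_t = sqrt(a) * x_0 + sqrt(1 - a) * eps
    return a.sqrt() * emb + (1 - a).sqrt() * noise, noise
```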