A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Continual learning for recurrent neural networks: an empirical evaluation

A Cossu, A Carta, V Lomonaco, D Bacciu - Neural Networks, 2021 - Elsevier
Learning continuously throughout a model's lifetime is fundamental to deploying machine learning
solutions that are robust to drifts in the data distribution. Advances in Continual Learning (CL) with …

Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks

T Hoefler, D Alistarh, T Ben-Nun, N Dryden… - Journal of Machine Learning Research, 2021 - jmlr.org
The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …

Analyzing multi-head self-attention: Specialized heads do the heavy lifting, the rest can be pruned

E Voita, D Talbot, F Moiseev, R Sennrich… - arXiv preprint arXiv …, 2019 - arxiv.org
Multi-head self-attention is a key component of the Transformer, a state-of-the-art
architecture for neural machine translation. In this work we evaluate the contribution made …

Mind the GAP: A balanced corpus of gendered ambiguous pronouns

K Webster, M Recasens, V Axelrod… - Transactions of the Association for Computational Linguistics, 2018 - direct.mit.edu
Coreference resolution is an important task for natural language understanding, and the
resolution of ambiguous pronouns is a longstanding challenge. Nonetheless, existing corpora …

The FLORES evaluation datasets for low-resource machine translation: Nepali-English and Sinhala-English

F Guzmán, PJ Chen, M Ott, J Pino, G Lample… - arXiv preprint arXiv …, 2019 - arxiv.org
For machine translation, the vast majority of language pairs in the world are considered low-
resource because they have little parallel data available. Besides the technical challenges …

Cross-lingual transfer learning for multilingual task oriented dialog

S Schuster, S Gupta, R Shah, M Lewis - arXiv preprint arXiv:1810.13327, 2018 - arxiv.org
One of the first steps in the utterance interpretation pipeline of many task-oriented
conversational AI systems is to identify user intents and the corresponding slots. Since data …

Context-aware neural machine translation learns anaphora resolution

E Voita, P Serdyukov, R Sennrich, I Titov - arXiv preprint arXiv:1805.10163, 2018 - arxiv.org
Standard machine translation systems process sentences in isolation and hence ignore
extra-sentential information, even though extended context can both prevent mistakes in …

When a good translation is wrong in context: Context-aware machine translation improves on deixis, ellipsis, and lexical cohesion

E Voita, R Sennrich, I Titov - arXiv preprint arXiv:1905.05979, 2019 - arxiv.org
Though machine translation errors caused by the lack of context beyond one sentence have
long been acknowledged, the development of context-aware NMT systems is hampered by …

Self-training improves pre-training for natural language understanding

J Du, E Grave, B Gunel, V Chaudhary, O Celebi… - arXiv preprint arXiv …, 2020 - arxiv.org
Unsupervised pre-training has led to much recent progress in natural language
understanding. In this paper, we study self-training as another way to leverage unlabeled …