A survey of mix-based data augmentation: Taxonomy, methods, applications, and explainability

C Cao, F Zhou, Y Dai, J Wang, K Zhang - ACM Computing Surveys, 2024 - dl.acm.org
Data augmentation (DA) is indispensable in modern machine learning and deep neural
networks. The basic idea of DA is to construct new training data to improve the model's …
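The "mix" family this survey covers centres on mixup-style interpolation of training pairs. As a minimal illustrative sketch (not the survey's own code), the original mixup rule can be written as follows; the Beta parameter alpha=0.2 is an assumed, commonly used default:

```python
import numpy as np
import torch

def mixup(x, y, alpha=0.2):
    """Mix a batch with a shuffled copy of itself (mixup).

    x: (B, ...) inputs; y: (B, C) one-hot labels.
    alpha=0.2 is an assumed, commonly used default.
    """
    lam = float(np.random.beta(alpha, alpha))  # mixing coefficient from Beta(alpha, alpha)
    idx = torch.randperm(x.size(0))            # random pairing within the batch
    x_mixed = lam * x + (1.0 - lam) * x[idx]   # interpolate inputs
    y_mixed = lam * y + (1.0 - lam) * y[idx]   # interpolate labels the same way
    return x_mixed, y_mixed
```

Training on (x_mixed, y_mixed) instead of (x, y) is the basic recipe; the surveyed methods vary in what gets mixed (inputs, hidden states, or labels) and how.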

LauraGPT: Listen, attend, understand, and regenerate audio with GPT

Z Du, J Wang, Q Chen, Y Chu, Z Gao, Z Li, K Hu… - arXiv preprint arXiv …, 2023 - arxiv.org
Generative Pre-trained Transformer (GPT) models have achieved remarkable performance
on various natural language processing tasks, and have shown great potential as …

On compositional generalization of transformer-based neural machine translation

Y Yin, L Fu, Y Li, Y Zhang - Information Fusion, 2024 - Elsevier
Neural networks have been shown to be deficient in compositional generalization, and existing work has generally targeted semantic parsing tasks. In this …

On the complementarity between pre-training and random-initialization for resource-rich machine translation

C Zan, L Ding, L Shen, Y Cao, W Liu, D Tao - arXiv preprint arXiv …, 2022 - arxiv.org
Pre-Training (PT) of text representations has been successfully applied to low-resource
Neural Machine Translation (NMT). However, it usually fails to achieve notable gains …

Causal document-grounded dialogue pre-training

Y Zhao, B Yu, H Yu, B Li, J Li, C Wang, F Huang… - arXiv preprint arXiv …, 2023 - arxiv.org
The goal of document-grounded dialogue (DocGD) is to generate a response by grounding
the evidence in a supporting document in accordance with the dialogue context. This …

EMMA-X: an EM-like multilingual pre-training algorithm for cross-lingual representation learning

P Guo, X Wei, Y Hu, B Yang, D Liu… - Advances in Neural …, 2023 - proceedings.neurips.cc
Expressing universal semantics common to all languages is helpful to understand the
meanings of complex and culture-specific sentences. The research theme underlying this …

LAE-ST-MoE: Boosted language-aware encoder using speech translation auxiliary task for E2E code-switching ASR

G Ma, W Wang, Y Li, Y Yang, B Du… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
Recently, to mitigate the confusion between different languages in code-switching (CS) automatic speech recognition (ASR), conditionally factorized models, such as the …

Bridging the Gap between Decision and Logits in Decision-based Knowledge Distillation for Pre-trained Language Models

Q Zhou, Z Yang, P Li, Y Liu - arXiv preprint arXiv:2306.08909, 2023 - arxiv.org
Conventional knowledge distillation (KD) methods require access to the internal information of teachers, e.g., logits. However, such information may not always be accessible for large pre …
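The snippet contrasts with conventional, logit-based KD, which is exactly what becomes unavailable when only the teacher's final decision can be observed. Below is a minimal sketch of that conventional baseline (Hinton-style softened-logit matching), not the decision-based method of the paper itself; the temperature T and weight alpha are illustrative assumptions:

```python
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.5):
    """Conventional logit-based KD loss.

    KL divergence between temperature-softened teacher and student
    distributions, mixed with cross-entropy on the gold labels.
    T=4.0 and alpha=0.5 are assumed defaults for illustration.
    """
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradient magnitude is roughly temperature-independent
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard
```

Decision-based KD, as targeted by the paper above, must work without teacher_logits, observing only the teacher's predicted class.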

Research on the Development of Data Augmentation Techniques in the Field of Machine Translation

Z Zhipeng, P Aleksey - International Journal of Open Information …, 2023 - cyberleninka.ru
Neural machine translation usually requires a large bilingual parallel corpus for training and easily overfits when the training set is small. Through a large …

Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task

Y Wan, K Bao, D Liu, B Yang, DF Wong… - arXiv preprint arXiv …, 2022 - arxiv.org
In this report, we present our submission to the WMT 2022 Metrics Shared Task. We build
our system based on the core idea of UNITE (Unified Translation Evaluation), which unifies …