Študovňa Google

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com

Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Uložiť Citovať Citované 450-krát Súvisiace články Všetky verzie 8 HTML verzia

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Progress in machine translation

H Wang, H Wu, Z He, L Huang, KW Church - Engineering, 2022 - Elsevier

After more than 70 years of evolution, great achievements have been made in machine
translation. Especially in recent years, translation quality has been greatly improved with the …

Uložiť Citovať Citované 230-krát Súvisiace články Všetky verzie 2

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

End-to-end speech recognition: A survey

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

Uložiť Citovať Citované 182-krát Súvisiace články Všetky verzie 6 Vyhľadávanie knižnice

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Seamless: Multilingual Expressive and Streaming Speech Translation

L Barrault, YA Chung, MC Meglioli, D Dale… - ar**, converting images to a top-down view of the world, as
a translation problem. We show how a novel form of transformer network can be used to …

Uložiť Citovať Citované 165-krát Súvisiace články Všetky verzie 14

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Fairseq S2T: Fast speech-to-text modeling with fairseq

C Wang, Y Tang, X Ma, A Wu, S Popuri… - ar** artificial learning systems that can understand and generate natural language
has been one of the long-standing goals of artificial intelligence. Recent decades have …

Uložiť Citovať Citované 61-krát Súvisiace články Všetky verzie 23 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Monotonic multihead attention

X Ma, J Pino, J Cross, L Puzon, J Gu - arxiv preprint arxiv:1909.12406, 2019 - arxiv.org

Simultaneous machine translation models start generating a target sequence before they
have encoded or read the source sequence. Recent approaches for this task either apply a …

Uložiť Citovať Citované 151-krát Súvisiace články Všetky verzie 5 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Dual-mode ASR: Unify and improve streaming ASR with full-context modeling

J Yu, W Han, A Gulati, CC Chiu, B Li… - arxiv preprint arxiv …, 2020 - arxiv.org

Streaming automatic speech recognition (ASR) aims to emit each hypothesized word as
quickly and accurately as possible, while full-context ASR waits for the completion of a full …

Uložiť Citovať Citované 87-krát Súvisiace články Všetky verzie 8 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

SimulMT to SimulST: Adapting simultaneous text translation to end-to-end simultaneous speech translation

X Ma, J Pino, P Koehn - arxiv preprint arxiv:2011.02048, 2020 - arxiv.org

Simultaneous text translation and end-to-end speech translation have recently made great
progress but little work has combined these tasks together. We investigate how to adapt …

Uložiť Citovať Citované 86-krát Súvisiace články Všetky verzie 3 HTML verzia

Vytvoriť upozornenie

Citovať

Rozšírené vyhľadávanie

Uložené do mojej knižnice

Monotonic infinite lookback attention for simultaneous machine translation

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

[HTML][HTML] Progress in machine translation

End-to-end speech recognition: A survey

Seamless: Multilingual Expressive and Streaming Speech Translation

Fairseq S2T: Fast speech-to-text modeling with fairseq

Monotonic multihead attention

Dual-mode ASR: Unify and improve streaming ASR with full-context modeling

SimulMT to SimulST: Adapting simultaneous text translation to end-to-end simultaneous speech translation