Μελετητής Google

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com

Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 448 Σχετικά άρθρα Όλες οι 8 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Automatic speech recognition: Systematic literature review

S Alharbi, M Alrazgan, A Alrashed, T Alnomasi… - Ieee …, 2021 - ieeexplore.ieee.org

A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 136 Σχετικά άρθρα Όλες οι 4 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

End-to-end speech recognition: A survey

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 181 Σχετικά άρθρα Όλες οι 6 εκδοχές Αναζήτηση βιβλιοθήκης

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Transvg: End-to-end visual grounding with transformers

J Deng, Z Yang, T Chen, W Zhou… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

In this paper, we present a neat yet effective transformer-based framework for visual
grounding, namely TransVG, to address the task of grounding a language query to the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 372 Σχετικά άρθρα Όλες οι 6 εκδοχές Προβολή ως HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

E-branchformer: Branchformer with enhanced merging for speech recognition

K Kim, F Wu, Y Peng, J Pan, P Sridhar… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org

Conformer, combining convolution and self-attention sequentially to capture both local and
global information, has shown remarkable performance and is currently regarded as the …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 117 Σχετικά άρθρα Όλες οι 5 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Develo** real-time streaming transformer transducer for speech recognition on large-scale dataset

X Chen, Y Wu, Z Wang, S Liu… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Recently, Transformer based end-to-end models have achieved great success in many
areas including speech recognition. However, compared to LSTM models, the heavy …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 218 Σχετικά άρθρα Όλες οι 4 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on time-series pre-trained models

Q Ma, Z Liu, Z Zheng, Z Huang, S Zhu… - … on Knowledge and …, 2024 - ieeexplore.ieee.org

Time-Series Mining (TSM) is an important research area since it shows great potential in
practical applications. Deep learning models that rely on massive labeled data have been …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 58 Σχετικά άρθρα Όλες οι 6 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition

Y Shi, Y Wang, C Wu, CF Yeh, J Chan… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

This paper proposes an efficient memory transformer Emformer for low latency streaming
speech recognition. In Emformer, the long-range history context is distilled into an …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 184 Σχετικά άρθρα Όλες οι 4 εκδοχές

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multichannel long-term streaming neural speech enhancement for static and moving speakers

C Quan, X Li - IEEE Signal Processing Letters, 2024 - ieeexplore.ieee.org

In this work, we extend our previously proposed offline SpatialNet for long-term streaming
multichannel speech enhancement in both static and moving speaker scenarios. SpatialNet …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 21 Σχετικά άρθρα Όλες οι 4 εκδοχές

Exploring the integration of IoT and Generative AI in English language education: Smart tools for personalized learning experiences

W Dong, D Pan, S Kim - Journal of Computational Science, 2024 - Elsevier

Abstract English language education is undergoing a transformative shift, propelled by
advancements in technology. This research explores the integration of the Internet of Things …

Αποθήκευση Παράθεση Γίνεται αναφορά σε 13 Σχετικά άρθρα

Δημιουργία ειδοποίησης

Παράθεση

Σύνθετη αναζήτηση

Αποθηκεύτηκε στη Βιβλιοθήκη μου

Streaming automatic speech recognition with the transformer model

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

Automatic speech recognition: Systematic literature review

End-to-end speech recognition: A survey

Transvg: End-to-end visual grounding with transformers

E-branchformer: Branchformer with enhanced merging for speech recognition

Develo** real-time streaming transformer transducer for speech recognition on large-scale dataset

A survey on time-series pre-trained models

Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition

Multichannel long-term streaming neural speech enhancement for static and moving speakers

Exploring the integration of IoT and Generative AI in English language education: Smart tools for personalized learning experiences