Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
[PDF][PDF] Recent advances in end-to-end automatic speech recognition
J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …
Automatic speech recognition: Systematic literature review
A huge amount of research has been done in the field of speech signal processing in recent
years. In particular, there has been increasing interest in the automatic speech recognition …
years. In particular, there has been increasing interest in the automatic speech recognition …
End-to-end speech recognition: A survey
In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …
learning has brought considerable reductions in word error rate of more than 50% relative …
Transvg: End-to-end visual grounding with transformers
In this paper, we present a neat yet effective transformer-based framework for visual
grounding, namely TransVG, to address the task of grounding a language query to the …
grounding, namely TransVG, to address the task of grounding a language query to the …
E-branchformer: Branchformer with enhanced merging for speech recognition
Conformer, combining convolution and self-attention sequentially to capture both local and
global information, has shown remarkable performance and is currently regarded as the …
global information, has shown remarkable performance and is currently regarded as the …
Develo** real-time streaming transformer transducer for speech recognition on large-scale dataset
Recently, Transformer based end-to-end models have achieved great success in many
areas including speech recognition. However, compared to LSTM models, the heavy …
areas including speech recognition. However, compared to LSTM models, the heavy …
A survey on time-series pre-trained models
Time-Series Mining (TSM) is an important research area since it shows great potential in
practical applications. Deep learning models that rely on massive labeled data have been …
practical applications. Deep learning models that rely on massive labeled data have been …
Emformer: Efficient memory transformer based acoustic model for low latency streaming speech recognition
This paper proposes an efficient memory transformer Emformer for low latency streaming
speech recognition. In Emformer, the long-range history context is distilled into an …
speech recognition. In Emformer, the long-range history context is distilled into an …
Multichannel long-term streaming neural speech enhancement for static and moving speakers
In this work, we extend our previously proposed offline SpatialNet for long-term streaming
multichannel speech enhancement in both static and moving speaker scenarios. SpatialNet …
multichannel speech enhancement in both static and moving speaker scenarios. SpatialNet …
Exploring the integration of IoT and Generative AI in English language education: Smart tools for personalized learning experiences
W Dong, D Pan, S Kim - Journal of Computational Science, 2024 - Elsevier
Abstract English language education is undergoing a transformative shift, propelled by
advancements in technology. This research explores the integration of the Internet of Things …
advancements in technology. This research explores the integration of the Internet of Things …