- Academic Search

C Zhang, C Zhang, S Zheng, Y Qiao, C Li… - arxiv preprint arxiv …, 2023 - arxiv.org

As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …

Salva Cita Citato da 206 Articoli correlati Tutte e 4 le versioni Versione HTML

[Free GPT-4]

[PDF] nowpublishers.com

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com

Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Salva Cita Citato da 440 Articoli correlati Tutte e 7 le versioni Versione HTML

[Free GPT-4]

[PDF] springer.com

Machine learning: Algorithms, real-world applications and research directions

IH Sarker - SN computer science, 2021 - Springer

In the current age of the Fourth Industrial Revolution (4 IR or Industry 4.0), the digital world
has a wealth of data, such as Internet of Things (IoT) data, cybersecurity data, mobile data …

Salva Cita Citato da 4486 Articoli correlati Tutte e 13 le versioni

[Free GPT-4]

[HTML] mdpi.com

[HTML][HTML] Recurrent neural networks: A comprehensive review of architectures, variants, and applications

ID Mienye, TG Swart, G Obaido - Information, 2024 - mdpi.com

Recurrent neural networks (RNNs) have significantly advanced the field of machine learning
(ML) by enabling the effective processing of sequential data. This paper provides a …

Salva Cita Citato da 66 Articoli correlati Tutte e 6 le versioni Copia cache

[Free GPT-4]

[PDF] arxiv.org

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arxiv preprint arxiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Salva Cita Citato da 467 Articoli correlati Tutte e 2 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

Viola: Conditional language models for speech recognition, synthesis, and translation

T Wang, L Zhou, Z Zhang, Y Wu, S Liu… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org

Recent research shows a big convergence in model architecture, training objectives, and
inference methods across various tasks for different modalities. In this paper, we propose …

Salva Cita Citato da 102 Articoli correlati Tutte e 2 le versioni

[Free GPT-4]

[PDF] arxiv.org

Conformer: Convolution-augmented transformer for speech recognition

A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang… - arxiv preprint arxiv …, 2020 - arxiv.org

Recently Transformer and Convolution neural network (CNN) based models have shown
promising results in Automatic Speech Recognition (ASR), outperforming Recurrent neural …

Salva Cita Citato da 3541 Articoli correlati Tutte e 12 le versioni Versione HTML

[Free GPT-4]

[PDF] openreview.net

Gshard: Scaling giant models with conditional computation and automatic sharding

D Lepikhin, HJ Lee, Y Xu, D Chen, O Firat… - arxiv preprint arxiv …, 2020 - arxiv.org

Neural network scaling has been critical for improving the model quality in many real-world
machine learning applications with vast amounts of training data and compute. Although this …

Salva Cita Citato da 1073 Articoli correlati Tutte e 7 le versioni Versione HTML

[Free GPT-4]

[PDF] ieee.org

End-to-end speech recognition: A survey

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

Salva Cita Citato da 176 Articoli correlati Tutte e 6 le versioni

[Free GPT-4]

[PDF] aaai.org

Tabnet: Attentive interpretable tabular learning

SÖ Arik, T Pfister - Proceedings of the AAAI conference on artificial …, 2021 - ojs.aaai.org

We propose a novel high-performance and interpretable canonical deep tabular data
learning architecture, TabNet. TabNet uses sequential attention to choose which features to …

Salva Cita Citato da 1581 Articoli correlati Tutte e 9 le versioni Versione HTML

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

State-of-the-art speech recognition with sequence-to-sequence models

A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

Machine learning: Algorithms, real-world applications and research directions

[HTML][HTML] Recurrent neural networks: A comprehensive review of architectures, variants, and applications

A survey on neural speech synthesis

Viola: Conditional language models for speech recognition, synthesis, and translation

Conformer: Convolution-augmented transformer for speech recognition

Gshard: Scaling giant models with conditional computation and automatic sharding

End-to-end speech recognition: A survey

Tabnet: Attentive interpretable tabular learning