Google Академія

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com

Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Зберегти Послатися Цитовано в 448 джерелах Пов’язані статті Кількість версій: 8 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Julia language in machine learning: Algorithms, applications, and open issues

K Gao, G Mei, F Piccialli, S Cuomo, J Tu… - Computer Science Review, 2020 - Elsevier

Abstract Machine learning is driving development across many fields in science and
engineering. A simple and efficient programming language could accelerate applications of …

Зберегти Послатися Цитовано в 106 джерелах Пов’язані статті Кількість версій: 5

[Free GPT-4]
[DeepSeek]

[PDF] 3dvar.com

[PDF][PDF] Scaling autoregressive models for content-rich text-to-image generation

J Yu, Y Xu, JY Koh, T Luong, G Baid, Z Wang… - arxiv preprint arxiv …, 2022 - 3dvar.com

Abstract We present the Pathways [1] Autoregressive Text-to-Image (Parti) model, which
generates high-fidelity photorealistic images and supports content-rich synthesis involving …

Зберегти Послатися Цитовано в 1112 джерелах Пов’язані статті Кількість версій: 5 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

End-to-end speech recognition: A survey

R Prabhavalkar, T Hori, TN Sainath… - … on Audio, Speech …, 2023 - ieeexplore.ieee.org

In the last decade of automatic speech recognition (ASR) research, the introduction of deep
learning has brought considerable reductions in word error rate of more than 50% relative …

Зберегти Послатися Цитовано в 181 джерелах Пов’язані статті Кількість версій: 6 Пошук бібліотеки

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Lamda: Language models for dialog applications

R Thoppilan, D De Freitas, J Hall, N Shazeer… - arxiv preprint arxiv …, 2022 - arxiv.org

We present LaMDA: Language Models for Dialog Applications. LaMDA is a family of
Transformer-based neural language models specialized for dialog, which have up to 137B …

Зберегти Послатися Цитовано в 1665 джерелах Пов’язані статті Кількість версій: 3 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Deepfusion: Lidar-camera deep fusion for multi-modal 3d object detection

Y Li, AW Yu, T Meng, B Caine… - Proceedings of the …, 2022 - openaccess.thecvf.com

Lidars and cameras are critical sensors that provide complementary information for 3D
detection in autonomous driving. While prevalent multi-modal methods simply decorate raw …

Зберегти Послатися Цитовано в 437 джерелах Пов’язані статті Кількість версій: 6 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

SpeechBrain: A general-purpose speech toolkit

M Ravanelli, T Parcollet, P Plantinga, A Rouhe… - arxiv preprint arxiv …, 2021 - arxiv.org

SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …

Зберегти Послатися Цитовано в 793 джерелах Пов’язані статті Кількість версій: 5 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Conformer: Convolution-augmented transformer for speech recognition

A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang… - arxiv preprint arxiv …, 2020 - arxiv.org

Recently Transformer and Convolution neural network (CNN) based models have shown
promising results in Automatic Speech Recognition (ASR), outperforming Recurrent neural …

Зберегти Послатися Цитовано в 3590 джерелах Пов’язані статті Кількість версій: 9 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Gshard: Scaling giant models with conditional computation and automatic sharding

D Lepikhin, HJ Lee, Y Xu, D Chen, O Firat… - arxiv preprint arxiv …, 2020 - arxiv.org

Neural network scaling has been critical for improving the model quality in many real-world
machine learning applications with vast amounts of training data and compute. Although this …

Зберегти Послатися Цитовано в 1102 джерелах Пов’язані статті Кількість версій: 6 Показати у форматі HTML

[Free GPT-4]
[DeepSeek]

[PDF] jmlr.org

Scaling up models and data with t5x and seqio

A Roberts, HW Chung, G Mishra, A Levskaya… - Journal of Machine …, 2023 - jmlr.org

Scaling up training datasets and model parameters have benefited neural network-based
language models, but also present challenges like distributed compute, input data …

Зберегти Послатися Цитовано в 162 джерелах Пов’язані статті Кількість версій: 4 Показати у форматі HTML

Створити сповіщення

Послатися

Розширений пошук

Збережено в моїй бібліотеці

Lingvo: a modular and scalable framework for sequence-to-sequence modeling

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

[HTML][HTML] Julia language in machine learning: Algorithms, applications, and open issues

[PDF][PDF] Scaling autoregressive models for content-rich text-to-image generation

End-to-end speech recognition: A survey

Lamda: Language models for dialog applications

Deepfusion: Lidar-camera deep fusion for multi-modal 3d object detection

SpeechBrain: A general-purpose speech toolkit

Conformer: Convolution-augmented transformer for speech recognition

Gshard: Scaling giant models with conditional computation and automatic sharding

Scaling up models and data with t5x and seqio