Google Scholar

About 84 results (0.02 sec)

Fast conformer with linearly scalable attention for efficient speech recognition

[PDF] arxiv.org

Salm: Speech-augmented language model with in-context learning for speech recognition and translation

Z Chen, H Huang, A Andrusenko… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

We present a novel Speech Augmented Language Model (SALM) with multitask and in-
context learning capabilities. SALM comprises a frozen text LLM, an audio encoder, a …

Cited by 37, all 5 versions

[PDF] arxiv.org

Multilingual audio-visual speech recognition with hybrid CTC/RNN-T fast conformer

M Burchi, KC Puvvada, J Balam… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org

Humans are adept at leveraging visual cues from lip movements for recognizing speech in
adverse listening conditions. Audio-Visual Speech Recognition (AVSR) models follow …

Cited by 13, all 5 versions

[PDF] arxiv.org

Less is more: Accurate speech recognition & translation without web-scale data

KC Puvvada, P Żelasko, H Huang, O Hrinchuk… - ar** ubiquitous technologies to analyze adult-
child speech in naturalistic settings such as free play in order to support children's social and …

Cited by 2, all 2 versions
