- Academic Search

K Rao, F Peng, H Sak… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org

Grapheme-to-phoneme (G2P) models are key components in speech recognition and text-to-
speech systems as they describe how words are pronounced. We propose a G2P model …

Lưu Trích dẫn Trích dẫn 284 bài viết Bài viết có liên quan Tất cả 6 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] ed.ac.uk

Improving seq2seq tts frontends with transcribed speech audio

S Sun, K Richmond, H Tang - IEEE/ACM Transactions on …, 2023 - ieeexplore.ieee.org

Due to the data inefficiency and low speech quality of grapheme-based end-to-end text-to-
speech (TTS), having a separate high-performance TTS linguistic frontend is still commonly …

Lưu Trích dẫn Trích dẫn 5 bài viết Bài viết có liên quan Tất cả 5 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] aaronlspringer.com

" Play PRBLMS" Identifying and Correcting Less Accessible Content in Voice Interfaces

A Springer, H Cramer - Proceedings of the 2018 CHI Conference on …, 2018 - dl.acm.org

Voice interfaces often struggle with specific types of named content. Domain-specific
terminology and naming may push the bounds of standard language, especially in domains …

Lưu Trích dẫn Trích dẫn 30 bài viết Bài viết có liên quan Tất cả 2 phiên bản

[Free GPT-4]
[DeepSeek]

[PDF] isca-archive.org

[PDF][PDF] Learning Personalized Pronunciations for Contact Name Recognition.

A Bruguier, F Peng, F Beaufays - INTERSPEECH, 2016 - isca-archive.org

Automatic speech recognition that involves people's names is difficult because names follow
a long-tail distribution and they have no commonly accepted spelling or pronunciation. This …

Lưu Trích dẫn Trích dẫn 20 bài viết Bài viết có liên quan Tất cả 6 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] isca-archive.org

[PDF][PDF] Pronunciation Learning with RNN-Transducers.

A Bruguier, D Gnanapragasam, L Johnson, K Rao… - …, 2017 - isca-archive.org

Most speech recognition systems rely on pronunciation dictionaries to provide accurate
transcriptions. Typically, some pronunciations are carved manually, but many are produced …

Lưu Trích dẫn Trích dẫn 18 bài viết Bài viết có liên quan Tất cả 3 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] isca-archive.org

[PDF][PDF] Predicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks.

D van Esch, M Chua, K Rao - INTERSPEECH, 2016 - isca-archive.org

Word pronunciations, consisting of phoneme sequences and the associated syllabification
and stress patterns, are vital for both speech recognition and text-to-speech (TTS) systems …

Lưu Trích dẫn Trích dẫn 18 bài viết Bài viết có liên quan Tất cả 5 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] googleapis.com

System and method for eliciting open-ended natural language responses to questions to train natural language processors

SJ Rothwell, D Braga, AK Elshenawy… - US Patent …, 2017 - Google Patents

Systems and methods gathering text commands in response to a command context using a
first crowdsourced are dis cussed herein. A command context for a natural language …

Lưu Trích dẫn Trích dẫn 21 bài viết Bài viết có liên quan Tất cả 4 phiên bản Bản lưu

[Free GPT-4]
[DeepSeek]

[PDF] googleapis.com

System and method for validating natural language content using crowdsourced validation jobs

SJ Rothwell, D Braga, AK Elshenawy… - US Patent …, 2016 - Google Patents

8/2014 Rhoads......................... 382/255 9, 2014 AO1G 7,045 315,307 senting whether or not
the text accurately represents the natu ral language content may be received from each of …

Lưu Trích dẫn Trích dẫn 18 bài viết Bài viết có liên quan Tất cả 2 phiên bản Bản lưu

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning

S Sun, K Richmond - ar** an integrated sequence-
to-sequence (Seq2Seq) linguistic frontend from a traditional pipeline-based frontend for text …

Lưu Trích dẫn Bài viết có liên quan Tất cả 4 phiên bản Xem dạng HTML

[Free GPT-4]
[DeepSeek]

[PDF] googleapis.com

System and method of recording utterances using unmanaged crowds for natural language processing

D Braga, SJ Rothwell, F Romani… - US Patent …, 2017 - Google Patents

2016-07-20 Assigned to VOICEBOX TECHNOLOGIES CORPORATION reassignment
VOICEBOX TECHNOLOGIES CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST …

Lưu Trích dẫn Trích dẫn 16 bài viết Bài viết có liên quan Tất cả 4 phiên bản Bản lưu

Tạo thông báo

Trích dẫn

Tìm kiếm nâng cao

Đã lưu vào Thư viện của tôi

Pronunciation learning for named-entities through crowd-sourcing.

Grapheme-to-phoneme conversion using long short-term memory recurrent neural networks

Improving seq2seq tts frontends with transcribed speech audio

" Play PRBLMS" Identifying and Correcting Less Accessible Content in Voice Interfaces

[PDF][PDF] Learning Personalized Pronunciations for Contact Name Recognition.

[PDF][PDF] Pronunciation Learning with RNN-Transducers.

[PDF][PDF] Predicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks.

System and method for eliciting open-ended natural language responses to questions to train natural language processors

System and method for validating natural language content using crowdsourced validation jobs

Acquiring Pronunciation Knowledge from Transcribed Speech Audio via Multi-task Learning

System and method of recording utterances using unmanaged crowds for natural language processing