Systems and methods for multi-speaker neural text-to-speech

G DIAMOS, A GIBIANSKY, J Miller, P Kainan… - US Patent …, 2021 - Google Patents
Described herein are systems and methods for augmenting neural speech synthesis
networks with low-dimensional trainable speaker embeddings in order to generate speech …

Systems and methods for parallel wave generation in end-to-end text-to-speech

P Wei, P Kainan, C Jitong - US Patent 10,872,596, 2020 - Google Patents
Described herein are embodiments of an end-to-end text-to speech (TTS) system with
parallel wave generation. In one or more embodiments, a Gaussian inverse autoregressive …

Systems and methods for real-time neural text-to-speech

M Chrzanowski, A Coates, G DIAMOS… - US Patent …, 2020 - Google Patents
Embodiments of a production-quality text-to-speech (TTS) system constructed from deep
neural networks described. System embodiments comprise five major build ing blocks: a …

Parallel neural text-to-speech

P Kainan, P Wei, S Zhao, Z Kexin - US Patent 11,017,761, 2021 - Google Patents
Presented herein are embodiments of a non-autoregressive sequence-to-sequence model
that converts text to an audio representation. Embodiment are fully convolutional, and a …

Systems and methods for neural text-to-speech using convolutional sequence learning

P Wei, P Kainan, S NARANG, A KANNAN… - US Patent …, 2020 - Google Patents
Described herein are embodiments of a fully-convolutional attention-based neural text-to-
speech (TTS) system, which various embodiments may generally be referred to as Deep …

Waveform generation using end-to-end text-to-waveform system

P Wei, P Kainan, C Jitong - US Patent 11,482,207, 2022 - Google Patents
Described herein are embodiments of an end-to-end text-to-speech (TTS) system with
parallel wave generation. In one or more embodiments, a Gaussian inverse autoregressive …

System and method for outlier identification to remove poor alignments in speech synthesis

EV Raghavendra, A Ganapathiraju - US Patent 10,497,362, 2019 - Google Patents
A system and method are presented for outlier identification to remove poor alignments in
speech synthesis. The quality of the output of a text-to-speech system directly depends on …

Method for pronunciation transcription using speech-to-text model

D Shin - US Patent 12,051,421, 2024 - Google Patents
Disclosed is a pronunciation transcription method performed by a computing device. The
method may include: acquiring a partial audio signal of a first sound unit generated by …

Acoustic model training method, speech recognition method, apparatus, device and medium

H Liang, J Wang, N Cheng, J **ao - US Patent 11,030,998, 2021 - Google Patents
An acoustic model training method, a speech recognition method, an apparatus, a device
and a medium. The acoustic model training method comprises: performing feature extraction …

Multi-speaker neural text-to-speech

G DIAMOS, A GIBIANSKY, J Miller, P Kainan… - US Patent …, 2023 - Google Patents
the present disclosure relates generally to systems and methods for machine learning that
can provide improved computer performance, features, and uses. More particularly, the …