- Academic Search

D Bromand - US Patent 11,501,764, 2022 - Google Patents

Methods, systems, and related products for voice-enabled computer systems are described.
A machine learning model is trained to produce pronunciation output based on text input …

Save Cite Cited by 7 Related articles All 4 versions Free GPT-4 Cached

Audio information processing method, audio information processing apparatus, electronic device, and storage medium

S Zhang, M Lei - US Patent 12,154,545, 2024 - freepatentsonline.com

In various embodiments, this application provides an audio information processing method,
an audio information processing apparatus, an electronic device, and a storage medium. An …

Save Cite Related articles Cached

[Free GPT-4]

[PDF] googleapis.com

Automated domain-specific constrained decoding from speech inputs to structured resources

AR Mittal, S Bharadwaj, S Khare… - US Patent …, 2024 - Google Patents

Methods, systems, and computer program products for automated domain-specific
constrained decoding from speech inputs to structured resources are provided herein. A …

Method for training speech recognition model, method and system for speech recognition

J Tao, Z Tian, J Yi - US Patent 11,580,957, 2023 - Google Patents

2022-06-09 Assigned to INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF
SCIENCES reassignment INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF …

[Free GPT-4]

[PDF] googleapis.com

Method and device for compressing finite-state transducers data

Z Liang - US Patent App. 17/782,152, 2023 - Google Patents

(57) ABSTRACT A method and device for compressing FST data are pro vided. The method
includes: acquiring to-be-compressed FST data, where the FST data includes state transition …

Systems and methods for generating locale-specific phonetic spelling variations

A Gupta, A Raghuveer, A Sharma, N Raut… - US Patent …, 2024 - Google Patents

Abstract Systems and methods for generating phonetic spelling variations of a given word
based on locale-specific pronunciations. A phoneme-letter density model may be configured …

[Free GPT-4]

[PDF] googleapis.com

Alphanumeric sequence biasing for automatic speech recognition using a grammar and a speller finite state transducer

B Haynor, P Aleksic - US Patent 11,942,091, 2024 - Google Patents

Speech processing techniques are disclosed that enable determining a text representation
of alphanumeric sequences in captured audio data. Various implementations include …

[Free GPT-4]

[PDF] googleapis.com

Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion

DH Le, FNU Mahaveer, G Keren, C Fuegen… - US Patent …, 2024 - Google Patents

In one embodiment, a method includes receiving a user's utterance comprising a word in a
custom vocabulary list of the user, generating a previous token to represent a previous audio …

Create alert

Cite

Advanced search

Saved to My library

Phoneme-based contextualization for cross-lingual speech recognition in end-to-end models

Apparatus for media entity pronunciation using deep learning

Audio information processing method, audio information processing apparatus, electronic device, and storage medium

Automated domain-specific constrained decoding from speech inputs to structured resources

Method for training speech recognition model, method and system for speech recognition

Method and device for compressing finite-state transducers data

Systems and methods for generating locale-specific phonetic spelling variations

Alphanumeric sequence biasing for automatic speech recognition using a grammar and a speller finite state transducer

Contextualized streaming end-to-end speech recognition with trie-based deep biasing and shallow fusion