- Academic Search

X Tan, T Qin, F Soong, TY Liu - arxiv preprint arxiv:2106.15561, 2021 - arxiv.org

Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

Zapisz Cytuj Cytowane przez 469 Powiązane artykuły Wszystkie wersje 2 Wersja HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Spoken Language Identification: An overview of past and present research trends

D O'Shaughnessy - Speech Communication, 2024 - Elsevier

Identification of the language used in spoken utterances is useful for multiple applications,
eg, assist in directing or automating telephone calls, or selecting which language-specific …

Zapisz Cytuj Cytowane przez 1 Powiązane artykuły

A novel tracking deep wavelet auto-encoder method for intelligent fault diagnosis of electric locomotive bearings

S Haidong, J Hongkai, Z Ke, W Dongdong… - Mechanical Systems and …, 2018 - Elsevier

The condition monitoring of electric locomotive has attracted more and more attention due to
its significance for improving the security, reliability and automation level. In this paper, a …

Zapisz Cytuj Cytowane przez 114 Powiązane artykuły Wszystkie wersje 2

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources

H Barakat, O Turk, C Demiroglu - EURASIP Journal on Audio, Speech, and …, 2024 - Springer

Speech synthesis has made significant strides thanks to the transition from machine learning
to deep learning models. Contemporary text-to-speech (TTS) models possess the capability …

Zapisz Cytuj Cytowane przez 11 Powiązane artykuły Wszystkie wersje 6

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Speech prosody enhances the neural processing of syntax

G Degano, PW Donhauser, L Gwilliams… - Communications …, 2024 - nature.com

Human language relies on the correct processing of syntactic information, as it is essential
for successful communication between speakers. As an abstract level of language, syntax …

Zapisz Cytuj Cytowane przez 10 Powiązane artykuły Wszystkie wersje 6 Wyszukiwanie bibliotek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Hierarchical prosody modeling for non-autoregressive speech synthesis

CM Chien, H Lee - 2021 IEEE Spoken Language Technology …, 2021 - ieeexplore.ieee.org

Prosody modeling is an essential component in modern text-to-speech (TTS) frameworks.
By explicitly providing prosody features to the TTS model, the style of synthesized utterances …

Zapisz Cytuj Cytowane przez 37 Powiązane artykuły Wszystkie wersje 4

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Prosody-controllable spontaneous TTS with neural HMMs

H Lameris, S Mehta, GE Henter… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Spontaneous speech has many affective and pragmatic functions that are interesting and
challenging to model in TTS. However, the presence of reduced articulation, fillers …

Zapisz Cytuj Cytowane przez 18 Powiązane artykuły Wszystkie wersje 5

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Prosody and fluency of Finland Swedish as a second language: Investigating global parameters for automated speaking assessment

H Kallio, M Kautonen, M Kuronen - Speech Communication, 2023 - Elsevier

This study investigates prosody and fluency of Finland Swedish as a second language (L2).
The main objective is to investigate global measures of prosody and fluency as predictors of …

Zapisz Cytuj Cytowane przez 10 Powiązane artykuły Wszystkie wersje 3

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Event-related responses reflect chunk boundaries in natural speech

I Anurova, S Vetchinnikova, A Dobrego, N Williams… - NeuroImage, 2022 - Elsevier

Chunking language has been proposed to be vital for comprehension enabling the
extraction of meaning from a continuous stream of speech. However, neurocognitive …

Zapisz Cytuj Cytowane przez 16 Powiązane artykuły Wszystkie wersje 11

[Free GPT-4]
[DeepSeek]

[PDF] jneurosci.org Free from Publisher

Intonation Units in spontaneous speech evoke a neural response

M Inbar, S Genzer, A Perry, E Grossman… - Journal of …, 2023 - Soc Neuroscience

Spontaneous speech is produced in chunks called intonation units (IUs). IUs are defined by
a set of prosodic cues and presumably occur in all human languages. Recent work has …

Zapisz Cytuj Cytowane przez 11 Powiązane artykuły Wszystkie wersje 10

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

Hierarchical representation and estimation of prosody using continuous wavelet transform

A survey on neural speech synthesis

[HTML][HTML] Spoken Language Identification: An overview of past and present research trends

A novel tracking deep wavelet auto-encoder method for intelligent fault diagnosis of electric locomotive bearings

Deep learning-based expressive speech synthesis: a systematic review of approaches, challenges, and resources

Speech prosody enhances the neural processing of syntax

Hierarchical prosody modeling for non-autoregressive speech synthesis

Prosody-controllable spontaneous TTS with neural HMMs

[HTML][HTML] Prosody and fluency of Finland Swedish as a second language: Investigating global parameters for automated speaking assessment

[HTML][HTML] Event-related responses reflect chunk boundaries in natural speech

Intonation Units in spontaneous speech evoke a neural response