- Academic Search

Accented text-to-speech synthesis with limited data

Turnitin 降AI改写早检测系统早降重系统 Turnitin-UK版万方检测-期刊版维普编辑部版 Grammarly检测 Paperpass检测 checkpass检测 PaperYY检测

Controllable accented text-to-speech synthesis with fine and coarse-grained intensity rendering

R Liu, B Sisman, G Gao, H Li - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org

Accented text-to-speech (TTS) synthesis seeks to generate speech with an accent (L2) as a
variant of the standard version (L1), which is challenging as L2 is different from L1 in terms …

บันทึก อ้างอิง อ้างโดย16 บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ

An embedded end-to-end voice assistant

L Lazzaroni, F Bellotti, R Berta - Engineering Applications of Artificial …, 2024 - Elsevier

Voice assistants are spreading in various environments, such as houses and cars, bringing
the possibility of controlling heterogeneous Internet of Things devices with simple voice …

บันทึก อ้างอิง อ้างโดย4 บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ

Improving mispronunciation detection using speech reconstruction

A Das, R Gutierrez-Osuna - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org

Training related machine learning tasks simultaneously can lead to improved performance
on both tasks. Text-to-speech (TTS) and mispronunciation detection and diagnosis (MDD) …

บันทึก อ้างอิง อ้างโดย3 บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multi-scale accent modeling with disentangling for multi-speaker multi-accent TTS synthesis

X Zhou, M Zhang, Y Zhou, Z Wu, H Li - arxiv preprint arxiv:2406.10844, 2024 - arxiv.org

Synthesizing speech across different accents while preserving the speaker identity is
essential for various real-world customer applications. However, the individual and accurate …

บันทึก อ้างอิง อ้างโดย3 บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ ดูในรูปแบบ HTML

Towards zero-shot multi-speaker multi-accent text-to-speech synthesis

M Zhang, X Zhou, Z Wu, H Li - IEEE Signal Processing Letters, 2023 - ieeexplore.ieee.org

This letter presents a framework towards multi-accent neural text-to-speech synthesis for
zero-shot multi-speaker, which employs an encoder-decoder architecture and an accent …

บันทึก อ้างอิง อ้างโดย4 บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Zero-shot emotion transfer for cross-lingual speech synthesis

Y Li, X Zhu, Y Lei, H Li, J Liu, D **e… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

Zero-shot emotion transfer in cross-lingual speech synthesis aims to transfer emotion from
an arbitrary speech reference in the source language to the synthetic speech in the target …

บันทึก อ้างอิง อ้างโดย2 บทความที่เกี่ยวข้อง ทั้งหมด 3 ฉบับ

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Non-autoregressive real-time accent conversion model with voice cloning

V Nechaev, S Kosyakov - arxiv preprint arxiv:2405.13162, 2024 - arxiv.org

Currently, the development of Foreign Accent Conversion (FAC) models utilizes deep neural
network architectures, as well as ensembles of neural networks for speech recognition and …

บันทึก อ้างอิง อ้างโดย1 บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] researchgate.net

[PDF][PDF] Neural speech synthesis for austrian dialects with standard german grapheme-to-phoneme conversion and dialect embeddings

L Gutscher, M Pucher, V Garcia - Proc. 2nd Annual Meeting of the …, 2023 - researchgate.net

For languages where extensive audio data and text transcriptions are available, text-to-
speech (TTS) systems have showcased the ability to generate speech that closely …

บันทึก อ้างอิง อ้างโดย3 บทความที่เกี่ยวข้อง ทั้งหมด 6 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

AccentBox: Towards High-Fidelity Zero-Shot Accent Generation

J Zhong, K Richmond, Z Su, S Sun - arxiv preprint arxiv:2409.09098, 2024 - arxiv.org

While recent Zero-Shot Text-to-Speech (ZS-TTS) models have achieved high naturalness
and speaker similarity, they fall short in accent fidelity and control. To address this issue, we …

บันทึก อ้างอิง บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ ดูในรูปแบบ HTML

[Free GPT-4]
[DeepSeek]

[PDF] aclanthology.org

生成式文本质量的自动评估方法综述 (A Survey of Automatic Evaluation on the Quality of Generated Text)

L Tian, M Ziao, Z Yanghao, X Chen… - Proceedings of the 23rd …, 2024 - aclanthology.org

Abstract “人工评估, 作为生成式文本质量评价的金标准, 成本太高; 自动评估,
核心思想在于要使其评估结果与人工评估高度相关, 从而实现对生成式文本质量的自动化分析和 …

บันทึก อ้างอิง บทความที่เกี่ยวข้อง ทั้งหมด 2 ฉบับ ดูในรูปแบบ HTML

สร้างการแจ้งเตือน

อ้างอิง

การค้นหาขั้นสูง

บันทึกไปยังคลังของฉันแล้ว

Accented text-to-speech synthesis with limited data

Controllable accented text-to-speech synthesis with fine and coarse-grained intensity rendering

An embedded end-to-end voice assistant

Improving mispronunciation detection using speech reconstruction

Multi-scale accent modeling with disentangling for multi-speaker multi-accent TTS synthesis

Towards zero-shot multi-speaker multi-accent text-to-speech synthesis

Zero-shot emotion transfer for cross-lingual speech synthesis

Non-autoregressive real-time accent conversion model with voice cloning

[PDF][PDF] Neural speech synthesis for austrian dialects with standard german grapheme-to-phoneme conversion and dialect embeddings

AccentBox: Towards High-Fidelity Zero-Shot Accent Generation

生成式文本质量的自动评估方法综述 (A Survey of Automatic Evaluation on the Quality of Generated Text)