- Academic Search

H Sumioka, M Shiomi, M Honda… - Frontiers in Robotics and …, 2021 - frontiersin.org

Due to cognitive and socio-emotional decline and mental diseases, senior citizens,
especially people with dementia (PwD), struggle to interact smoothly with their caregivers …

Spara Citera Citerat av 28 Relaterade artiklar Alla 8 versionerna Cachad

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Controllable emotion transfer for end-to-end speech synthesis

T Li, S Yang, L Xue, L **e - 2021 12th International Symposium …, 2021 - ieeexplore.ieee.org

Emotion embedding space learned from references is a straight-forward approach for
emotion transfer in encoder-decoder structured emotional text to speech (TTS) systems …

Spara Citera Citerat av 100 Relaterade artiklar Alla 3 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] uni-augsburg.de

Stargan for emotional speech conversion: Validated by data augmentation of end-to-end emotion recognition

G Rizos, A Baird, M Elliott… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org

In this paper, we propose an adversarial network implementation for speech emotion
conversion as a data augmentation method, validated by a multi-class speech affect …

Spara Citera Citerat av 79 Relaterade artiklar Alla 4 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] ed.ac.uk

Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis

J Lorenzo-Trueba, GE Henter, S Takaki… - Speech …, 2018 - Elsevier

In this paper, we investigate the simultaneous modeling of multiple emotions in DNN-based
expressive speech synthesis, and how to represent the emotional labels, such as emotional …

Spara Citera Citerat av 101 Relaterade artiklar Alla 4 versionerna

Multi-type features separating fusion learning for Speech Emotion Recognition

X Xu, D Li, Y Zhou, Z Wang - Applied Soft Computing, 2022 - Elsevier

Abstract Speech Emotion Recognition (SER) is a challengeable task to improve human–
computer interaction. Speech data have different representations, and choosing the …

Spara Citera Citerat av 23 Relaterade artiklar Alla 2 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] acm.org Full View

Speech melody matters—how robots profit from using charismatic speech

K Fischer, O Niebuhr, LC Jensen… - ACM Transactions on …, 2019 - dl.acm.org

In this article, we address to what extent the proverb “the sound makes the music” also
applies to human-robot interaction, and whether robots could profit from using speech …

Spara Citera Citerat av 60 Relaterade artiklar Alla 5 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey

T **e, Y Rong, P Zhang, L Liu - arxiv preprint arxiv:2412.06602, 2024 - arxiv.org

Text-to-speech (TTS), also known as speech synthesis, is a prominent research area that
aims to generate natural-sounding human speech from text. Recently, with the increasing …

Spara Citera Citerat av 1 Relaterade artiklar Alla 2 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deep encoder-decoder models for unsupervised learning of controllable speech synthesis

GE Henter, J Lorenzo-Trueba, X Wang… - arxiv preprint arxiv …, 2018 - arxiv.org

Generating versatile and appropriate synthetic speech requires control over the output
expression separate from the spoken text. Important non-textual speech variation is seldom …

Spara Citera Citerat av 68 Relaterade artiklar Alla 2 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

End-to-end triplet loss based emotion embedding system for speech emotion recognition

P Kumar, S Jain, B Raman, PP Roy… - … Conference on Pattern …, 2021 - ieeexplore.ieee.org

In this paper, an end-to-end neural embedding system based on triplet loss and residual
learning has been proposed for speech emotion recognition. The proposed system learns …

Spara Citera Citerat av 34 Relaterade artiklar Alla 14 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

A survey on speech synthesis techniques in Indian languages

SP Panda, AK Nayak, SC Rai - Multimedia Systems, 2020 - Springer

The text to speech technology has achieved significant progress during the past decade and
is an active area of research and development in providing different human–computer …

Spara Citera Citerat av 36 Relaterade artiklar Alla 3 versionerna

Skapa alarm

Citera

Avancerad sökning

Har sparats i Mitt bibliotek

Emotion transplantation through adaptation in HMM-based speech synthesis

Technical challenges for smooth interaction with seniors with dementia: Lessons from humanitude™

Controllable emotion transfer for end-to-end speech synthesis

Stargan for emotional speech conversion: Validated by data augmentation of end-to-end emotion recognition

Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis

Multi-type features separating fusion learning for Speech Emotion Recognition

Speech melody matters—how robots profit from using charismatic speech

Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey

Deep encoder-decoder models for unsupervised learning of controllable speech synthesis

End-to-end triplet loss based emotion embedding system for speech emotion recognition

A survey on speech synthesis techniques in Indian languages