- Academic Search

K Zhou, B Sisman, R Rana… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Emotional speech synthesis aims to synthesize human voices with various emotional effects.
The current studies are mostly focused on imitating an averaged style belonging to a specific …

Save Cite Cited by 62 Related articles All 7 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Learning efficient representations for keyword spotting with triplet loss

R Vygon, N Mikhaylovskiy - … 2021, St. Petersburg, Russia, September 27 …, 2021 - Springer

In the past few years, triplet loss-based metric embeddings have become a de-facto
standard for several important computer vision problems, most notably, person …

Save Cite Cited by 61 Related articles All 6 versions Free GPT-4

A survey on automatic multimodal emotion recognition in the wild

G Sharma, A Dhall - Advances in data science: Methodologies and …, 2021 - Springer

Affective computing has been an active area of research for the past two decades. One of
the major component of affective computing is automatic emotion recognition. This chapter …

Save Cite Cited by 53 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Self-supervised endoscopic image key-points matching

M Farhat, H Chaabouni-Chouayakh… - Expert Systems with …, 2023 - Elsevier

Feature matching and finding correspondences between endoscopic images is a key step in
many clinical applications such as patient follow-up and generation of panoramic image …

Save Cite Cited by 18 Related articles All 6 versions Free GPT-4

Multi-cultural speech emotion recognition using language and speaker cues

SK Pandey, HS Shekhawat, SRM Prasanna - … Signal Processing and …, 2023 - Elsevier

Abstract Speech Emotion Recognition (SER) has been an active area of research to make
Human–Computer Interaction (HCI) smoother and more natural. However, due to the …

Save Cite Cited by 7 Related articles

[Free GPT-4]

[PDF] ieee.org

Quantifying emotional similarity in speech

J Harvill, SG Leem, M AbdelWahab… - IEEE Transactions …, 2021 - ieeexplore.ieee.org

This study proposes the novel formulation of measuring emotional similarity between
speech recordings. This formulation explores the ordinal nature of emotions by comparing …

Save Cite Cited by 12 Related articles All 3 versions Free GPT-4

Domain generalization with triplet network for cross-corpus speech emotion recognition

S Lee - 2021 IEEE Spoken Language Technology Workshop …, 2021 - ieeexplore.ieee.org

Domain generalization is a major challenge for cross-corpus speech emotion recognition.
The recognition performance built on" seen" source corpora is inevitably degraded when the …

Save Cite Cited by 15 Related articles

[Free GPT-4]

[PDF] acm.org

MSP-face corpus: a natural audiovisual emotional database

A Vidal, A Salman, WC Lin, C Busso - Proceedings of the 2020 …, 2020 - dl.acm.org

Expressive behaviors conveyed during daily interactions are difficult to determine, because
they often consist of a blend of different emotions. The complexity in expressive human …

Save Cite Cited by 14 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] mdpi.com

Learning low-dimensional embeddings of audio shingles for cross-version retrieval of classical music

F Zalkow, M Müller - Applied Sciences, 2019 - mdpi.com

Cross-version music retrieval aims at identifying all versions of a given piece of music using
a short query audio fragment. One previous approach, which is particularly suited for …

Save Cite Cited by 21 Related articles All 9 versions Free GPT-4 Cached

[Free GPT-4]

[PDF] utdallas.edu

Use of triplet-loss function to improve driving anomaly detection using conditional generative adversarial network

Y Qiu, T Misu, C Busso - 2020 IEEE 23rd International …, 2020 - ieeexplore.ieee.org

Driving anomaly detection is an important problem in advanced driver assistance systems
(ADAS). The ability to immediately detect potentially hazardous scenarios will prevent …

Save Cite Cited by 10 Related articles All 4 versions Free GPT-4

Create alert

Cite

Advanced search

Saved to My library

Retrieving speech samples with similar emotional content using a triplet loss function

Speech synthesis with mixed emotions

Learning efficient representations for keyword spotting with triplet loss

A survey on automatic multimodal emotion recognition in the wild

Self-supervised endoscopic image key-points matching

Multi-cultural speech emotion recognition using language and speaker cues

Quantifying emotional similarity in speech

Domain generalization with triplet network for cross-corpus speech emotion recognition

MSP-face corpus: a natural audiovisual emotional database

Learning low-dimensional embeddings of audio shingles for cross-version retrieval of classical music

Use of triplet-loss function to improve driving anomaly detection using conditional generative adversarial network