الباحث العلمي من Google

R Aloufi, H Haddadi, D Boyle - Proceedings of the 2020 ACM SIGSAC …, 2020‏ - dl.acm.org‏

Voice User Interfaces (VUIs) are increasingly popular and built into smartphones, home
assistants, and Internet of Things (IoT) devices. Despite offering an always-on convenient …‏

حفظ اقتباس تم اقتباسها في عدد: 75 مقالات ذات صلة الإصدارات الـ 4كلها

[Free GPT-4]

[PDF] sciencedirect.com

A study of bias mitigation strategies for speaker recognition‏

R Peri, K Somandepalli, S Narayanan - Computer Speech & Language, 2023‏ - Elsevier‏

Speaker recognition is increasingly used in several everyday applications including smart
speakers, customer care centers and other speech-driven analytics. It is crucial to accurately …‏

حفظ اقتباس تم اقتباسها في عدد: 11 مقالات ذات صلة الإصدارات الـ 3كلها

[Free GPT-4]

[PDF] polyu.edu.hk

Contrastive self-supervised speaker embedding with sequential disentanglement‏

Y Tu, MW Mak, JT Chien - IEEE/ACM Transactions on Audio …, 2024‏ - ieeexplore.ieee.org‏

Contrastive self-supervised learning has been widely used in speaker embedding to
address the labeling challenge. Contrastive speaker embedding assumes that the contrast …‏

حفظ اقتباس تم اقتباسها في عدد: 6 مقالات ذات صلة الإصدارات الـ 2كلها

[Free GPT-4]

[PDF] arxiv.org

Learning disentangled phone and speaker representations in a semi-supervised VQ-VAE paradigm‏

J Williams, Y Zhao, E Cooper… - ICASSP 2021-2021 …, 2021‏ - ieeexplore.ieee.org‏

We present a new approach to disentangle speaker voice and phone content by introducing
new components to the VQ-VAE architecture for speech synthesis. The original VQ-VAE …‏

حفظ اقتباس تم اقتباسها في عدد: 24 مقالات ذات صلة الإصدارات الـ 3كلها

[Free GPT-4]

[PDF] arxiv.org

Contrastive speaker embedding with sequential disentanglement‏

Y Tu, MW Mak, JT Chien - ICASSP 2024-2024 IEEE …, 2024‏ - ieeexplore.ieee.org‏

Contrastive speaker embedding assumes that the contrast between the positive and
negative pairs of speech segments is attributed to speaker identity only. However, this …‏

حفظ اقتباس تم اقتباسها في عدد: 5 مقالات ذات صلة الإصدارات الـ 4كلها

[Free GPT-4]

[PDF] google.com

Random cycle loss and its application to voice conversion‏

H Sun, D Wang, L Li, C Chen… - IEEE Transactions on …, 2023‏ - ieeexplore.ieee.org‏

Speech disentanglement aims to decompose independent causal factors of speech signals
into separate codes. Perfect disentanglement benefits to a broad range of speech …‏

حفظ اقتباس تم اقتباسها في عدد: 6 مقالات ذات صلة الإصدارات الـ 5كلها

[Free GPT-4]

[PDF] arxiv.org

Paralinguistic privacy protection at the edge‏

R Aloufi, H Haddadi, D Boyle - ACM Transactions on Privacy and …, 2023‏ - dl.acm.org‏

Voice user interfaces and digital assistants are rapidly entering our lives and becoming
singular touch points spanning our devices. These always-on services capture and transmit …‏

حفظ اقتباس تم اقتباسها في عدد: 14 مقالات ذات صلة الإصدارات الـ 4كلها

[Free GPT-4]

[PDF] arxiv.org

Acted vs. improvised: Domain adaptation for elicitation approaches in audio-visual emotion recognition‏

H Li, Y Kim, CH Kuo, S Narayanan - ar** generalized automatic emotion recognition systems include
scarcity of labeled data and lack of gold-standard references. Even for the cues that are …‏

حفظ اقتباس تم اقتباسها في عدد: 12 مقالات ذات صلة الإصدارات الـ 7كلها إصدار HTML‏

[Free GPT-4]

[PDF] arxiv.org

Exploring disentanglement with multilingual and monolingual VQ-VAE‏

J Williams, J Fong, E Cooper, J Yamagishi - arxiv preprint arxiv …, 2021‏ - arxiv.org‏

This work examines the content and usefulness of disentangled phone and speaker
representations from two separately trained VQ-VAE systems: one trained on multilingual …‏

حفظ اقتباس تم اقتباسها في عدد: 12 مقالات ذات صلة الإصدارات الـ 7كلها إصدار HTML‏

[Free GPT-4]

[PDF] biorxiv.org

Large-Scale Functional Connectome Fingerprinting for Generalization and Transfer Learning in Neuroimaging‏

M Ogg, L Kitchell - bioRxiv, 2024‏ - biorxiv.org‏

Functional MRI currently supports a limited application space stemming from modest dataset
sizes, large interindividual variability and heterogeneity among scanning protocols. These …‏

حفظ اقتباس مقالات ذات صلة الإصدارات الـ 4كلها إصدار HTML‏

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

An empirical analysis of information encoded in disentangled neural speaker representations

Privacy-preserving voice analysis via disentangled representations‏

A study of bias mitigation strategies for speaker recognition‏

Contrastive self-supervised speaker embedding with sequential disentanglement‏

Learning disentangled phone and speaker representations in a semi-supervised VQ-VAE paradigm‏

Contrastive speaker embedding with sequential disentanglement‏

Random cycle loss and its application to voice conversion‏

Paralinguistic privacy protection at the edge‏

Acted vs. improvised: Domain adaptation for elicitation approaches in audio-visual emotion recognition‏

Exploring disentanglement with multilingual and monolingual VQ-VAE‏

Large-Scale Functional Connectome Fingerprinting for Generalization and Transfer Learning in Neuroimaging‏