- Academic Search

Y Tu, W Lin, MW Mak - IEEE Access, 2022 - ieeexplore.ieee.org

Speaker verification (SV) aims to detect an individual's identity from his/her voice. SV has
been successfully applied in various areas such as access control, remote service …

Opslaan Citeren Geciteerd door 29 Verwante artikelen Alle 5 versies

[Free GPT-4]
[DeepSeek]

[PDF] polyu.edu.hk

Wav2Spk: A simple DNN architecture for learning speaker embeddings from waveforms

W Lin, MW Mak - 2020 - ira.lib.polyu.edu.hk

Speaker recognition has seen impressive advances with the advent of deep neural networks
(DNNs). However, state-of-the-art speaker recognition systems still rely on human …

Opslaan Citeren Geciteerd door 41 Verwante artikelen Alle 8 versies HTML-versie

Text-independent speaker verification employing CNN-LSTM-TDNN hybrid networks

J Alam, A Fathan, WH Kang - International Conference on Speech and …, 2021 - Springer

Abstract Time Delay Neural Network (TDNN)-based speaker embeddings extraction have
become the dominant approach for text-independent speaker verification. Several single …

Opslaan Citeren Geciteerd door 19 Verwante artikelen Alle 3 versies

Robust speaker verification using deep weight space ensemble

W Lin, MW Mak - IEEE/ACM Transactions on Audio, Speech …, 2023 - ieeexplore.ieee.org

Domain shift is one of the most challenging problems in speaker verification. Although
numerous methods have been proposed to address domain shift, most approaches optimize …

Opslaan Citeren Geciteerd door 9 Verwante artikelen Alle 4 versies

Mixture representation learning for deep speaker embedding

W Lin, MW Mak - IEEE/ACM Transactions on Audio, Speech …, 2022 - ieeexplore.ieee.org

How to effectively convert a sequence of variable-length acoustic features to a fixed-
dimension representation has always been a research focus in speaker recognition. In state …

Opslaan Citeren Geciteerd door 12 Verwante artikelen Alle 4 versies

[Free GPT-4]
[DeepSeek]

[PDF] polyu.edu.hk

Robust speaker verification using population-based data augmentation

W Lin, MW Mak - … 2022-2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org

Speaker recognition under environments with a low signal-to-noise ratio (SNR) and high
reverberation level has always been challenging. Data augmentation can be applied to …

Opslaan Citeren Geciteerd door 13 Verwante artikelen Alle 4 versies

[Free GPT-4]
[DeepSeek]

[PDF] polyu.edu.hk

Promoting independence of depression and speaker features for speaker disentanglement in speech-based depression detection

L Zuo, MW Mak, Y Tu - ICASSP 2024-2024 IEEE International …, 2024 - ieeexplore.ieee.org

Recent studies have demonstrated the effectiveness of speaker disentanglement in
mitigating the interference caused by speaker features in speech-based depression …

Opslaan Citeren Geciteerd door 4 Verwante artikelen Alle 3 versies

[Free GPT-4]
[DeepSeek]

[PDF] polyu.edu.hk

Aggregating frame-level information in the spectral domain with self-attention for speaker embedding

Y Tu, MW Mak - IEEE/ACM Transactions on Audio, Speech …, 2022 - ieeexplore.ieee.org

Most pooling methods in state-of-the-art speaker embedding networks are implemented in
the temporal domain. However, due to the high non-stationarity in the feature maps …

Opslaan Citeren Geciteerd door 11 Verwante artikelen Alle 7 versies

[Free GPT-4]
[DeepSeek]

[PDF] isca-archive.org

[PDF][PDF] Mutual Information Enhanced Training for Speaker Embedding.

Y Tu, MW Mak - Interspeech, 2021 - isca-archive.org

Mutual information (MI) is useful in unsupervised and selfsupervised learning. Maximizing
the MI between the low-level features and the learned embeddings can preserve meaningful …

Opslaan Citeren Geciteerd door 7 Verwante artikelen Alle 4 versies HTML-versie

[Free GPT-4]
[DeepSeek]

[PDF] sigport.org

Short-time spectral aggregation for speaker embedding

Y Tu, MW Mak - … 2021-2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org

State-of-the-art speaker verification systems take frame-level acoustics features as input and
produce fixed-dimensional embeddings as utterance-level representations. Thus, how to …

Opslaan Citeren Geciteerd door 7 Verwante artikelen Alle 4 versies

Melding maken

Citeren

Geavanceerd zoeken

Opgeslagen in Mijn bibliotheek

Learning mixture representation for deep speaker embedding using attention

A survey on text-dependent and text-independent speaker verification

Wav2Spk: A simple DNN architecture for learning speaker embeddings from waveforms

Text-independent speaker verification employing CNN-LSTM-TDNN hybrid networks

Robust speaker verification using deep weight space ensemble

Mixture representation learning for deep speaker embedding

Robust speaker verification using population-based data augmentation

Promoting independence of depression and speaker features for speaker disentanglement in speech-based depression detection

Aggregating frame-level information in the spectral domain with self-attention for speaker embedding

[PDF][PDF] Mutual Information Enhanced Training for Speaker Embedding.

Short-time spectral aggregation for speaker embedding