- Academic Search

Voxceleb: Large-scale speaker verification in the wild

A Nagrani, JS Chung, W Xie, A Zisserman - Computer Speech & Language, 2020 - Elsevier

The objective of this work is speaker recognition under noisy and unconstrained conditions.
We make two key contributions. First, we introduce a very large-scale audio-visual dataset …

Cited by 792 · 11 versions

Optimization of data-driven filterbank for automatic speaker verification

S Sarangi, M Sahidullah, G Saha - Digital Signal Processing, 2020 - Elsevier

Most of the speech processing applications use triangular filters spaced in mel-scale for
feature extraction. In this paper, we propose a new data-driven filter design method which …

Cited by 69 · 8 versions

Xi-vector embedding for speaker recognition

KA Lee, Q Wang, T Koshinaka - IEEE Signal Processing Letters, 2021 - ieeexplore.ieee.org

We present a Bayesian formulation for deep speaker embedding, wherein the xi-vector is
the Bayesian counterpart of the x-vector, taking into account the uncertainty estimate. On the …

Cited by 36 · 4 versions

Audio-visual speaker recognition with a cross-modal discriminative network

R Tao, RK Das, H Li - arXiv preprint arXiv:2008.03894, 2020 - arxiv.org

Audio-visual speaker recognition is one of the tasks in the recent 2019 NIST speaker
recognition evaluation (SRE). Studies in neuroscience and computer science all point to the …

Cited by 46 · 9 versions

Voxceleb enrichment for age and gender recognition

K Hechmi, TN Trong, V Hautamäki… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org

VoxCeleb datasets are widely used in speaker recognition studies. Our work serves two
purposes. First, we provide speaker age labels and (an alternative) annotation of speaker …

Cited by 30 · 3 versions

A study of bias mitigation strategies for speaker recognition

R Peri, K Somandepalli, S Narayanan - Computer Speech & Language, 2023 - Elsevier

Speaker recognition is increasingly used in several everyday applications including smart
speakers, customer care centers and other speech-driven analytics. It is crucial to accurately …

Cited by 11 · 3 versions

Towards robust speaker verification with target speaker enhancement

C Zhang, M Yu, C Weng, D Yu - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

This paper proposes the target speaker enhancement based speaker verification network
(TASE-SVNet), an all neural model that couples target speaker enhancement and speaker …

Cited by 21 · 3 versions

An investigation of domain adaptation in speaker embedding space for speaker recognition

F Bahmaninezhad, C Zhang, JHL Hansen - Speech Communication, 2021 - Elsevier

Speaker recognition continues to grow as a research challenge in the field with expanded
application in commercial, forensic, educational and general speech technology interfaces …

Cited by 19 · 2 versions

Incorporating uncertainty from speaker embedding estimation to speaker verification

Q Wang, KA Lee, T Liu - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org

Speech utterances recorded under differing conditions exhibit varying degrees of
confidence in their embedding estimates, i.e., uncertainty, even if they are extracted using the …

Cited by 7 · 4 versions

NEC-TT system for mixed-bandwidth and multi-domain speaker recognition

KA Lee, H Yamamoto, K Okabe, Q Wang, L Guo… - Computer Speech & …, 2020 - Elsevier

This paper describes the NEC-TT speaker recognition system designed for the 2018
Speaker Recognition Evaluation (SRE'18) benchmarking. The NEC-TT submission was …

Cited by 22 · 3 versions

I4U submission to NIST SRE 2018: Leveraging from a decade of shared experiences

Voxceleb: Large-scale speaker verification in the wild
