- Academic Search

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

保存引用被引用次数：439 相关文章所有 9 个版本

[Free GPT-4]

[PDF] arxiv.org

Towards neural diarization for unlimited numbers of speakers using global and local attractors

S Horiguchi, S Watanabe, P García… - 2021 IEEE Automatic …, 2021 - ieeexplore.ieee.org

Attractor-based end-to-end diarization is achieving comparable accuracy to the carefully
tuned conventional clustering-based methods on challenging datasets. However, the main …

保存引用被引用次数：43 相关文章所有 6 个版本

[Free GPT-4]

[PDF] arxiv.org

Online end-to-end neural diarization with speaker-tracing buffer

Y Xue, S Horiguchi, Y Fujita… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org

This paper proposes a novel online speaker diarization algorithm based on a fully
supervised self-attention mechanism (SA-EEND). Online diarization inherently presents a …

保存引用被引用次数：58 相关文章所有 8 个版本

[Free GPT-4]

[PDF] arxiv.org

Multi-speaker and wide-band simulated conversations as training data for end-to-end neural diarization

F Landini, M Diez, A Lozano-Diez… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

End-to-end diarization presents an attractive alternative to standard cascaded diarization
systems because a single system can handle all aspects of the task at once. Many flavors of …

保存引用被引用次数：17 相关文章所有 6 个版本

[Free GPT-4]

[PDF] arxiv.org

From Modular to End-to-End Speaker Diarization

F Landini - arxiv preprint arxiv:2407.08752, 2024 - arxiv.org

Speaker diarization is usually referred to as the task that determines``who spoke when''in a
recording. Until a few years ago, all competitive approaches were modular. Systems based …

保存引用被引用次数：1 相关文章所有 3 个版本 HTML 版

创建快讯

引用

高级搜索

已保存到“我的图书馆”

Optimal Map** Loss: A Faster Loss for End-to-End Speaker Diarization.

Speaker recognition based on deep learning: An overview

Towards neural diarization for unlimited numbers of speakers using global and local attractors

Online end-to-end neural diarization with speaker-tracing buffer

Multi-speaker and wide-band simulated conversations as training data for end-to-end neural diarization

From Modular to End-to-End Speaker Diarization