الباحث العلمي من Google

F Landini, T Stafylakis, L Burget - IEEE/ACM Transactions on …, 2024‏ - ieeexplore.ieee.org‏

Until recently, the field of speaker diarization was dominated by cascaded systems. Due to
their limitations, mainly regarding overlapped speech and cumbersome pipelines, end-to …‏

حفظ اقتباس تم اقتباسها في عدد: 14 مقالات ذات صلة الإصدارات الـ 2كلها

X-tf-gridnet: A time–frequency domain target speaker extraction network with adaptive speaker embedding fusion‏

F Hao, X Li, C Zheng - Information Fusion, 2024‏ - Elsevier‏

Target speaker extraction (TSE) which has the capability to directly extract desired speech
given enrollment utterances of the target speaker has attracted more and more attention for …‏

حفظ اقتباس تم اقتباسها في عدد: 8 مقالات ذات صلة

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

DiaCorrect: Error correction back-end for speaker diarization‏

J Han, F Landini, J Rohdin, M Diez… - ICASSP 2024-2024 …, 2024‏ - ieeexplore.ieee.org‏

In this work, we propose an error correction framework, named DiaCorrect, to refine the
output of a diarization system in a simple yet effective way. This method is inspired by error …‏

حفظ اقتباس تم اقتباسها في عدد: 5 مقالات ذات صلة الإصدارات الـ 3كلها

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Multi-input multi-output target-speaker voice activity detection for unified, flexible, and robust audio-visual speaker diarization‏

M Cheng, M Li - ar** speech has attracted more and more attention …‏

حفظ اقتباس تم اقتباسها في عدد: 5 مقالات ذات صلة الإصدارات الـ 5كلها

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings‏

SH Mun, MH Han, C Moon, NS Kim - arxiv preprint arxiv:2312.06065, 2023‏ - arxiv.org‏

In recent years, there have been studies to further improve the end-to-end neural speaker
diarization (EEND) systems. This letter proposes the EEND-DEMUX model, a novel …‏

حفظ اقتباس تم اقتباسها في عدد: 1 مقالات ذات صلة الإصدارات الـ 2كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors‏

M Rybicka, J Villalba, T Thebaud… - … on Audio, Speech …, 2024‏ - ieeexplore.ieee.org‏

Despite many recent developments in speaker diarization, it remains a challenge and an
active area of research to make diarization robust and effective in real-life scenarios. Well …‏

حفظ اقتباس مقالات ذات صلة الإصدارات الـ 2كلها

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation‏

M Cheng, Y Lin, M Li - arxiv preprint arxiv:2411.13849, 2024‏ - arxiv.org‏

This paper proposes a novel Sequence-to-Sequence Neural Diarization (SSND) framework
to perform online and offline speaker diarization. It is developed from the sequence-to …‏

حفظ اقتباس مقالات ذات صلة الإصدارات الـ 2كلها إصدار HTML‏

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Self-Conditioning via Intermediate Predictions for End-to-End Neural Speaker Diarization‏

Y Fujita, T Ogawa, T Kobayashi - IEEE Access, 2023‏ - ieeexplore.ieee.org‏

This paper presents a speaker diarization model that incorporates label dependency via
intermediate predictions. The proposed method is categorized as an end-to-end neural …‏

حفظ اقتباس مقالات ذات صلة الإصدارات الـ 4كلها

إنشاء تنبيه

اقتباس

بحث متقدم

تم حفظ المقالة في مكتبتي.

Neural diarization with non-autoregressive intermediate attractors

Diaper: End-to-end neural diarization with perceiver-based attractors‏

X-tf-gridnet: A time–frequency domain target speaker extraction network with adaptive speaker embedding fusion‏

DiaCorrect: Error correction back-end for speaker diarization‏

Multi-input multi-output target-speaker voice activity detection for unified, flexible, and robust audio-visual speaker diarization‏

EEND-DEMUX: End-to-End Neural Speaker Diarization via Demultiplexed Speaker Embeddings‏

End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors‏

Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation‏

Self-Conditioning via Intermediate Predictions for End-to-End Neural Speaker Diarization‏