Google 학술 검색

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

저장 인용 442회 인용 관련 학술자료 전체 9개의 버전

[Free GPT-4]

[PDF] arxiv.org

A review of speaker diarization: Recent advances with deep learning

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022 - Elsevier

Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …

저장 인용 420회 인용 관련 학술자료 전체 7개의 버전

[Free GPT-4]

[PDF] arxiv.org

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

S Watanabe, M Mandel, J Barker, E Vincent… - arxiv preprint arxiv …, 2020 - arxiv.org

Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the
6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge …

저장 인용 363회 인용 관련 학술자료 전체 7개의 버전 HTML 버전

[Free GPT-4]

[PDF] vut.cz

Bayesian hmm clustering of x-vector sequences (vbx) in speaker diarization: theory, implementation and analysis on standard tasks

F Landini, J Profant, M Diez, L Burget - Computer Speech & Language, 2022 - Elsevier

The recently proposed VBx diarization method uses a Bayesian hidden Markov model to
find speaker clusters in a sequence of x-vectors. In this work we perform an extensive …

저장 인용 233회 인용 관련 학술자료 전체 6개의 버전

[Free GPT-4]

[PDF] danielpovey.com

Speaker recognition for multi-speaker conversations using x-vectors

D Snyder, D Garcia-Romero, G Sell… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

Recently, deep neural networks that map utterances to fixed-dimensional embeddings have
emerged as the state-of-the-art in speaker recognition. Our prior work introduced x-vectors …

저장 인용 394회 인용 관련 학술자료 전체 7개의 버전

[Free GPT-4]

[PDF] arxiv.org

The third DIHARD diarization challenge

N Ryant, P Singh, V Krishnamohan, R Varma… - arxiv preprint arxiv …, 2020 - arxiv.org

DIHARD III was the third in a series of speaker diarization challenges intended to improve
the robustness of diarization systems to variability in recording equipment, noise conditions …

[Free GPT-4]

[PDF] arxiv.org

End-to-end neural speaker diarization with self-attention

Y Fujita, N Kanda, S Horiguchi, Y Xue… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org

Speaker diarization has been mainly developed based on the clustering of speaker
embeddings. However, the clustering-based approach has two major problems; ie,(i) it is not …

저장 인용 293회 인용 관련 학술자료 전체 7개의 버전

[Free GPT-4]

[PDF] arxiv.org

End-to-end neural speaker diarization with permutation-free objectives

Y Fujita, N Kanda, S Horiguchi, K Nagamatsu… - arxiv preprint arxiv …, 2019 - arxiv.org

In this paper, we propose a novel end-to-end neural-network-based speaker diarization
method. Unlike most existing methods, our proposed method does not have separate …

저장 인용 289회 인용 관련 학술자료 전체 7개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Target-speaker voice activity detection: a novel approach for multi-speaker diarization in a dinner party scenario

I Medennikov, M Korenevsky, T Prisyach… - arxiv preprint arxiv …, 2020 - arxiv.org

Speaker diarization for real-life scenarios is an extremely challenging problem. Widely used
clustering-based diarization approaches perform rather poorly in such conditions, mainly …

[Free GPT-4]

[PDF] arxiv.org

Spot the conversation: speaker diarisation in the wild

JS Chung, J Huh, A Nagrani, T Afouras… - arxiv preprint arxiv …, 2020 - arxiv.org

The goal of this paper is speaker diarisation of videos collected'in the wild'. We make three
key contributions. First, we propose an automatic audio-visual diarisation method for …

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural...

Speaker recognition based on deep learning: An overview

A review of speaker diarization: Recent advances with deep learning

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

Bayesian hmm clustering of x-vector sequences (vbx) in speaker diarization: theory, implementation and analysis on standard tasks

Speaker recognition for multi-speaker conversations using x-vectors

The third DIHARD diarization challenge

End-to-end neural speaker diarization with self-attention

End-to-end neural speaker diarization with permutation-free objectives

Target-speaker voice activity detection: a novel approach for multi-speaker diarization in a dinner party scenario

Spot the conversation: speaker diarisation in the wild