- Academic Search

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022 - Elsevier

Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …

Uložit Citovat Počet citací tohoto článku: 429 Související články Všechny verze (počet: 7)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Uložit Citovat Počet citací tohoto článku: 439 Související články Všechny verze (počet: 10)

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - Ieee Access, 2021 - ieeexplore.ieee.org

Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …

Uložit Citovat Počet citací tohoto článku: 124 Související články Všechny verze (počet: 5)

[Free GPT-4]
[DeepSeek]

[PDF] hal.science

Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org

Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

Uložit Citovat Počet citací tohoto článku: 918 Související články Všechny verze (počet: 18)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Hawkes processes for events in social media

MA Rizoiu, Y Lee, S Mishra, L ** speech in a diarization system.
First, we detail a neural Long Short-Term Memory-based architecture for overlap detection …

Uložit Citovat Počet citací tohoto článku: 127 Související články Všechny verze (počet: 4)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Deep learning for video classification and captioning

Z Wu, T Yao, Y Fu, YG Jiang - Frontiers of multimedia research, 2017 - dl.acm.org

Today's digital contents are inherently multimedia: text, audio, image, video, and so on.
Video, in particular, has become a new way of communication between Internet users with …

Uložit Citovat Počet citací tohoto článku: 167 Související články Všechny verze (počet: 6)

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

GPU-accelerated guided source separation for meeting transcription

D Raj, D Povey, S Khudanpur - ar** speech for
speaker diarization. Since speech and natural language tasks often benefit from ensemble …

Uložit Citovat Počet citací tohoto článku: 87 Související články Všechny verze (počet: 10)

[Free GPT-4]
[DeepSeek]

[PDF] googleapis.com

Speech recognition model construction method, speech recognition method, computer system, speech recognition apparatus, program, and recording medium

G Kurata, T Nagano, M Suzuki, R Tachibana - US Patent 9,812,122, 2017 - Google Patents

(57) ABSTRACT A construction method for a speech recognition model, in which a computer
system includes; a step of acquiring alignment between speech of each of a plurality of …

Uložit Citovat Počet citací tohoto článku: 130 Související články Všechny verze (počet: 4) Archiv

Vytvořit upozornění

Citovat

Rozšířené vyhledávání

Uloženo do Mojí knihovny

Overlapped speech detection for improved speaker diarization in multiparty meetings

A review of speaker diarization: Recent advances with deep learning

Speaker recognition based on deep learning: An overview

A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

Speaker diarization: A review of recent research

Hawkes processes for events in social media

Deep learning for video classification and captioning

GPU-accelerated guided source separation for meeting transcription

Speech recognition model construction method, speech recognition method, computer system, speech recognition apparatus, program, and recording medium