محقق Google

TJ Park, N Kanda, D Dimitriadis, KJ Han… - Computer Speech & …, 2022‏ - Elsevier‏

Speaker diarization is a task to label audio or video recordings with classes that correspond
to speaker identity, or in short, a task to identify “who spoke when”. In the early years …‏

ذخیره ارجاع بیان شده در 429 یافته مقاله‌های مربوط تمام نسخه‌های 7

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

An overview of deep-learning-based audio-visual speech enhancement and separation‏

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021‏ - ieeexplore.ieee.org‏

Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …‏

ذخیره ارجاع بیان شده در 304 یافته مقاله‌های مربوط تمام نسخه‌های 6

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement‏

Y Hu, Y Liu, S Lv, M ** speakers using
a single audio channel has brought us closer to solving the cocktail party problem. However …‏

ذخیره ارجاع بیان شده در 397 یافته مقاله‌های مربوط تمام نسخه‌های 10 نسخه HTML

ایجاد هشدار

ارجاع

جستجوی پیشرفته

در «کتابخانه من» ذخیره شد

Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks

A review of speaker diarization: Recent advances with deep learning‏

An overview of deep-learning-based audio-visual speech enhancement and separation‏

DCCRN: Deep complex convolution recurrent network for phase-aware speech enhancement‏