- Academic Search

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org

Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

保存引用被引用次数：304 相关文章所有 6 个版本

[Free GPT-4]

[PDF] springer.com

Deep audio-visual learning: A survey

H Zhu, MD Luo, R Wang, AH Zheng, R He - International Journal of …, 2021 - Springer

Audio-visual learning, aimed at exploiting the relationship between audio and visual
modalities, has drawn considerable attention since deep learning started to be used …

保存引用被引用次数：191 相关文章所有 12 个版本

[Free GPT-4]

[PDF] arxiv.org

Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation

Y Luo, Z Chen, T Yoshioka - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org

Recent studies in deep learning-based speech separation have proven the superiority of
time-domain approaches to conventional time-frequency-based methods. Unlike the time …

保存引用被引用次数：889 相关文章所有 6 个版本

[Free GPT-4]

[PDF] arxiv.org

Dual-path transformer network: Direct context-aware modeling for end-to-end monaural speech separation

J Chen, Q Mao, D Liu - ar** speakers using
a single audio channel has brought us closer to solving the cocktail party problem. However …

保存引用被引用次数：390 相关文章所有 11 个版本 HTML 版

创建快讯

引用

高级搜索

已保存到“我的图书馆”

Single-channel multi-speaker separation using deep clustering

An overview of deep-learning-based audio-visual speech enhancement and separation

Deep audio-visual learning: A survey

Dual-path rnn: efficient long sequence modeling for time-domain single-channel speech separation

Dual-path transformer network: Direct context-aware modeling for end-to-end monaural speech separation