[PDF][PDF] Open-unmix-a reference implementation for music source separation

FR Stöter, S Uhlich, A Liutkus… - Journal of Open Source …, 2019 - joss.theoj.org
Music source separation is the task of decomposing music into its constitutive components,
eg, yielding separated stems for the vocals, bass, and drums. Such a separation has many …

In-situ crack and keyhole pore detection in laser directed energy deposition through acoustic signal and deep learning

L Chen, X Yao, C Tan, W He, J Su, F Weng… - Additive …, 2023 - Elsevier
Cracks and keyhole pores are detrimental defects in alloys produced by laser directed
energy deposition (LDED). Laser-material interaction sound may hold information about …

Asteroid: the PyTorch-based audio source separation toolkit for researchers

M Pariente, S Cornell, J Cosentino… - arxiv preprint arxiv …, 2020 - arxiv.org
This paper describes Asteroid, the PyTorch-based audio source separation toolkit for
researchers. Inspired by the most successful neural source separation systems, it provides …

ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration

C Li, J Shi, W Zhang, AS Subramanian… - 2021 IEEE Spoken …, 2021 - ieeexplore.ieee.org
We present ESPnet-SE, which is designed for the quick development of speech
enhancement and speech separation systems in a single framework, along with the optional …

Move2hear: Active audio-visual source separation

S Majumder, Z Al-Halah… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We introduce the active audio-visual source separation problem, where an agent must move
intelligently in order to better isolate the sounds coming from an object of interest in its …

ESPnet-SE++: Speech enhancement for robust speech recognition, translation, and understanding

YJ Lu, X Chang, C Li, W Zhang, S Cornell, Z Ni… - arxiv preprint arxiv …, 2022 - arxiv.org
This paper presents recent progress on integrating speech separation and enhancement
(SSE) into the ESPnet toolkit. Compared with the previous ESPnet-SE work, numerous …

[HTML][HTML] Automating medical simulations

S Gershov, D Braunold, R Spektor, A Ioscovich… - Journal of Biomedical …, 2023 - Elsevier
Objective This study aims to explore speech as an alternative modality for human activity
recognition (HAR) in medical settings. While current HAR technologies rely on video and …

[PDF][PDF] mirdata: Software for Reproducible Usage of Datasets.

RM Bittner, M Fuentes, D Rubinstein, A Jansson… - ISMIR, 2019 - archives.ismir.net
There are a number of efforts in the MIR community towards increased reproducibility, such
as creating more open datasets, publishing code, and the use of common software libraries …

In-situ acoustic monitoring of direct energy deposition process with deep learning-assisted signal denoising

L Chen, X Yao, SK Moon - Materials Today: Proceedings, 2022 - Elsevier
In-situ monitoring is crucial for detecting process anomalies and ensuring part quality in
additive manufacturing. Acoustic-based monitoring techniques offer extra benefits such as …

Adversarial attacks on audio source separation

N Takahashi, S Inoue, Y Mitsufuji - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Despite the excellent performance of neural-network-based audio source separation
methods and their wide range of applications, their robustness against intentional attacks …