[PDF][PDF] Open-unmix-a reference implementation for music source separation
Music source separation is the task of decomposing music into its constitutive components,
eg, yielding separated stems for the vocals, bass, and drums. Such a separation has many …
eg, yielding separated stems for the vocals, bass, and drums. Such a separation has many …
In-situ crack and keyhole pore detection in laser directed energy deposition through acoustic signal and deep learning
Cracks and keyhole pores are detrimental defects in alloys produced by laser directed
energy deposition (LDED). Laser-material interaction sound may hold information about …
energy deposition (LDED). Laser-material interaction sound may hold information about …
Asteroid: the PyTorch-based audio source separation toolkit for researchers
This paper describes Asteroid, the PyTorch-based audio source separation toolkit for
researchers. Inspired by the most successful neural source separation systems, it provides …
researchers. Inspired by the most successful neural source separation systems, it provides …
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration
We present ESPnet-SE, which is designed for the quick development of speech
enhancement and speech separation systems in a single framework, along with the optional …
enhancement and speech separation systems in a single framework, along with the optional …
Move2hear: Active audio-visual source separation
S Majumder, Z Al-Halah… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We introduce the active audio-visual source separation problem, where an agent must move
intelligently in order to better isolate the sounds coming from an object of interest in its …
intelligently in order to better isolate the sounds coming from an object of interest in its …
ESPnet-SE++: Speech enhancement for robust speech recognition, translation, and understanding
This paper presents recent progress on integrating speech separation and enhancement
(SSE) into the ESPnet toolkit. Compared with the previous ESPnet-SE work, numerous …
(SSE) into the ESPnet toolkit. Compared with the previous ESPnet-SE work, numerous …
[HTML][HTML] Automating medical simulations
S Gershov, D Braunold, R Spektor, A Ioscovich… - Journal of Biomedical …, 2023 - Elsevier
Objective This study aims to explore speech as an alternative modality for human activity
recognition (HAR) in medical settings. While current HAR technologies rely on video and …
recognition (HAR) in medical settings. While current HAR technologies rely on video and …
[PDF][PDF] mirdata: Software for Reproducible Usage of Datasets.
There are a number of efforts in the MIR community towards increased reproducibility, such
as creating more open datasets, publishing code, and the use of common software libraries …
as creating more open datasets, publishing code, and the use of common software libraries …
In-situ acoustic monitoring of direct energy deposition process with deep learning-assisted signal denoising
In-situ monitoring is crucial for detecting process anomalies and ensuring part quality in
additive manufacturing. Acoustic-based monitoring techniques offer extra benefits such as …
additive manufacturing. Acoustic-based monitoring techniques offer extra benefits such as …
Adversarial attacks on audio source separation
N Takahashi, S Inoue, Y Mitsufuji - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Despite the excellent performance of neural-network-based audio source separation
methods and their wide range of applications, their robustness against intentional attacks …
methods and their wide range of applications, their robustness against intentional attacks …