Supervised speech separation based on deep learning: An overview

DL Wang, J Chen - IEEE/ACM transactions on audio, speech …, 2018 - ieeexplore.ieee.org
Speech separation is the task of separating target speech from background interference.
Traditionally, speech separation is studied as a signal processing problem. A more recent …

Flexible piezoelectric acoustic sensors and machine learning for speech processing

YH Jung, SK Hong, HS Wang, JH Han… - Advanced …, 2020 - Wiley Online Library
Flexible piezoelectric acoustic sensors have been developed to generate multiple sound
signals with high sensitivity, shifting the paradigm of future voice technologies. Speech …

SPICE: Self-supervised pitch estimation

B Gfeller, C Frank, D Roblek, M Sharifi… - … on Audio, Speech …, 2020 - ieeexplore.ieee.org
We propose a model to estimate the fundamental frequency in monophonic audio, often
referred to as pitch estimation. We acknowledge the fact that obtaining ground truth …

[BOOK] The digital transformation of labor

A Larsson, R Teigland - 2020 - library.oapen.org
Through a series of studies, the overarching aim of this book is to investigate if and how the
digitalization/digital transformation process causes (or may cause) the autonomy of various …

An analysis of state-of-the-art activation functions for supervised deep neural network

A Nguyen, K Pham, D Ngo, T Ngo… - … conference on system …, 2021 - ieeexplore.ieee.org
This paper provides an analysis of state-of-the-art activation functions with respect to
supervised classification of deep neural network. These activation functions comprise of …

Foundation models for music: A survey

Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, foundation models (FMs) such as large language models (LLMs) and latent
diffusion models (LDMs) have profoundly impacted diverse sectors, including music. This …

Neuromorphic engineering: In memory of Misha Mahowald

C Mead - Neural Computation, 2023 - ieeexplore.ieee.org
We review the coevolution of hardware and software dedicated to neuromorphic systems.
From modest beginnings, these disciplines have become central to the larger field of …

Deep cepstrum-wavelet autoencoder: A novel intelligent sonar classifier

H Jia, M Khishe, M Mohammadi, S Rashidi - Expert Systems with …, 2022 - Elsevier
Different marine vessels belonging to the same class may have different and time-varying
radiated noise due to different and changing machinery configurations. Further, the time …

On the speech envelope in the cortical tracking of speech

MF Issa, I Khan, M Ruzzoli, N Molinaro, M Lizarazu - NeuroImage, 2024 - Elsevier
The synchronization between the speech envelope and neural activity in auditory regions,
referred to as cortical tracking of speech (CTS), plays a key role in speech processing. The …

A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications

D Baby, A Van Den Broucke, S Verhulst - Nature machine intelligence, 2021 - nature.com
Auditory models are commonly used as feature extractors for automatic speech-recognition
systems or as front-ends for robotics, machine-hearing and hearing-aid applications …