Understanding deep convolutional networks

S Mallat - … Transactions of the Royal Society A …, 2016 - royalsocietypublishing.org
Deep convolutional networks provide state-of-the-art classifications and regressions results
over many high-dimensional problems. We review their architecture, which scatters data …

A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds

F Alías, JC Socoró, X Sevillano - Applied Sciences, 2016 - mdpi.com
Endowing machines with sensing capabilities similar to those of humans is a prevalent
quest in engineering and computer science. In the pursuit of making computers sense their …

Deep scattering spectrum

J Andén, S Mallat - IEEE Transactions on Signal Processing, 2014 - ieeexplore.ieee.org
A scattering transform defines a locally translation invariant representation which is stable to
time-war** deformation. It extends MFCC representations by computing modulation …

Power-normalized cepstral coefficients (PNCC) for robust speech recognition

C Kim, RM Stern - IEEE/ACM Transactions on audio, speech …, 2016 - ieeexplore.ieee.org
This paper presents a new feature extraction algorithm called power normalized Cepstral
coefficients (PNCC) that is motivated by auditory processing. Major new features of PNCC …

Multiresolution spectrotemporal analysis of complex sounds

T Chi, P Ru, SA Shamma - The Journal of the Acoustical Society of …, 2005 - pubs.aip.org
A computational model of auditory analysis is described that is inspired by psychoacoustical
and neurophysiological findings in early and central stages of the auditory system. The …

Towards reconstructing intelligible speech from the human auditory cortex

H Akbari, B Khalighinejad, JL Herrero, AD Mehta… - Scientific reports, 2019 - nature.com
Auditory stimulus reconstruction is a technique that finds the best approximation of the
acoustic stimulus from the population of evoked neural activity. Reconstructing speech from …

The hierarchical cortical organization of human speech processing

WA de Heer, AG Huth, TL Griffiths… - Journal of …, 2017 - Soc Neuroscience
Speech comprehension requires that the brain extract semantic meaning from the spectral
features represented at the cochlea. To investigate this process, we performed an fMRI …

Square deal: Lower bounds and improved relaxations for tensor recovery

C Mu, B Huang, J Wright… - … conference on machine …, 2014 - proceedings.mlr.press
Recovering a low-rank tensor from incomplete information is a recurring problem in signal
processing and machine learning. The most popular convex relaxation of this problem …

Low-rank tensor completion with a new tensor nuclear norm induced by invertible linear transforms

C Lu, X Peng, Y Wei - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
This work studies the low-rank tensor completion problem, which aims to exactly recover a
low-rank tensor from partially observed entries. Our model is inspired by the recently …

Behavioral signal processing: Deriving human behavioral informatics from speech and language

S Narayanan, PG Georgiou - Proceedings of the IEEE, 2013 - ieeexplore.ieee.org
The expression and experience of human behavior are complex and multimodal and
characterized by individual and contextual heterogeneity and variability. Speech and …