Understanding deep convolutional networks
S Mallat - … Transactions of the Royal Society A …, 2016 - royalsocietypublishing.org
Deep convolutional networks provide state-of-the-art classifications and regressions results
over many high-dimensional problems. We review their architecture, which scatters data …
over many high-dimensional problems. We review their architecture, which scatters data …
A review of physical and perceptual feature extraction techniques for speech, music and environmental sounds
Endowing machines with sensing capabilities similar to those of humans is a prevalent
quest in engineering and computer science. In the pursuit of making computers sense their …
quest in engineering and computer science. In the pursuit of making computers sense their …
Deep scattering spectrum
A scattering transform defines a locally translation invariant representation which is stable to
time-war** deformation. It extends MFCC representations by computing modulation …
time-war** deformation. It extends MFCC representations by computing modulation …
Power-normalized cepstral coefficients (PNCC) for robust speech recognition
This paper presents a new feature extraction algorithm called power normalized Cepstral
coefficients (PNCC) that is motivated by auditory processing. Major new features of PNCC …
coefficients (PNCC) that is motivated by auditory processing. Major new features of PNCC …
Multiresolution spectrotemporal analysis of complex sounds
A computational model of auditory analysis is described that is inspired by psychoacoustical
and neurophysiological findings in early and central stages of the auditory system. The …
and neurophysiological findings in early and central stages of the auditory system. The …
Towards reconstructing intelligible speech from the human auditory cortex
Auditory stimulus reconstruction is a technique that finds the best approximation of the
acoustic stimulus from the population of evoked neural activity. Reconstructing speech from …
acoustic stimulus from the population of evoked neural activity. Reconstructing speech from …
The hierarchical cortical organization of human speech processing
Speech comprehension requires that the brain extract semantic meaning from the spectral
features represented at the cochlea. To investigate this process, we performed an fMRI …
features represented at the cochlea. To investigate this process, we performed an fMRI …
Square deal: Lower bounds and improved relaxations for tensor recovery
Recovering a low-rank tensor from incomplete information is a recurring problem in signal
processing and machine learning. The most popular convex relaxation of this problem …
processing and machine learning. The most popular convex relaxation of this problem …
Low-rank tensor completion with a new tensor nuclear norm induced by invertible linear transforms
This work studies the low-rank tensor completion problem, which aims to exactly recover a
low-rank tensor from partially observed entries. Our model is inspired by the recently …
low-rank tensor from partially observed entries. Our model is inspired by the recently …
Behavioral signal processing: Deriving human behavioral informatics from speech and language
The expression and experience of human behavior are complex and multimodal and
characterized by individual and contextual heterogeneity and variability. Speech and …
characterized by individual and contextual heterogeneity and variability. Speech and …