Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Slow-fast auditory streams for audio recognition
We propose a two-stream convolutional network for audio recognition, that operates on time-
frequency spectrogram inputs. Following similar success in visual recognition, we learn …
frequency spectrogram inputs. Following similar success in visual recognition, we learn …
Listening to sounds of silence for speech denoising
We introduce a deep learning model for speech denoising, a long-standing challenge in
audio analysis arising in numerous applications. Our approach is based on a key …
audio analysis arising in numerous applications. Our approach is based on a key …
Deep prior-based audio inpainting using multi-resolution harmonic convolutional neural networks
In this manuscript, we propose a novel method to perform audio inpainting, ie, the
restoration of audio signals presenting multiple missing parts. Audio inpainting can be …
restoration of audio signals presenting multiple missing parts. Audio inpainting can be …
Catch-a-waveform: Learning to generate audio from a single short example
G Greshler, T Shaham… - Advances in Neural …, 2021 - proceedings.neurips.cc
Abstract Models for audio generation are typically trained on hours of recordings. Here, we
illustrate that capturing the essence of an audio source is typically possible from as little as a …
illustrate that capturing the essence of an audio source is typically possible from as little as a …
I'm sorry for your loss: Spectrally-based audio distances are bad at pitch
Growing research demonstrates that synthetic failure modes imply poor generalization. We
compare commonly used audio-to-audio losses on a synthetic benchmark, measuring the …
compare commonly used audio-to-audio losses on a synthetic benchmark, measuring the …
Hppnet: Modeling the harmonic structure and pitch invariance in piano transcription
While neural network models are making significant progress in piano transcription, they are
becoming more resource-consuming due to requiring larger model size and more …
becoming more resource-consuming due to requiring larger model size and more …
Deep audio waveform prior
Convolutional neural networks contain strong priors for generating natural looking images
[1]. These priors enable image denoising, super resolution, and inpainting in an …
[1]. These priors enable image denoising, super resolution, and inpainting in an …
[PDF][PDF] Wavelet networks: Scale equivariant learning from raw waveforms
Inducing symmetry equivariance in deep neural architectures has resolved into improved
data efficiency and generalization. In this work, we utilize the concept of scale and …
data efficiency and generalization. In this work, we utilize the concept of scale and …
Denoising cosine similarity: A theory-driven approach for efficient representation learning
T Nakagawa, Y Sanada, H Waida, Y Zhang, Y Wada… - Neural Networks, 2024 - Elsevier
Abstract Representation learning has been increasing its impact on the research and
practice of machine learning, since it enables to learn representations that can apply to …
practice of machine learning, since it enables to learn representations that can apply to …
Anomaly detection from a frequency perspective: M-band wavelet packet anomaly detection network
Anomaly detection is a task of identifying samples that significantly differ from the majority.
However, most typical anomaly detection methods often prioritize accuracy over …
However, most typical anomaly detection methods often prioritize accuracy over …