Sparks of large audio models: A survey and outlook
This survey paper provides a comprehensive overview of the recent advancements and
challenges in applying large language models to the field of audio signal processing. Audio …
High fidelity neural audio compression
We introduce a state-of-the-art real-time, high-fidelity, audio codec leveraging neural
networks. It consists in a streaming encoder-decoder architecture with quantized latent …
Soundstream: An end-to-end neural audio codec
We present SoundStream, a novel neural audio codec that can efficiently compress speech,
music and general audio at bitrates normally targeted by speech-tailored codecs …
Universal speech enhancement with score-based diffusion
Removing background noise from speech audio has been the subject of considerable effort,
especially in recent years due to the rise of virtual communication and amateur recordings …
Funcodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec
This paper presents FunCodec, a fundamental neural speech codec toolkit, which is an
extension of the open-source speech processing toolkit FunASR. FunCodec provides …
Tramba: A hybrid transformer and mamba architecture for practical audio and bone conduction speech super resolution and enhancement on mobile and wearable …
We propose TRAMBA, a hybrid transformer and Mamba architecture for acoustic and bone
conduction speech enhancement, suitable for mobile and wearable platforms. Bone …
HILCodec: High-Fidelity and Lightweight Neural Audio Codec
The recent advancement of end-to-end neural audio codecs enables compressing audio at
very low bitrates while reconstructing the output audio with high fidelity. Nonetheless, such …
Aero: Audio super resolution in the spectral domain
We present AERO, an audio super-resolution model that processes speech and music
signals in the spectral domain. AERO is based on an encoder-decoder architecture with …
Audio super-resolution with robust speech representation learning of masked autoencoder
This paper proposes Fre-Painter, a high-fidelity audio super-resolution system that utilizes
robust speech representation learning with various masking strategies. Recently, masked …
Hifi++: a unified framework for bandwidth extension and speech enhancement
Generative adversarial networks have recently demonstrated outstanding performance in
neural vocoding outperforming best autoregressive and flow-based models. In this paper …