High fidelity neural audio compression
We introduce a state-of-the-art real-time, high-fidelity, audio codec leveraging neural
networks. It consists in a streaming encoder-decoder architecture with quantized latent …
networks. It consists in a streaming encoder-decoder architecture with quantized latent …
Soundstream: An end-to-end neural audio codec
We present SoundStream, a novel neural audio codec that can efficiently compress speech,
music and general audio at bitrates normally targeted by speech-tailored codecs …
music and general audio at bitrates normally targeted by speech-tailored codecs …
Quality of experience in telemeetings and videoconferencing: a comprehensive survey
J Skowronek, A Raake, GH Berndtsson… - IEEE …, 2022 - ieeexplore.ieee.org
Telemeetings such as audiovisual conferences or virtual meetings play an increasingly
important role in our professional and private lives. For that reason, system developers and …
important role in our professional and private lives. For that reason, system developers and …
Towards audio language modeling-an overview
Neural audio codecs are initially introduced to compress audio data into compact codes to
reduce transmission latency. Researchers recently discovered the potential of codecs as …
reduce transmission latency. Researchers recently discovered the potential of codecs as …
Audiodec: An open-source streaming high-fidelity neural audio codec
A good audio codec for live applications such as telecommunication is characterized by
three key properties:(1) compression, ie the bitrate that is required to transmit the signal …
three key properties:(1) compression, ie the bitrate that is required to transmit the signal …
Language-codec: Reducing the gaps between discrete codec representation and speech language models
In recent years, large language models have achieved significant success in generative
tasks (eg, speech cloning and audio generation) related to speech, audio, music, and other …
tasks (eg, speech cloning and audio generation) related to speech, audio, music, and other …
Generative speech coding with predictive variance regularization
The recent emergence of machine-learning based generative models for speech suggests a
significant reduction in bit rate for speech codecs is possible. However, the performance of …
significant reduction in bit rate for speech codecs is possible. However, the performance of …
Funcodec: A fundamental, reproducible and integrable open-source toolkit for neural speech codec
This paper presents FunCodec, a fundamental neural speech codec toolkit, which is an
extension of the open-source speech processing toolkit FunASR. FunCodec provides …
extension of the open-source speech processing toolkit FunASR. FunCodec provides …
Bigcodec: Pushing the limits of low-bitrate neural speech codec
We present BigCodec, a low-bitrate neural speech codec. While recent neural speech
codecs have shown impressive progress, their performance significantly deteriorates at low …
codecs have shown impressive progress, their performance significantly deteriorates at low …
Lmcodec: A low bitrate speech codec with causal transformer models
We introduce LMCodec, a causal neural speech codec that provides high quality audio at
very low bitrates. The backbone of the system is a causal convolutional codec that encodes …
very low bitrates. The backbone of the system is a causal convolutional codec that encodes …