- Academic Search

V Pratap, A Tjandra, B Shi, P Tomasello, A Babu… - Journal of Machine …, 2024 - jmlr.org

Expanding the language coverage of speech technology has the potential to improve
access to information for many more people. However, current speech technology is …

Speichern Zitieren Zitiert von: 292 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

High fidelity neural audio compression

A Défossez, J Copet, G Synnaeve, Y Adi - arxiv preprint arxiv:2210.13438, 2022 - arxiv.org

We introduce a state-of-the-art real-time, high-fidelity, audio codec leveraging neural
networks. It consists in a streaming encoder-decoder architecture with quantized latent …

Speichern Zitieren Zitiert von: 685 Ähnliche Artikel Alle 3 Versionen HTML-Version

[Free GPT-4]

[PDF] ssrn.com

Enterprise data management: Types, sources, and real-time applications to enhance business performance-a systematic review

K Ngcobo, S Bhengu, A Mudau, B Thango… - Systematic Review …, 2024 - papers.ssrn.com

In the current digital era, Enterprise Data Management (EDM) plays a pivotal role in
enhancing business performance by ensuring efficient handling of diverse data sources and …

Speichern Zitieren Zitiert von: 34 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] neurips.cc

Voicebox: Text-guided multilingual universal speech generation at scale

M Le, A Vyas, B Shi, B Karrer, L Sari… - Advances in neural …, 2024 - proceedings.neurips.cc

Large-scale generative models such as GPT and DALL-E have revolutionized the research
community. These models not only generate high fidelity outputs, but are also generalists …

Speichern Zitieren Zitiert von: 242 Ähnliche Artikel Alle 8 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

SpeechBrain: A general-purpose speech toolkit

M Ravanelli, T Parcollet, P Plantinga, A Rouhe… - arxiv preprint arxiv …, 2021 - arxiv.org

SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …

Speichern Zitieren Zitiert von: 746 Ähnliche Artikel Alle 5 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

Conditional diffusion probabilistic model for speech enhancement

YJ Lu, ZQ Wang, S Watanabe… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Speech enhancement is a critical component of many user-oriented audio applications, yet
current systems still suffer from distorted and unnatural outputs. While generative models …

Speichern Zitieren Zitiert von: 181 Ähnliche Artikel Alle 7 Versionen

[Free GPT-4]

[PDF] arxiv.org

Metricgan+: An improved version of metricgan for speech enhancement

SW Fu, C Yu, TA Hsieh, P Plantinga… - arxiv preprint arxiv …, 2021 - arxiv.org

The discrepancy between the cost function used for training a speech enhancement model
and human auditory perception usually makes the quality of enhanced speech …

Speichern Zitieren Zitiert von: 245 Ähnliche Artikel Alle 9 Versionen HTML-Version

[Free GPT-4]

[PDF] arxiv.org

DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …

Speichern Zitieren Zitiert von: 308 Ähnliche Artikel Alle 4 Versionen

[Free GPT-4]

[PDF] arxiv.org

DNSMOS P. 835: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. We …

Speichern Zitieren Zitiert von: 213 Ähnliche Artikel Alle 3 Versionen

[Free GPT-4]

[PDF] springer.com

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis

P Ochieng - Artificial Intelligence Review, 2023 - Springer

Deep neural networks (DNN) techniques have become pervasive in domains such as
natural language processing and computer vision. They have achieved great success in …

Speichern Zitieren Zitiert von: 27 Ähnliche Artikel Alle 8 Versionen

Alert erstellen

Zitieren

Erweiterte Suche

In „Meine Bibliothek“ gespeichert

Real time speech enhancement in the waveform domain

Scaling speech technology to 1,000+ languages

High fidelity neural audio compression

Enterprise data management: Types, sources, and real-time applications to enhance business performance-a systematic review

Voicebox: Text-guided multilingual universal speech generation at scale

SpeechBrain: A general-purpose speech toolkit

Conditional diffusion probabilistic model for speech enhancement

Metricgan+: An improved version of metricgan for speech enhancement

DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

DNSMOS P. 835: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

Deep neural network techniques for monaural speech enhancement and separation: state of the art analysis