SpeechBrain: A general-purpose speech toolkit
SpeechBrain is an open-source and all-in-one speech toolkit. It is designed to facilitate the
research and development of neural speech processing technologies by being simple …
research and development of neural speech processing technologies by being simple …
Attention is all you need in speech separation
Recurrent Neural Networks (RNNs) have long been the dominant architecture in sequence-
to-sequence learning. RNNs, however, are inherently sequential models that do not allow …
to-sequence learning. RNNs, however, are inherently sequential models that do not allow …
Torchaudio: Building blocks for audio and speech processing
This document describes version 0.10 of TorchAudio: building blocks for machine learning
applications in the audio and speech processing domain. The objective of TorchAudio is to …
applications in the audio and speech processing domain. The objective of TorchAudio is to …
Librimix: An open-source dataset for generalizable speech separation
In recent years, wsj0-2mix has become the reference dataset for single-channel speech
separation. Most deep learning-based speech separation models today are benchmarked …
separation. Most deep learning-based speech separation models today are benchmarked …
Investigating self-supervised learning for speech enhancement and separation
Speech enhancement and separation are two fundamental tasks for robust speech
processing. Speech enhancement suppresses background noise while speech separation …
processing. Speech enhancement suppresses background noise while speech separation …
How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR
It is challenging to improve automatic speech recognition (ASR) performance in noisy
conditions with single-channel speech enhancement (SE). In this paper, we investigate the …
conditions with single-channel speech enhancement (SE). In this paper, we investigate the …
ESPnet-SE: End-to-end speech enhancement and separation toolkit designed for ASR integration
We present ESPnet-SE, which is designed for the quick development of speech
enhancement and speech separation systems in a single framework, along with the optional …
enhancement and speech separation systems in a single framework, along with the optional …
Espnet2-tts: Extending the edge of tts research
This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS) toolkit.
ESPnet2-TTS extends our earlier version, ESPnet-TTS, by adding many new features …
ESPnet2-TTS extends our earlier version, ESPnet-TTS, by adding many new features …
Whole genome deconvolution unveils Alzheimer's resilient epigenetic signature
Abstract Assay for Transposase Accessible Chromatin by sequencing (ATAC-seq)
accurately depicts the chromatin regulatory state and altered mechanisms guiding gene …
accurately depicts the chromatin regulatory state and altered mechanisms guiding gene …
The 2020 espnet update: new features, broadened applications, performance improvements, and future plans
This paper describes the recent development of ESPnet (https://github. com/espnet/espnet),
an end-to-end speech processing toolkit. This project was initiated in December 2017 to …
an end-to-end speech processing toolkit. This project was initiated in December 2017 to …