Federated learning review: Fundamentals, enabling technologies, and future applications
Federated Learning (FL) has been foundational in improving the performance of a wide
range of applications since it was first introduced by Google. Some of the most prominent …
range of applications since it was first introduced by Google. Some of the most prominent …
[PDF][PDF] Recent advances in end-to-end automatic speech recognition
J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …
Google usm: Scaling automatic speech recognition beyond 100 languages
We introduce the Universal Speech Model (USM), a single large model that performs
automatic speech recognition (ASR) across 100+ languages. This is achieved by pre …
automatic speech recognition (ASR) across 100+ languages. This is achieved by pre …
Training spiking neural networks using lessons from deep learning
The brain is the perfect place to look for inspiration to develop more efficient neural
networks. The inner workings of our synapses and neurons provide a glimpse at what the …
networks. The inner workings of our synapses and neurons provide a glimpse at what the …
Class-incremental learning by knowledge distillation with adaptive feature consolidation
We present a novel class incremental learning approach based on deep neural networks,
which continually learns new tasks with limited memory for storing examples in the previous …
which continually learns new tasks with limited memory for storing examples in the previous …
A review on the attention mechanism of deep learning
Attention has arguably become one of the most important concepts in the deep learning
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …
A survey on neural speech synthesis
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …
speech given text, is a hot research topic in speech, language, and machine learning …
A unifying review of deep and shallow anomaly detection
Deep learning approaches to anomaly detection (AD) have recently improved the state of
the art in detection performance on complex data sets, such as large collections of images or …
the art in detection performance on complex data sets, such as large collections of images or …
Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding
Conformer has proven to be effective in many speech processing tasks. It combines the
benefits of extracting local dependencies using convolutions and global dependencies …
benefits of extracting local dependencies using convolutions and global dependencies …
Earthquake transformer—an attentive deep-learning model for simultaneous earthquake detection and phase picking
Earthquake signal detection and seismic phase picking are challenging tasks in the
processing of noisy data and the monitoring of microearthquakes. Here we present a global …
processing of noisy data and the monitoring of microearthquakes. Here we present a global …