Federated learning review: Fundamentals, enabling technologies, and future applications

S Banabilah, M Aloqaily, E Alsayed, N Malik… - Information processing & …, 2022 - Elsevier
Federated Learning (FL) has been foundational in improving the performance of a wide
range of applications since it was first introduced by Google. Some of the most prominent …

[PDF][PDF] Recent advances in end-to-end automatic speech recognition

J Li - APSIPA Transactions on Signal and Information …, 2022 - nowpublishers.com
Recently, the speech community is seeing a significant trend of moving from deep neural
network based hybrid modeling to end-to-end (E2E) modeling for automatic speech …

Google usm: Scaling automatic speech recognition beyond 100 languages

Y Zhang, W Han, J Qin, Y Wang, A Bapna… - arxiv preprint arxiv …, 2023 - arxiv.org
We introduce the Universal Speech Model (USM), a single large model that performs
automatic speech recognition (ASR) across 100+ languages. This is achieved by pre …

Training spiking neural networks using lessons from deep learning

JK Eshraghian, M Ward, EO Neftci… - Proceedings of the …, 2023 - ieeexplore.ieee.org
The brain is the perfect place to look for inspiration to develop more efficient neural
networks. The inner workings of our synapses and neurons provide a glimpse at what the …

Class-incremental learning by knowledge distillation with adaptive feature consolidation

M Kang, J Park, B Han - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
We present a novel class incremental learning approach based on deep neural networks,
which continually learns new tasks with limited memory for storing examples in the previous …

A review on the attention mechanism of deep learning

Z Niu, G Zhong, H Yu - Neurocomputing, 2021 - Elsevier
Attention has arguably become one of the most important concepts in the deep learning
field. It is inspired by the biological systems of humans that tend to focus on the distinctive …

A survey on neural speech synthesis

X Tan, T Qin, F Soong, TY Liu - arxiv preprint arxiv:2106.15561, 2021 - arxiv.org
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …

A unifying review of deep and shallow anomaly detection

L Ruff, JR Kauffmann, RA Vandermeulen… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Deep learning approaches to anomaly detection (AD) have recently improved the state of
the art in detection performance on complex data sets, such as large collections of images or …

Branchformer: Parallel mlp-attention architectures to capture local and global context for speech recognition and understanding

Y Peng, S Dalmia, I Lane… - … Conference on Machine …, 2022 - proceedings.mlr.press
Conformer has proven to be effective in many speech processing tasks. It combines the
benefits of extracting local dependencies using convolutions and global dependencies …

Earthquake transformer—an attentive deep-learning model for simultaneous earthquake detection and phase picking

SM Mousavi, WL Ellsworth, W Zhu, LY Chuang… - Nature …, 2020 - nature.com
Earthquake signal detection and seismic phase picking are challenging tasks in the
processing of noisy data and the monitoring of microearthquakes. Here we present a global …