Stability-based generalization analysis of the asynchronous decentralized SGD

X Deng, T Sun, S Li, D Li - Proceedings of the AAAI conference on …, 2023 - ojs.aaai.org
The generalization ability often determines the success of machine learning algorithms in
practice. Therefore, it is of great theoretical and practical importance to understand and …

Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models

A Haider, X Na, E McDermott, T Ng, Z Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper introduces a novel training framework called Focused Discriminative Training
(FDT) to further improve streaming word-piece end-to-end (E2E) automatic speech …

Ravnest: Decentralized Asynchronous Training on Heterogeneous Devices

AR Menon, U Menon, K Ahirwar - arxiv preprint arxiv:2401.01728, 2024 - arxiv.org
Modern deep learning models, growing larger and more complex, have demonstrated
exceptional generalization and accuracy due to training on huge datasets. This trend is …

A Treatise On FST Lattice Based MMI Training

A Haider, T Ng, Z Huang, X Na, AV Rosti - arxiv preprint arxiv:2210.08918, 2022 - arxiv.org
Maximum mutual information (MMI) has become one of the two de facto methods for
sequence-level training of speech recognition acoustic models. This paper aims to isolate …

Distributed Artificial Intelligence over Edge-Assisted Internet-Of-Things

S Shen - 2022 - search.proquest.com
Serving as the bridge between physical and cyber world, Internet-of-Things (IoT) connects a
sheer volume of objects and humans with the functions of communication, storage, learning …