Learning asr pathways: A sparse multilingual asr model

M Yang, A Tjandra, C Liu, D Zhang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Neural network pruning compresses automatic speech recognition (ASR) models effectively.
However, in multilingual ASR, language-agnostic pruning may lead to severe performance …

Human trafficking in social networks: A review of machine learning techniques

M Bermeo, S Escobar, E Cuenca - Conference on Information and …, 2023 - Springer
Human trafficking is a severe problem worldwide and social media platforms have emerged
as a potential tool to detect and prevent this crime. Machine learning (ML) algorithms have …

Dynamic chunk convolution for unified streaming and non-streaming conformer asr

X Li, G Huybrechts, S Ronanki, J Farris… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Recently, there has been an increasing interest in unifying streaming and non-streaming
speech recognition models to reduce development, training and deployment cost. The best …

Ufo2: A unified pre-training framework for online and offline speech recognition

L Fu, S Li, Q Li, L Deng, F Li, L Fan… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
In this paper, we propose a Unified pre-training Framework for Online and Offline (UFO2)
Automatic Speech Recognition (ASR), which 1) simplifies the two separate training …

TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

Y Shangguan, H Yang, D Li, C Wu… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Automatic Speech Recognition (ASR) models need to be optimized for specific hardware
before they can be deployed on devices. This can be done by tuning the model's …

Dynamic ASR pathways: An adaptive masking approach towards efficient pruning of a multilingual ASR model

J **e, K Li, J Guo, A Tjandra… - ICASSP 2024-2024 …, 2024 - ieeexplore.ieee.org
Neural network pruning offers an effective method for compressing a multilingual automatic
speech recognition (ASR) model with minimal performance loss. However, it entails several …

Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer

K Shim, J Lee, S Chang, K Hwang - arxiv preprint arxiv:2308.16415, 2023 - arxiv.org
Streaming automatic speech recognition (ASR) models are restricted from accessing future
context, which results in worse performance compared to the non-streaming models. To …

[HTML][HTML] A metric-driven approach to conformer layer pruning for efficient asr inference

D Bekal, K Gopalakrishnan, K Mundnich, S Ronanki… - 2023 - amazon.science
Conformer-based end-to-end automatic speech recognition (ASR) models have gained
popularity in recent years due to their exceptional performance at scale. However, there are …

[PDF][PDF] A Metric-Driven Approach to Conformer Layer Pruning for Efficient ASR Inference

DBKGK Mundnich, S Ronanki, SBK Kirchhoff - isca-archive.org
Conformer-based end-to-end automatic speech recognition (ASR) models have gained
popularity in recent years due to their exceptional performance at scale. However, there are …

[PDF][PDF] Human Trafficking in Social Networks: a Review of Machine Learning Techniques

E Cuenca - researchgate.net
Human trafficking is a severe problem worldwide and social media platforms have emerged
as a potential tool to detect and prevent this crime. Machine learning (ML) algorithms have …