Transformers in speech processing: A survey

S Latif, A Zaidi, H Cuayahuitl, F Shamshad… - arxiv preprint arxiv …, 2023 - arxiv.org
The remarkable success of transformers in the field of natural language processing has
sparked the interest of the speech-processing community, leading to an exploration of their …

Superbizarre is not superb: Derivational morphology improves BERT's interpretation of complex words

V Hofmann, JB Pierrehumbert, H Schütze - arxiv preprint arxiv …, 2021 - arxiv.org
How does the input segmentation of pretrained language models (PLMs) affect their
interpretations of complex words? We present the first study investigating this question …

MetaTransformer: deep metagenomic sequencing read classification using self-attention models

A Wichmann, E Buschong, A Müller… - NAR Genomics and …, 2023 - academic.oup.com
Deep learning has emerged as a paradigm that revolutionizes numerous domains of
scientific research. Transformers have been utilized in language modeling outperforming …

Cross-sentence neural language models for conversational speech recognition

SH Chiu, TH Lo, B Chen - 2021 International Joint Conference …, 2021 - ieeexplore.ieee.org
An important research direction in automatic speech recognition (ASR) has centered around
the development of effective methods to rerank the output hypotheses of an ASR system with …

Speech Recognition Transformers: Topological-lingualism Perspective

S Singh, M Singh, V Kadyan - arxiv preprint arxiv:2408.14991, 2024 - arxiv.org
Transformers have evolved with great success in various artificial intelligence tasks. Thanks
to our recent prevalence of self-attention mechanisms, which capture long-term …

Compressing transformer-based asr model by task-driven loss and attention-based multi-level feature distillation

Y Lv, L Wang, M Ge, S Li, C Ding, L Pan… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
The current popular knowledge distillation (KD) methods effectively compress the
transformer-based end-to-end speech recognition model. However, existing methods fail to …

Effective cross-utterance language modeling for conversational speech recognition

BC Yan, HW Wang, SH Chiu, HS Chiu… - … Joint Conference on …, 2022 - ieeexplore.ieee.org
Conversational speech normally is embodied with loose syntactic structures at the utterance
level but simultaneously exhibits topical coherence relations across consecutive utterances …

Computational investigations of derivational morphology

V Hofmann - 2023 - ora.ox.ac.uk
The notion that it is difficult to make predictions about derivational morphology has been a
recurring theme in morphological research over the last decades. It can be unclear whether …

Speech recognition for conversational finnish

A Moisio - 2021 - aaltodoc.aalto.fi
Spontaneous conversational Finnish is a challenging type of speech to recognise due to
frequent dysfluencies in sentence structure and the use of various informal wordforms. This …

LSTM-XL: Attention Enhanced Long-Term Memory for LSTM Cells

T Grósz, M Kurimo - International Conference on Text, Speech, and …, 2021 - Springer
Abstract Long Short-Term Memory (LSTM) cells, frequently used in state-of-the-art language
models, struggle with long sequences of inputs. One major problem in their design is that …