Novel speech recognition systems applied to forensics within child exploitation: Wav2vec2. 0 vs. whisper

JC Vásquez-Correa, A Álvarez Muniain - Sensors, 2023 - mdpi.com
The growth in online child exploitation material is a significant challenge for European Law
Enforcement Agencies (LEAs). One of the most important sources of such online information …

Domain adaptation speech-to-text for low-resource European portuguese using deep learning

E Medeiros, L Corado, L Rato, P Quaresma… - Future Internet, 2023 - mdpi.com
Automatic speech recognition (ASR), commonly known as speech-to-text, is the process of
transcribing audio recordings into text, ie, transforming speech into the respective sequence …

ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion

E Casanova, C Shulby, A Korolev, AC Junior… - arxiv preprint arxiv …, 2022 - arxiv.org
We explore cross-lingual multi-speaker speech synthesis and cross-lingual voice
conversion applied to data augmentation for automatic speech recognition (ASR) systems in …

A large dataset of spontaneous speech with the accent spoken in são paulo for automatic speech recognition evaluation

R Lima, SE Leal, AC Junior, SM Aluísio - Brazilian Conference on …, 2024 - Springer
We present a freely available spontaneous speech corpus for the Brazilian Portuguese
language and report preliminary automatic speech recognition (ASR) results, using both the …

Bringing nurc/sp to digital life: the role of open-source automatic speech recognition models

LRS Gris, AC Junior, VG Santos, BAP Dias… - arxiv preprint arxiv …, 2022 - arxiv.org
The NURC Project that started in 1969 to study the cultured linguistic urban norm spoken in
five Brazilian capitals, was responsible for compiling a large corpus for each capital. The …

Development of a diacritic-aware large vocabulary automatic speech recognition for Hausa language

AM Abubakar, D Gupta, S Vekkot - International Journal of Speech …, 2024 - Springer
Research on voice recognition for African languages is limited due to the scarcity of digital
resources for training and adaptation, despite its broad usefulness. The Hausa language …

MuPe Life Stories Dataset: Spontaneous Speech in Brazilian Portuguese with a Case Study Evaluation on ASR Bias against Speakers Groups and Topic Modeling

SE Leal, AC Junior, R Marcacini… - Proceedings of the …, 2025 - aclanthology.org
Recently, several public datasets for automatic speech recognition (ASR) in Brazilian
Portuguese (BP) have been released, improving ASR systems performance. However, these …

Speech recognition model design for Sundanese language using WAV2VEC 2.0

A Cryssiover, A Zahra - International Journal of Speech Technology, 2024 - Springer
Indonesia has a variety of languages, one of which is Sundanese. Sundanese is a regional
language from Indonesia that has the potential to become extinct. One way to prevent …

A probabilistically-oriented analysis of the performance of asr systems for brazilian radios and tvs

DM de Azevedo, GS Rodrigues, M Ladeira - Brazilian Conference on …, 2022 - Springer
With the use of neural network-based technologies, Automatic Speech Recognition (ASR)
systems for Brazilian Portuguese (BP) have shown great progress in the last few years …

[PDF][PDF] Domain Specific Wav2vec 2.0 Fine-tuning for the SE&R 2022 Challenge.

AIS Ferreira, G dos Reis Oliveira - SE&R@ PROPOR, 2022 - ceur-ws.org
This paper presents our efforts to build a robust ASR model for the shared task Automatic
Speech Recognition for spontaneous and prepared speech & Speech Emotion Recognition …