- Academic Search

Novel speech recognition systems applied to forensics within child exploitation: Wav2vec2. 0 vs. whisper

JC Vásquez-Correa, A Álvarez Muniain - Sensors, 2023 - mdpi.com

The growth in online child exploitation material is a significant challenge for European Law
Enforcement Agencies (LEAs). One of the most important sources of such online information …

保存引用被引用次数：33 相关文章所有 12 个版本网页快照

[PDF] springer.com

Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

P Gimeno, I Viñals, A Ortega, A Miguel… - EURASIP Journal on …, 2020 - Springer

This paper presents a new approach based on recurrent neural networks (RNN) to the
multiclass audio segmentation task whose goal is to classify an audio signal as speech …

保存引用被引用次数：54 相关文章所有 9 个版本

Analysis of the but diarization system for voxconverse challenge

F Landini, O Glembek, P Matějka… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

This paper describes the system developed by the BUT team for the fourth track of the
VoxCeleb Speaker Recognition Challenge, focusing on diarization on the VoxConverse …

保存引用被引用次数：42 相关文章所有 8 个版本

Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments

S Baghel, S Ramoji, S Jain, PR Chowdhuri… - Speech …, 2024 - Elsevier

In multi-lingual societies, where multiple languages are spoken in a small geographic
vicinity, informal conversations often involve mix of languages. Existing speech technologies …

保存引用被引用次数：9 相关文章所有 4 个版本

A comparison of hybrid and end-to-end ASR systems for the IberSpeech-RTVE 2020 speech-to-text transcription challenge

JM Perero-Codosero, FM Espinoza-Cuadros… - Applied Sciences, 2022 - mdpi.com

This paper describes a comparison between hybrid and end-to-end Automatic Speech
Recognition (ASR) systems, which were evaluated on the IberSpeech-RTVE 2020 Speech …

保存引用被引用次数：14 相关文章所有 5 个版本网页快照

An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies

E Lleida, LJ Rodriguez-Fuentes, J Tejedor, A Ortega… - Applied Sciences, 2023 - mdpi.com

Evaluation campaigns provide a common framework with which the progress of speech
technologies can be effectively measured. The aim of this paper is to present a detailed …

保存引用被引用次数：3 相关文章所有 7 个版本网页快照

Tase: Task-aware speech enhancement for wake-up word detection in voice assistants

G Cámbara, F López, D Bonet, P Gómez, C Segura… - Applied Sciences, 2022 - mdpi.com

Wake-up word spotting in noisy environments is a critical task for an excellent user
experience with voice assistants. Unwanted activation of the device is often due to the …

保存引用被引用次数：13 相关文章所有 6 个版本网页快照

Audio-Visual Speaker Diarization: Current Databases, Approaches and Challenges

V Mingote, A Ortega, A Miguel, E Lleida - arxiv preprint arxiv:2409.05659, 2024 - arxiv.org

Nowadays, the large amount of audio-visual content available has fostered the need to
develop new robust automatic speaker diarization systems to analyse and characterise it …

保存引用相关文章所有 2 个版本 HTML 版