Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Backdoor attacks against voice recognition systems: A survey
Voice Recognition Systems (VRSs) employ deep learning for speech recognition and
speaker recognition. They have been widely deployed in various real-world applications …
speaker recognition. They have been widely deployed in various real-world applications …
[HTML][HTML] An experimental review of speaker diarization methods with application to two-speaker conversational telephone speech recordings
We performed an experimental review of current diarization systems for the conversational
telephone speech (CTS) domain. In detail, we considered a total of eight different algorithms …
telephone speech (CTS) domain. In detail, we considered a total of eight different algorithms …
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification
The residual neural networks (ResNet) demonstrate the impressive performance in
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge
The ICASSP 2022 Multi-channel Multi-party Meeting Transcription Grand Challenge
(M2MeT) focuses on one of the most valuable and the most challenging scenarios of speech …
(M2MeT) focuses on one of the most valuable and the most challenging scenarios of speech …
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
Recently cross-channel attention, which better leverages multi-channel signals from
microphone array, has shown promising results in the multi-party meeting scenario. Cross …
microphone array, has shown promising results in the multi-party meeting scenario. Cross …
Speaker overlap-aware neural diarization for multi-party meeting analysis
Recently, hybrid systems of clustering and neural diarization models have been successfully
applied in multi-party meeting analysis. However, current models always treat overlapped …
applied in multi-party meeting analysis. However, current models always treat overlapped …
Multi-input multi-output target-speaker voice activity detection for unified, flexible, and robust audio-visual speaker diarization
Audio-visual learning has demonstrated promising results in many classical speech tasks
(eg, speech separation, automatic speech recognition, wake-word spotting). We believe that …
(eg, speech separation, automatic speech recognition, wake-word spotting). We believe that …
End-to-end Online Speaker Diarization with Target Speaker Tracking
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …
diarization tasks, which does not require a priori knowledge from the clustering-based …
Online target speaker voice activity detection for speaker diarization
This paper proposes an online target speaker voice activity detection system for speaker
diarization tasks, which does not require a priori knowledge from the clustering-based …
diarization tasks, which does not require a priori knowledge from the clustering-based …
The Second Multi-Channel Multi-Party Meeting Transcription Challenge (M2MeT 2.0): A Benchmark for Speaker-Attributed ASR
With the success of the first Multi-channel Multi-party Meeting Transcription challenge
(M2MeT), the second M2MeT challenge (M2MeT 2.0) held in ASRU2023 particularly aims to …
(M2MeT), the second M2MeT challenge (M2MeT 2.0) held in ASRU2023 particularly aims to …