Overview of speaker modeling and its applications: From the lens of deep speaker representation learning
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …
By thoroughly and accurately modeling this information, it can be utilized in various …
Transformer-based attention network for in-vehicle intrusion detection
Despite the significant advantages of communication systems between electronic control
units, the controller area network (CAN) protocol is vulnerable to attacks owing to its weak …
units, the controller area network (CAN) protocol is vulnerable to attacks owing to its weak …
Self-supervised learning with cluster-aware-dino for high-performance robust speaker verification
The automatic speaker verification task has achieved great success using deep learning
approaches with a large-scale, manually annotated dataset. However, collecting a …
approaches with a large-scale, manually annotated dataset. However, collecting a …
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification
The residual neural networks (ResNet) demonstrate the impressive performance in
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …
Multivariate graph neural networks on enhancing syntactic and semantic for aspect-based sentiment analysis
Aspect-based sentiment analysis (ABSA) aims to predict sentiment orientations towards
textual aspects by extracting insights from user comments. While pretrained large language …
textual aspects by extracting insights from user comments. While pretrained large language …
[PDF][PDF] DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design.
Embeddings extracted by deep neural networks have become the state-of-the-art utterance
representation in speaker verification (SV). Despite the various network architectures that …
representation in speaker verification (SV). Despite the various network architectures that …
Depth-first neural architecture with attentive feature fusion for efficient speaker verification
Deep speaker embedding learning based on neural networks has become the predominant
approach in speaker verification (SV) currently. In prior studies, researchers have …
approach in speaker verification (SV) currently. In prior studies, researchers have …
Towards a unified conformer structure: from asr to asv task
D Liao, T Jiang, F Wang, L Li… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Transformer has achieved extraordinary performance in Natural Language Processing and
Computer Vision tasks thanks to its powerful self-attention mechanism, and its variant …
Computer Vision tasks thanks to its powerful self-attention mechanism, and its variant …
Self attention networks in speaker recognition
Recently, there has been a significant surge of interest in Self-Attention Networks (SANs)
based on the Transformer architecture. This can be attributed to their notable ability for …
based on the Transformer architecture. This can be attributed to their notable ability for …
One Model to Rule Them All: A Universal Transformer for Biometric Matching
This study introduces the first single branch network designed to tackle a spectrum of
biometric matching scenarios, including unimodal, multimodal, cross-modal, and missing …
biometric matching scenarios, including unimodal, multimodal, cross-modal, and missing …