Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Transformer-based attention network for in-vehicle intrusion detection

TP Nguyen, H Nam, D Kim - IEEE Access, 2023 - ieeexplore.ieee.org
Despite the significant advantages of communication systems between electronic control
units, the controller area network (CAN) protocol is vulnerable to attacks owing to its weak …

Self-supervised learning with cluster-aware-dino for high-performance robust speaker verification

B Han, Z Chen, Y Qian - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
The automatic speaker verification task has achieved great success using deep learning
approaches with a large-scale, manually annotated dataset. However, collecting a …

Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification

T Liu, KA Lee, Q Wang, H Li - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
The residual neural networks (ResNet) demonstrate the impressive performance in
automatic speaker verification (ASV). They treat the time and frequency dimensions equally …

Multivariate graph neural networks on enhancing syntactic and semantic for aspect-based sentiment analysis

H Wang, X Qiu, X Tan - Applied Intelligence, 2024 - Springer
Aspect-based sentiment analysis (ABSA) aims to predict sentiment orientations towards
textual aspects by extracting insights from user comments. While pretrained large language …

[PDF][PDF] DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design.

B Liu, Z Chen, S Wang, H Wang, B Han, Y Qian - INTERSPEECH, 2022 - isca-archive.org
Embeddings extracted by deep neural networks have become the state-of-the-art utterance
representation in speaker verification (SV). Despite the various network architectures that …

Depth-first neural architecture with attentive feature fusion for efficient speaker verification

B Liu, Z Chen, Y Qian - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Deep speaker embedding learning based on neural networks has become the predominant
approach in speaker verification (SV) currently. In prior studies, researchers have …

Towards a unified conformer structure: from asr to asv task

D Liao, T Jiang, F Wang, L Li… - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Transformer has achieved extraordinary performance in Natural Language Processing and
Computer Vision tasks thanks to its powerful self-attention mechanism, and its variant …

Self attention networks in speaker recognition

P Safari, M India, J Hernando - Applied Sciences, 2023 - mdpi.com
Recently, there has been a significant surge of interest in Self-Attention Networks (SANs)
based on the Transformer architecture. This can be attributed to their notable ability for …

One Model to Rule Them All: A Universal Transformer for Biometric Matching

M Abdrakhmanova, A Yermekova, Y Barko… - IEEE …, 2024 - ieeexplore.ieee.org
This study introduces the first single branch network designed to tackle a spectrum of
biometric matching scenarios, including unimodal, multimodal, cross-modal, and missing …