Audio deepfake detection: A survey

J Yi, C Wang, J Tao, X Zhang, CY Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org
Audio deepfake detection is an emerging active topic. A growing number of literatures have
aimed to study deepfake detection algorithms and achieved effective performance, the …

Automatic speaker verification systems and spoof detection techniques: review and analysis

A Mittal, M Dua - International Journal of Speech Technology, 2022 - Springer
Automatic speaker verification (ASV) systems are enhanced enough, that industry is
attracted to use them practically in security systems. However, vulnerability of these systems …

On uni-modal feature learning in supervised multi-modal learning

C Du, J Teng, T Li, Y Liu, T Yuan… - International …, 2023 - proceedings.mlr.press
We abstract the features (ie learned representations) of multi-modal data into 1) uni-modal
features, which can be learned from uni-modal training, and 2) paired features, which can …

Clar: Contrastive learning of auditory representations

H Al-Tahan, Y Mohsenzadeh - International Conference on …, 2021 - proceedings.mlr.press
Learning rich visual representations using contrastive self-supervised learning has been
extremely successful. However, it is still a major question whether we could use a similar …

Time–frequency domain deep convolutional neural network for Li-ion battery SoC estimation

KH Kim, KH Oh, HS Ahn… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The state of charge (SoC) estimation is essential for many battery-related applications, such
as electric vehicles, unmanned aerial vehicles, and uninterruptible power supplies. This …

Music deep learning: deep learning methods for music signal processing—a review of the state-of-the-art

L Moysis, LA Iliadis, SP Sotiroudis, AD Boursianis… - Ieee …, 2023 - ieeexplore.ieee.org
The discipline of Deep Learning has been recognized for its strong computational tools,
which have been extensively used in data and signal processing, with innumerable …

[HTML][HTML] 1D Convolution approach to human activity recognition using sensor data and comparison with machine learning algorithms

K Muralidharan, A Ramesh, G Rithvik, S Prem… - International Journal of …, 2021 - Elsevier
Abstract Human Activity Recognition (HAR) has emerged as a major player in this era of
cutting-edge technological advancement. A key role that HAR plays is its ability to remotely …

Improving multi-modal learning with uni-modal teachers

C Du, T Li, Y Liu, Z Wen, T Hua, Y Wang… - arxiv preprint arxiv …, 2021 - arxiv.org
Learning multi-modal representations is an essential step towards real-world robotic
applications, and various multi-modal fusion models have been developed for this purpose …

Fastaudio: A learnable audio front-end for spoof speech detection

Q Fu, Z Teng, J White, ME Powell… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Spoof speech can be used to try and fool speaker verification systems that determine the
identity of the speaker based on voice characteristics. This paper compares popular …

Railway track inspection using deep learning based on audio to spectrogram conversion: An on-the-fly approach

MSA Hashmi, M Ibrahim, IS Bajwa, HUR Siddiqui… - Sensors, 2022 - mdpi.com
The periodic inspection of railroad tracks is very important to find structural and geometrical
problems that lead to railway accidents. Currently, in Pakistan, rail tracks are inspected by …