Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Challenges in predictive maintenance–A review

P Nunes, J Santos, E Rocha - CIRP Journal of Manufacturing Science and …, 2023 - Elsevier
Predictive maintenance (PdM) aims at the reduction of costs to increase the competitive
strength of the enterprises. It uses sensor data together with analytics techniques to optimize …

Deep learning enabled semantic communication systems

H Xie, Z Qin, GY Li, BH Juang - IEEE Transactions on Signal …, 2021 - ieeexplore.ieee.org
Recently, deep learning enabled end-to-end communication systems have been developed
to merge all physical layer blocks in the traditional communication systems, which make joint …

Automatic speech recognition: a survey

M Malik, MK Malik, K Mehmood… - Multimedia Tools and …, 2021 - Springer
Recently great strides have been made in the field of automatic speech recognition (ASR) by
using various deep learning techniques. In this study, we present a thorough comparison …

A review of predictive coding algorithms

MW Spratling - Brain and cognition, 2017 - Elsevier
Predictive coding is a leading theory of how the brain performs probabilistic inference.
However, there are a number of distinct algorithms which are described by the term …

Audiodec: An open-source streaming high-fidelity neural audio codec

YC Wu, ID Gebru, D Marković… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
A good audio codec for live applications such as telecommunication is characterized by
three key properties: (1) compression, i.e., the bitrate that is required to transmit the signal …

A generic deep learning based cough analysis system from clinically validated samples for point-of-need COVID-19 test and severity levels

J Andreu-Perez, H Pérez-Espinosa… - IEEE Transactions …, 2021 - ieeexplore.ieee.org
In an attempt to reduce the infection rate of the COrona VIrus Disease-19 (Covid-19)
countries around the world have echoed the exigency for an economical, accessible, point …

Speech emotion recognition based on formant characteristics feature extraction and phoneme type convergence

ZT Liu, A Rehman, M Wu, WH Cao, M Hao - Information Sciences, 2021 - Elsevier
Speech Emotion Recognition (SER) has numerous applications including human-
robot interaction, online gaming, and health care assistance. While deep learning-based …

Novel dual-channel long short-term memory compressed capsule networks for emotion recognition

I Shahin, N Hindawi, AB Nassif, A Alhudhaif… - Expert Systems with …, 2022 - Elsevier
Recent analysis on speech emotion recognition (SER) has made considerable advances
with the use of MFCC's spectrogram features and the implementation of neural network …

A survey on signal processing based pathological voice detection techniques

R Islam, M Tarique, E Abdel-Raheem - IEEE Access, 2020 - ieeexplore.ieee.org
Voice disability is a barrier to effective communication. Around 1.2% of the World's
population is facing some form of voice disability. Surgical procedures namely laryngoscopy …