A survey on preprocessing and classification techniques for acoustic scene

VK Singh, K Sharma, SN Sur - Expert Systems with Applications, 2023 - Elsevier
There are lots of research papers for ASC, and in recent years it is rapidly increasing.
DCASE also provides different types of competition for the submission of several papers to …

Real-time deep learning-assisted mechano-acoustic system for respiratory diagnosis and multifunctional classification

HK Lee, SU Park, S Kong, H Ryu, HB Kim… - npj Flexible …, 2024 - nature.com
Epidermally mounted sensors using triaxial accelerometers have been previously used to
monitor physiological processes with the implementation of machine learning (ML) algorithm …

Mobile robot: automatic speech recognition application for automation and STEM education

DT Tran, DH Truong, HS Le, JH Huh - Soft Computing, 2023 - Springer
Nowadays, robots are widely applied in life as well as industrial production, medicine,
rescue, learning, and entertainment. There are many kinds of robots using different modern …

Advanced differential evolution for gender-aware English speech emotion recognition

L Yue, P Hu, J Zhu - Scientific Reports, 2024 - nature.com
Speech emotion recognition (SER) technology involves feature extraction and prediction
models. However, recognition efficiency tends to decrease because of gender differences …

Prosody features based low resource Punjabi children ASR and T-NT classifier using data augmentation

V Kadyan, T Hasija, A Singh - Multimedia Tools and Applications, 2023 - Springer
Automatic children speech recognition is always challenging due to limited corpus and
varying acoustic features. One among those is zero speech corpus and large acoustic …

A noise-robust voice conversion method with controllable background sounds

L Chen, X Zhang, Y Li, M Sun, W Chen - Complex & Intelligent Systems, 2024 - Springer
Background noises are usually treated as redundant or even harmful to voice conversion.
Therefore, when converting noisy speech, a pretrained module of speech separation is …

Annotation projection-based dependency parser development for Nepali

P Rai, S Chatterji - ACM Transactions on Asian and Low-Resource …, 2022 - dl.acm.org
Building computational resources and tools for the under-resourced languages is strenuous
for any Natural Language Processing task. This article presents the first dependency parser …

Use of bidirectional long short term memory in spoken word detection with reference to the Assamese language

D Kalita, KA Borbora, D Nath - Indian Journal …, 2022 - sciresol.s3.us-east-2.amazonaws …
Objectives: The proposed method is based on a unique technique of Deep learning for
identifying spoken words with reference to Assamese language. Most of the DNN based …

Using Speech and Text in Emotions Recognition

VM Gomes, APFM Mascarenhas… - … on Robotics (SBR) …, 2024 - ieeexplore.ieee.org
This study explores the intricacies of human communication with a focus on emotion
recognition in speech, emphasizing the significance of voice recognition. To address …

Advancing Music Genre Identification Through Deep Learning Techniques

C Shetty, SK Debnath, MJ Falleiro… - … Conference on Self …, 2023 - ieeexplore.ieee.org
Music plays a huge role in human's lives, inspiring a wide range of feelings including
enthusiasm and nostalgia. 97 million songs have been recorded globally; an incredible …