An overview of deep-learning-based audio-visual speech enhancement and separation

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

A review of state-of-the-art in Face Presentation Attack Detection: From early development to advanced deep learning and multi-modal fusion methods

F Abdullakutty, E Elyan, P Johnston - Information fusion, 2021 - Elsevier
Face Recognition is considered one of the most common biometric solutions these days and
is widely used across a range of devices for various security purposes. The performance of …

[HTML][HTML] Sentiment analysis of persian movie reviews using deep learning

K Dashtipour, M Gogate, A Adeel, H Larijani, A Hussain - Entropy, 2021 - mdpi.com
Sentiment analysis aims to automatically classify the subject's sentiment (eg, positive,
negative, or neutral) towards a particular aspect such as a topic, product, movie, news, etc …

A novel context-aware multimodal framework for persian sentiment analysis

K Dashtipour, M Gogate, E Cambria, A Hussain - Neurocomputing, 2021 - Elsevier
Most recent works on sentiment analysis have exploited the text modality. However, millions
of hours of video recordings posted on social media platforms everyday hold vital …

[HTML][HTML] Improved feature parameter extraction from speech signals using machine learning algorithm

AB Abdusalomov, F Safarov, M Rakhimov, B Turaev… - Sensors, 2022 - mdpi.com
Speech recognition refers to the capability of software or hardware to receive a speech
signal, identify the speaker's features in the speech signal, and recognize the speaker …

ER-NeRF++: Efficient region-aware Neural Radiance Fields for high-fidelity talking portrait synthesis

J Li, J Zhang, X Bai, J Zheng, J Zhou, L Gu - Information Fusion, 2024 - Elsevier
Abstract Despite conditional Neural Radiance Fields (NeRF) achieving great success in
modeling audio-driven talking portraits, the generation quality is increasingly hampered by …

[HTML][HTML] Novel deep convolutional neural network-based contextual recognition of Arabic handwritten scripts

R Ahmed, M Gogate, A Tahir, K Dashtipour… - Entropy, 2021 - mdpi.com
Offline Arabic Handwriting Recognition (OAHR) has recently become instrumental in the
areas of pattern recognition and image processing due to its application in several fields …

[HTML][HTML] Detection of atrial fibrillation using a machine learning approach

S Liaqat, K Dashtipour, A Zahid, K Assaleh, K Arshad… - Information, 2020 - mdpi.com
The atrial fibrillation (AF) is one of the most well-known cardiac arrhythmias in clinical
practice, with a prevalence of 1–2% in the community, which can increase the risk of stroke …

Pruning deep neural networks for green energy-efficient models: A survey

J Tmamna, EB Ayed, R Fourati, M Gogate, T Arslan… - Cognitive …, 2024 - Springer
Over the past few years, larger and deeper neural network models, particularly convolutional
neural networks (CNNs), have consistently advanced state-of-the-art performance across …

A machine learning approach involving functional connectivity features to classify rest-EEG psychogenic non-epileptic seizures from healthy controls

G Varone, W Boulila, M Lo Giudice, B Benjdira… - Sensors, 2021 - mdpi.com
Until now, clinicians are not able to evaluate the Psychogenic Non-Epileptic Seizures
(PNES) from the rest-electroencephalography (EEG) readout. No EEG marker can help …