Deep learning-based multimodal emotion recognition from audio, visual, and text modalities: A systematic review of recent advancements and future prospects

S Zhang, Y Yang, C Chen, X Zhang, Q Leng… - Expert Systems with …, 2024 - Elsevier
Emotion recognition has recently attracted extensive interest due to its significant
applications to human–computer interaction. The expression of human emotion depends on …

Large language models in medical and healthcare fields: applications, advances, and challenges

D Wang, S Zhang - Artificial Intelligence Review, 2024 - Springer
Large language models (LLMs) are increasingly recognized for their advanced language
capabilities, offering significant assistance in diverse areas like medical communication …

Transformer-based multimodal feature enhancement networks for multimodal depression detection integrating video, audio and remote photoplethysmograph signals

H Fan, X Zhang, Y Xu, J Fang, S Zhang, X Zhao, J Yu - Information Fusion, 2024 - Elsevier
Depression stands as one of the most widespread psychological disorders and has
garnered increasing attention. Currently, how to effectively achieve automatic multimodal …

[HTML][HTML] A systematic survey of air quality prediction based on deep learning

Z Zhang, S Zhang, C Chen, J Yuan - Alexandria Engineering Journal, 2024 - Elsevier
The impact of air pollution on public health is substantial, and accurate long-term predictions
of air quality are crucial for early warning systems to address this issue. Air quality prediction …

Air quality forecasting using a spatiotemporal hybrid deep learning model based on VMD–GAT–BiLSTM

X Wang, S Zhang, Y Chen, L He, Y Ren, Z Zhang… - Scientific Reports, 2024 - nature.com
The precise forecasting of air quality is of great significance as an integral component of
early warning systems. This remains a formidable challenge owing to the limited information …

Ensemble learning using multivariate variational mode decomposition based on the Transformer for multi-step-ahead streamflow forecasting

J Fang, L Yang, X Wen, H Yu, W Li, JF Adamowski… - Journal of …, 2024 - Elsevier
Accurate streamflow forecasting is critical in the domain of water resources management.
However, the inherently non-stationary and stochastic nature of streamflow poses a …

Text-guided multimodal depression detection via cross-modal feature reconstruction and decomposition

Z Chen, D Wang, L Lou, S Zhang, X Zhao, S Jiang… - Information …, 2025 - Elsevier
Depression, a widespread and debilitating mental health disorder, requires early detection
to facilitate effective intervention. Automated depression detection integrating audio with text …

PointTransform Networks for automatic depression level prediction via facial keypoints

M Niu, M Li, C Fu - Knowledge-Based Systems, 2024 - Elsevier
According to physiological reports, individuals with different levels of depression present
various facial dynamic patterns. Thus, researchers have examined facial changes to predict …

LMTformer: facial depression recognition with lightweight multi-scale transformer from videos

L He, J Zhao, J Zhang, J Jiang, S Qi, Z Wang, D Wu - Applied Intelligence, 2025 - Springer
Depression will become the most common mental disorder worldwide by 2030. A number of
models based on deep learning are proposed to help the clinicians to assess the severity of …

Learning spatiotemporal dependencies using adaptive hierarchical graph convolutional neural network for air quality prediction

W Hu, Z Zhang, S Zhang, C Chen, J Yuan, J Yao… - Journal of Cleaner …, 2024 - Elsevier
Air quality prediction has garnered significant attention from both governmental bodies and
the general public due to its close connection with people's daily lives. Learning spatio …