Generalized video anomaly event detection: Systematic taxonomy and comparison of deep models

Y Liu, D Yang, Y Wang, J Liu, J Liu… - ACM Computing …, 2024 - dl.acm.org
Video Anomaly Detection (VAD) serves as a pivotal technology in the intelligent surveillance
systems, enabling the temporal or spatial identification of anomalous events within videos …

Disentangled representation learning for multimodal emotion recognition

D Yang, S Huang, H Kuang, Y Du… - Proceedings of the 30th …, 2022 - dl.acm.org
Multimodal emotion recognition aims to identify human emotions from text, audio, and visual
modalities. Previous methods either explore correlations between different modalities or …

Aide: A vision-driven multi-view, multi-modal, multi-tasking dataset for assistive driving perception

D Yang, S Huang, Z Xu, Z Li, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Driver distraction has become a significant cause of severe traffic accidents over the past
decade. Despite the growing development of vision-driven driver monitoring systems, the …

Emotion recognition for multiple context awareness

D Yang, S Huang, S Wang, Y Liu, P Zhai, L Su… - European conference on …, 2022 - Springer
Understanding emotion in context is a rising hotspot in the computer vision community.
Existing methods lack reliable context semantics to mitigate uncertainty in expressing …

Improving generalization in visual reinforcement learning via conflict-aware gradient agreement augmentation

S Liu, Z Chen, Y Liu, Y Wang, D Yang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Learning a policy with great generalization to unseen environments remains challenging but
critical in visual reinforcement learning. Despite the success of augmentation combination in …

Learning modality-specific and-agnostic representations for asynchronous multimodal language sequences

D Yang, H Kuang, S Huang, L Zhang - Proceedings of the 30th ACM …, 2022 - dl.acm.org
Understanding human behaviors and intents from videos is a challenging task. Video flows
usually involve time-series data from different modalities, such as natural language, facial …

Deep learning for video anomaly detection: A review

P Wu, C Pan, Y Yan, G Pang, P Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Video anomaly detection (VAD) aims to discover behaviors or events deviating from the
normality in videos. As a long-standing task in the field of computer vision, VAD has …

DSDCLA: Driving style detection via hybrid CNN-LSTM with multi-level attention fusion

J Liu, Y Liu, D Li, H Wang, X Huang, L Song - Applied Intelligence, 2023 - Springer
Driving style detection is an essential real-world requirement in diverse contexts, such as
traffic safety, car insurance and fuel consumption optimization. However, the existing …

Stochastic video normality network for abnormal event detection in surveillance videos

Y Liu, D Yang, G Fang, Y Wang, D Wei, M Zhao… - Knowledge-Based …, 2023 - Elsevier
Abstract Video Anomaly Detection (VAD) aims to automatically identify unexpected spatial–
temporal patterns to detect abnormal events in surveillance videos. Existing unsupervised …

Learning appearance-motion normality for video anomaly detection

Y Liu, J Liu, M Zhao, D Yang, X Zhu… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Video anomaly detection is a challenging task in the Computer vision community. Most
single task-based methods do not consider the independence of unique spatial and …