An empirical survey on explainable ai technologies: Recent trends, use-cases, and categories from technical and application perspectives
In a wide range of industries and academic fields, artificial intelligence is becoming
increasingly prevalent. AI models are taking on more crucial decision-making tasks as they …
increasingly prevalent. AI models are taking on more crucial decision-making tasks as they …
Disentangled representation learning for multimodal emotion recognition
Multimodal emotion recognition aims to identify human emotions from text, audio, and visual
modalities. Previous methods either explore correlations between different modalities or …
modalities. Previous methods either explore correlations between different modalities or …
Aide: A vision-driven multi-view, multi-modal, multi-tasking dataset for assistive driving perception
Driver distraction has become a significant cause of severe traffic accidents over the past
decade. Despite the growing development of vision-driven driver monitoring systems, the …
decade. Despite the growing development of vision-driven driver monitoring systems, the …
Emotion recognition for multiple context awareness
Understanding emotion in context is a rising hotspot in the computer vision community.
Existing methods lack reliable context semantics to mitigate uncertainty in expressing …
Existing methods lack reliable context semantics to mitigate uncertainty in expressing …
Shape matters: deformable patch attack
Though deep neural networks (DNNs) have demonstrated excellent performance in
computer vision, they are susceptible and vulnerable to carefully crafted adversarial …
computer vision, they are susceptible and vulnerable to carefully crafted adversarial …
Towards practical certifiable patch defense with vision transformer
Patch attacks, one of the most threatening forms of physical attack in adversarial examples,
can lead networks to induce misclassification by modifying pixels arbitrarily in a continuous …
can lead networks to induce misclassification by modifying pixels arbitrarily in a continuous …
Towards efficient data free black-box adversarial attack
Classic black-box adversarial attacks can take advantage of transferable adversarial
examples generated by a similar substitute model to successfully fool the target model …
examples generated by a similar substitute model to successfully fool the target model …
T-sea: Transfer-based self-ensemble attack on object detection
Compared to query-based black-box attacks, transfer-based black-box attacks do not
require any information of the attacked models, which ensures their secrecy. However, most …
require any information of the attacked models, which ensures their secrecy. However, most …
Improving generalization in visual reinforcement learning via conflict-aware gradient agreement augmentation
Learning a policy with great generalization to unseen environments remains challenging but
critical in visual reinforcement learning. Despite the success of augmentation combination in …
critical in visual reinforcement learning. Despite the success of augmentation combination in …
Learning modality-specific and-agnostic representations for asynchronous multimodal language sequences
Understanding human behaviors and intents from videos is a challenging task. Video flows
usually involve time-series data from different modalities, such as natural language, facial …
usually involve time-series data from different modalities, such as natural language, facial …