Lightweight deep learning for resource-constrained environments: A survey

HI Liu, M Galindo, H **e, LK Wong, HH Shuai… - ACM Computing …, 2024 - dl.acm.org
Over the past decade, the dominance of deep learning has prevailed across various
domains of artificial intelligence, including natural language processing, computer vision …

Emovit: Revolutionizing emotion insights with visual instruction tuning

H **e, CJ Peng, YW Tseng, HJ Chen… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Visual Instruction Tuning represents a novel learning paradigm involving the fine-
tuning of pre-trained language models using task-specific instructions. This paradigm shows …

Distraction is all you need: Memory-efficient image immunization against diffusion-based image editing

L Lo, CY Yeo, HH Shuai… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recent text-to-image (T2I) diffusion models have revolutionized image editing by
empowering users to control outcomes using natural language. However the ease of image …

A Survey of Deep Learning for Group-level Emotion Recognition

X Huang, J Xu, W Zheng, Q Mao, A Dhall - arxiv preprint arxiv:2408.15276, 2024 - arxiv.org
With the advancement of artificial intelligence (AI) technology, group-level emotion
recognition (GER) has emerged as an important area in analyzing human behavior. Early …

MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding

S Madan, S Ghosh, LR Sookha, MA Ganaie… - arxiv preprint arxiv …, 2024 - arxiv.org
Estimating the Most Important Person (MIP) in any social event setup is a challenging
problem mainly due to contextual complexity and scarcity of labeled data. Moreover, the …

Language-Guided Negative Sample Mining for Open-Vocabulary Object Detection

YW Tseng, HH Shuai, CC Huang, YH Li… - 2024 International …, 2024 - ieeexplore.ieee.org
In the domain of computer vision, object detection serves as a fundamental perceptual task
with critical implications. Traditional object detection frameworks are limited by their inability …

Refining Valence-Arousal Estimation with Dual-Stream Label Density Smoothing

H **e, IH Li, L Lo, HH Shuai… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Emotion recognition through facial expressions remains a long-standing research pursuit,
yet the challenges persist, particularly in dynamic real-world scenarios. In-the-wild datasets …

A Spatial-Temporal Graph Convolutional Network for Video-Based Group Emotion Recognition

X Wang, T Chen, D Zhang - International Conference on Pattern …, 2024 - Springer
There are complex emotional interactions between individuals in group and between group
and individuals. Although existing methods for group emotion recognition (GER) made quite …

Group-Level Emotion Recognition Using Hierarchical Dual-Branch Cross Transformer with Semi-Supervised Learning

J Xu, X Huang - 2024 IEEE 4th International Conference on …, 2024 - ieeexplore.ieee.org
Group-level emotion recognition (GER) has received attention from researchers to identify
an overall emotion in a multi-person scene. To address attention issue on group dynamic in …