Multimodal feature extraction and fusion for emotional reaction intensity estimation and expression classification in videos with transformers J Li, Y Chen, X Zhang, J Nie, Z Li, Y Yu, Y Zhang, R Hong, M Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 21 | 2023 |
DAT: Dialogue-Aware Transformer with Modality-Group Fusion for Human Engagement Estimation J Li, Y Yu, Y Chen, Y Zhang, P Jia, Y Xu, Z Li, M Wang, R Hong Proceedings of the 32nd ACM International Conference on Multimedia, 11397-11403, 2024 | 1 | 2024 |