STAT: Spatial-temporal attention mechanism for video captioning C Yan*, Y Tu*, X Wang, Y Zhang, X Hao, Y Zhang, Q Dai
IEEE TMM 2019, 2019
407 2019 Long short-term relation transformer with global gating for video captioning L Li, X Gao, J Deng, Y Tu, ZJ Zha, Q Huang
IEEE TIP 2022, 2022
78 2022 Video description with spatial-temporal attention Y Tu, X Zhang, B Liu, C Yan
ACM MM 2017, 2017
70 2017 Enhancing the alignment between target words and corresponding frames for video captioning Y Tu, C Zhou, J Guo, S Gao, Z Yu
Pattern Recognition 2021, 2021
56 2021 Semantic relation-aware difference representation learning for change captioning Y Tu, T Yao, L Li, J Lou, S Gao, Z Yu, C Yan
Findings of ACL 2021, 2021
28 2021 SMART: Syntax-Calibrated Multi-Aspect Relation Transformer for Change Captioning Y Tu, L Li, L Su, ZJ Zha, Q Huang
IEEE TPAMI 2024, 2024
27 2024 Viewpoint-Adaptive Representation Disentanglement Network for Change Captioning Y Tu, L Li, L Su, J Du, K Lu, Q Huang
IEEE TIP 2023, 2023
27 * 2023 Relation-aware attention for video captioning via graph learning Y Tu, C Zhou, J Guo, H Li, S Gao, Z Yu
Pattern Recognition 2023, 2023
25 2023 I2Transformer: Intra-and Inter-relation Embedding Transformer for TV Show Captioning Y Tu, L Li, L Su, S Gao, C Yan, ZJ Zha, Z Yu, Q Huang
IEEE TIP 2022, 2022
25 2022 Self-supervised cross-view representation reconstruction for change captioning Y Tu, L Li, L Su, ZJ Zha, C Yan, Q Huang
ICCV 2023, 2023
23 2023 R Net: Relation-embedded Representation Reconstruction Network for Change Captioning Y Tu, L Li, C Yan, S Gao, Z Yu
EMNLP 2021, 2021
19 2021 I3n: Intra-and inter-representation interaction network for change captioning S Yue, Y Tu, L Li, Y Yang, S Gao, Z Yu
IEEE TMM 2023, 2023
18 2023 Neighborhood contrastive transformer for change captioning Y Tu, L Li, L Su, K Lu, Q Huang
IEEE TMM 2023, 2023
17 2023 Ls-gan: iterative language-based image manipulation via long and short term consistency reasoning G Cong, L Li, Z Liu, Y Tu, W Qin, S Zhang, C Yan, W Wang, B Jiang
ACM MM 2022, 2022
15 2022 Corrections to" STAT: Spatial-Temporal Attention Mechanism for Video Captioning". C Yan*, Y Tu*, X Wang, Y Zhang, X Hao, Y Zhang, Q Dai
IEEE TMM 2020, 2020
7 * 2020 Context-aware Difference Distilling for Multi-change Captioning Y Tu, L Li, L Su, ZJ Zha, C Yan, Q Huang
ACL 2024, 2024
4 2024 Multi-grained Representation Aggregating Transformer with Gating Cycle for Change Captioning S Yue, Y Tu, L Li, S Gao, Z Yu
ACM TOMM 2024, 2024
3 2024 Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning Y Tu, L Li, L Su, C Yan, Q Huang
ECCV 2024, 2024
1 2024 MAGIC: Rethinking Dynamic Convolution Design for Medical Image Segmentation S Li, Y Tu, Q Xiang, Z Li
ACM MM 2024, 2024
1 2024 Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning Y Tu, L Li, L Su, Q Huang
AAAI 2025, 2025
2025