Hierarchical banzhaf interaction for general video-language representation learning

P **, H Li, L Yuan, S Yan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Multimodal representation learning, with contrastive learning, plays an important role in the
artificial intelligence domain. As an important subfield, video-language representation …