Large Language Model with Curriculum Reasoning for Visual Concept Recognition

Y Zhang, X Wang, H Chen, J Fan, W Wen… - Proceedings of the 30th …, 2024 - dl.acm.org
Visual concept recognition aims to capture the basic attributes of an image and reason
about the relationships among them to determine whether the image satisfies a certain …

Neighbor Does Matter: Curriculum Global Positive-Negative Sampling for Vision-Language Pre-training

B Huang, F He, Q Wang, H Chen, G Li, Z Feng… - Proceedings of the …, 2024 - dl.acm.org
Sampling strategies have been widely adopted in Vision-Language Pre-training (VLP) and
have achieved great success recently. However, the sampling strategies adopted by current …