Aerogen: enhancing remote sensing object detection with diffusion-driven data generation

D Tang, X Cao, X Wu, J Li, J Yao, X Bai, D Jiang… - arxiv preprint arxiv …, 2024 - arxiv.org
Remote sensing image object detection (RSIOD) aims to identify and locate specific objects
within satellite or aerial imagery. However, there is a scarcity of labeled data in current …

Paint Outside the Box: Synthesizing and Selecting Training Data for Visual Grounding

Z Du, H Li, J Yu, B Li - arxiv preprint arxiv:2412.00684, 2024 - arxiv.org
Visual grounding aims to localize the image regions based on a textual query. Given the
difficulty of large-scale data curation, we investigate how to effectively learn visual grounding …