Parallel learning: Overview and perspective for computational learning across Syn2Real and Sim2Real
Q Miao, Y Lv, M Huang, X Wang… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
The virtual-to-real paradigm, ie, training models on virtual data and then applying them to
solve real-world problems, has attracted more and more attention from various domains by …
solve real-world problems, has attracted more and more attention from various domains by …
A survey on open-vocabulary detection and segmentation: Past, present, and future
As the most fundamental scene understanding tasks, object detection and segmentation
have made tremendous progress in deep learning era. Due to the expensive manual …
have made tremendous progress in deep learning era. Due to the expensive manual …
Knowledge-enhanced dual-stream zero-shot composed image retrieval
We study the zero-shot Composed Image Retrieval (ZS-CIR) task which is to retrieve the
target image given a reference image and a description without training on the triplet …
target image given a reference image and a description without training on the triplet …
Object-aware distillation pyramid for open-vocabulary object detection
Open-vocabulary object detection aims to provide object detectors trained on a fixed set of
object categories with the generalizability to detect objects described by arbitrary text …
object categories with the generalizability to detect objects described by arbitrary text …
Zero-shot camouflaged object detection
The goal of Camouflaged object detection (COD) is to detect objects that are visually
embedded in their surroundings. Existing COD methods only focus on detecting …
embedded in their surroundings. Existing COD methods only focus on detecting …
Revisiting open world object detection
Open World Object Detection (OWOD), simulating the real dynamic world where knowledge
grows continuously, attempts to detect both known and unknown classes and incrementally …
grows continuously, attempts to detect both known and unknown classes and incrementally …
Interacting objects: A dataset of object-object interactions for richer dynamic scene representations
Dynamic environments in factories, surgical robotics, and warehouses increasingly involve
humans, machines, robots, and various other objects such as tools, fixtures, conveyors, and …
humans, machines, robots, and various other objects such as tools, fixtures, conveyors, and …
Compositional prompt tuning with motion cues for open-vocabulary video relation detection
Prompt tuning with large-scale pretrained vision-language models empowers open-
vocabulary predictions trained on limited base categories, eg, object classification and …
vocabulary predictions trained on limited base categories, eg, object classification and …
Generating features with increased crop-related diversity for few-shot object detection
Two-stage object detectors generate object proposals and classify them to detect objects in
images. These proposals often do not perfectly contain the objects but overlap with them in …
images. These proposals often do not perfectly contain the objects but overlap with them in …
Meta-ZSDETR: Zero-shot DETR with Meta-learning
Zero-shot object detection aims to localize and recognize objects of unseen classes. Most of
existing works face two problems: the low recall of RPN in unseen classes and the confusion …
existing works face two problems: the low recall of RPN in unseen classes and the confusion …