Cows on pasture: Baselines and benchmarks for language-driven zero-shot object navigation
For robots to be generally useful, they must be able to find arbitrary objects described by
people (ie, be language-driven) even without expensive navigation training on in-domain …
people (ie, be language-driven) even without expensive navigation training on in-domain …
Ovarnet: Towards open-vocabulary object attribute recognition
In this paper, we consider the problem of simultaneously detecting objects and inferring their
visual attributes in an image, even for those with no manual annotations provided at the …
visual attributes in an image, even for those with no manual annotations provided at the …
A survey of deep learning for low-shot object detection
Object detection has achieved a huge breakthrough with deep neural networks and massive
annotated data. However, current detection methods cannot be directly transferred to the …
annotated data. However, current detection methods cannot be directly transferred to the …
Object detectors in the open environment: Challenges, solutions, and outlook
With the emergence of foundation models, deep learning-based object detectors have
shown practical usability in closed set scenarios. However, for real-world tasks, object …
shown practical usability in closed set scenarios. However, for real-world tasks, object …
Collaborative normality learning framework for weakly supervised video anomaly detection
Video anomaly detection (VAD) under weak supervision aims to temporally locate abnormal
clips using the easy-to-obtain video-level labels. In this brief, we introduce the underlying …
clips using the easy-to-obtain video-level labels. In this brief, we introduce the underlying …
Meta-ZSDETR: Zero-shot DETR with Meta-learning
Zero-shot object detection aims to localize and recognize objects of unseen classes. Most of
existing works face two problems: the low recall of RPN in unseen classes and the confusion …
existing works face two problems: the low recall of RPN in unseen classes and the confusion …
Cross-epoch learning for weakly supervised anomaly detection in surveillance videos
Weakly Supervised Anomaly Detection (WSAD) in surveillance videos is a complex task
since usually only video-level annotations are available. Previous work treated it as a …
since usually only video-level annotations are available. Previous work treated it as a …
Probing contextual language models for common ground with visual representations
The success of large-scale contextual language models has attracted great interest in
probing what is encoded in their representations. In this work, we consider a new question …
probing what is encoded in their representations. In this work, we consider a new question …
Resolving semantic confusions for improved zero-shot detection
Zero-shot detection (ZSD) is a challenging task where we aim to recognize and localize
objects simultaneously, even when our model has not been trained with visual samples of a …
objects simultaneously, even when our model has not been trained with visual samples of a …
One point is all you need for weakly supervised object detection
Object detection with weak annotations has attracted much attention recently. Weakly
supervised object detection (WSOD) methods which only use image-level labels to train a …
supervised object detection (WSOD) methods which only use image-level labels to train a …