Comprehensive survey of deep learning in remote sensing: theories, tools, and challenges for the community

JE Ball, DT Anderson, CS Chan - Journal of applied remote …, 2017 - spiedigitallibrary.org
In recent years, deep learning (DL), a rebranding of neural networks (NNs), has risen to the
top in numerous areas, namely computer vision (CV), speech recognition, and natural …

Making images real again: A comprehensive survey on deep image composition

L Niu, W Cong, L Liu, Y Hong, B Zhang, J Liang… - arxiv preprint arxiv …, 2021 - arxiv.org
As a common image editing operation, image composition aims to combine the foreground
from one image and another background image, resulting in a composite image. However …

Ego4d: Around the world in 3,000 hours of egocentric video

K Grauman, A Westbury, E Byrne… - Proceedings of the …, 2022 - openaccess.thecvf.com
We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It
offers 3,670 hours of daily-life activity video spanning hundreds of scenarios (household …

Simple copy-paste is a strong data augmentation method for instance segmentation

G Ghiasi, Y Cui, A Srinivas, R Qian… - Proceedings of the …, 2021 - openaccess.thecvf.com
Building instance segmentation models that are data-efficient and can handle rare object
categories is an important challenge in computer vision. Leveraging data augmentations is a …

Segcloud: Semantic segmentation of 3d point clouds

L Tchapmi, C Choy, I Armeni, JY Gwak… - … conference on 3D …, 2017 - ieeexplore.ieee.org
3D semantic scene labeling is fundamental to agents operating in the real world. In
particular, labeling raw 3D point sets from sensors provides fine-grained semantics. Recent …

Cut, paste and learn: Surprisingly easy synthesis for instance detection

D Dwibedi, I Misra, M Hebert - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
A major impediment in rapidly deploying object detection models for instance detection is
the lack of large annotated datasets. For example, finding a large labeled dataset containing …

Class-agnostic object detection with multi-modal transformer

M Maaz, H Rasheed, S Khan, FS Khan… - European conference on …, 2022 - Springer
What constitutes an object? This has been a long-standing question in computer vision.
Towards this goal, numerous learning-free and learning-based approaches have been …

Towards universal object detection by domain attention

X Wang, Z Cai, D Gao… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Despite increasing efforts on universal representations for visual recognition, few have
addressed object detection. In this paper, we develop an effective and efficient universal …

T-LESS: An RGB-D dataset for 6D pose estimation of texture-less objects

T Hodan, P Haluza, Š Obdržálek… - 2017 IEEE Winter …, 2017 - ieeexplore.ieee.org
We introduce T-LESS, a new public dataset for estimating the 6D pose, ie translation and
rotation, of texture-less rigid objects. The dataset features thirty industry-relevant objects with …

Synthesizing training data for object detection in indoor scenes

G Georgakis, A Mousavian, AC Berg… - arxiv preprint arxiv …, 2017 - arxiv.org
Detection of objects in cluttered indoor environments is one of the key enabling
functionalities for service robots. The best performing object detection approaches in …