Epic-kitchens visor benchmark: Video segmentations and object relations
We introduce VISOR, a new dataset of pixel annotations and a benchmark suite for
segmenting hands and active objects in egocentric video. VISOR annotates videos from …
segmenting hands and active objects in egocentric video. VISOR annotates videos from …
A step towards worldwide biodiversity assessment: The bioscan-1m insect dataset
In an effort to catalog insect biodiversity, we propose a new large dataset of hand-labelled
insect images, the BIOSCAN-1M Insect Dataset. Each record is taxonomically classified by …
insect images, the BIOSCAN-1M Insect Dataset. Each record is taxonomically classified by …
A deep neural network for high‐throughput measurement of functional traits on museum skeletal specimens
Increasingly, natural history museum collections are being used to generate large‐scale
morphological datasets to address a range of macroecological and macroevolutionary …
morphological datasets to address a range of macroecological and macroevolutionary …
Opdmulti: Openable part detection for multiple objects
Openable part detection is the task of detecting the openable parts of an object in a single-
view image, and predicting corresponding motion parameters. Prior work investigated the …
view image, and predicting corresponding motion parameters. Prior work investigated the …
Clvos23: A long video object segmentation dataset for continual learning
A Nazemi, Z Moustafa… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Continual learning in real-world scenarios is a major challenge. A general continual
learning model should have a constant memory size and no predefined task boundaries, as …
learning model should have a constant memory size and no predefined task boundaries, as …
Pixel-Wise Recognition for Holistic Surgical Scene Understanding
This paper presents the Holistic and Multi-Granular Surgical Scene Understanding of
Prostatectomies (GraSP) dataset, a curated benchmark that models surgical scene …
Prostatectomies (GraSP) dataset, a curated benchmark that models surgical scene …
Personalized Representation from Personalized Generation
Modern vision models excel at general purpose downstream tasks. It is unclear, however,
how they may be used for personalized vision tasks, which are both fine-grained and data …
how they may be used for personalized vision tasks, which are both fine-grained and data …
OPDMulti: Openable Part Detection for Multiple Objects
Openable part detection is the task of detecting the openable parts of an object in a single-
view image and predicting corresponding motion parameters. Prior work investigated the …
view image and predicting corresponding motion parameters. Prior work investigated the …
SolarDK: A high-resolution urban solar panel image classification and localization dataset
The body of research on classification of solar panel arrays from aerial imagery is
increasing, yet there are still not many public benchmark datasets. This paper introduces two …
increasing, yet there are still not many public benchmark datasets. This paper introduces two …
Skeletal trait measurements for thousands of bird species
Large comparative datasets of avian functional traits have been used to address a wide
range of questions in ecology and evolution. To date, this work has been constrained by the …
range of questions in ecology and evolution. To date, this work has been constrained by the …