Epic-kitchens visor benchmark: Video segmentations and object relations

A Darkhalil, D Shan, B Zhu, J Ma… - Advances in …, 2022 - proceedings.neurips.cc
We introduce VISOR, a new dataset of pixel annotations and a benchmark suite for
segmenting hands and active objects in egocentric video. VISOR annotates videos from …

A step towards worldwide biodiversity assessment: The bioscan-1m insect dataset

Z Gharaee, ZM Gong, N Pellegrino… - Advances in …, 2024 - proceedings.neurips.cc
In an effort to catalog insect biodiversity, we propose a new large dataset of hand-labelled
insect images, the BIOSCAN-1M Insect Dataset. Each record is taxonomically classified by …

A deep neural network for high‐throughput measurement of functional traits on museum skeletal specimens

BC Weeks, Z Zhou, BK O'Brien… - Methods in Ecology …, 2023 - Wiley Online Library
Increasingly, natural history museum collections are being used to generate large‐scale
morphological datasets to address a range of macroecological and macroevolutionary …

Opdmulti: Openable part detection for multiple objects

X Sun, H Jiang, M Savva, AX Chang - arxiv preprint arxiv:2303.14087, 2023 - arxiv.org
Openable part detection is the task of detecting the openable parts of an object in a single-
view image, and predicting corresponding motion parameters. Prior work investigated the …

Clvos23: A long video object segmentation dataset for continual learning

A Nazemi, Z Moustafa… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Continual learning in real-world scenarios is a major challenge. A general continual
learning model should have a constant memory size and no predefined task boundaries, as …

Pixel-Wise Recognition for Holistic Surgical Scene Understanding

N Ayobi, S Rodríguez, A Pérez, I Hernández… - arxiv preprint arxiv …, 2024 - arxiv.org
This paper presents the Holistic and Multi-Granular Surgical Scene Understanding of
Prostatectomies (GraSP) dataset, a curated benchmark that models surgical scene …

Personalized Representation from Personalized Generation

S Sundaram, J Chae, Y Tian, S Beery… - arxiv preprint arxiv …, 2024 - arxiv.org
Modern vision models excel at general purpose downstream tasks. It is unclear, however,
how they may be used for personalized vision tasks, which are both fine-grained and data …

OPDMulti: Openable Part Detection for Multiple Objects

X Sun, H Jiang, M Savva… - … Conference on 3D Vision …, 2024 - ieeexplore.ieee.org
Openable part detection is the task of detecting the openable parts of an object in a single-
view image and predicting corresponding motion parameters. Prior work investigated the …

SolarDK: A high-resolution urban solar panel image classification and localization dataset

M Khomiakov, JH Radzikowski, CA Schmidt… - arxiv preprint arxiv …, 2022 - arxiv.org
The body of research on classification of solar panel arrays from aerial imagery is
increasing, yet there are still not many public benchmark datasets. This paper introduces two …

Skeletal trait measurements for thousands of bird species

BC Weeks, Z Zhou, CM Probst, JS Berv, BK O'Brien… - bioRxiv, 2024 - biorxiv.org
Large comparative datasets of avian functional traits have been used to address a wide
range of questions in ecology and evolution. To date, this work has been constrained by the …