A survey of image labelling for computer vision applications

C Sager, C Janiesch, P Zschech - Journal of Business Analytics, 2021 - Taylor & Francis
Supervised machine learning methods for image analysis require large amounts of labelled
training data to solve computer vision problems. The recent rise of deep learning algorithms …

Object pose estimation using mid-level visual representations

N Nejatishahidin, P Fayyazsanavi… - 2022 IEEE/RSJ …, 2022 - ieeexplore.ieee.org
This work proposes a novel pose estimation model for object categories that can be
effectively transferred to pre-viously unseen environments. The deep convolutional network …

Survey and systematization of 3D object detection models and methods

M Drobnitzky, J Friederich, B Egger, P Zschech - The Visual Computer, 2024 - Springer
Strong demand for autonomous vehicles and the wide availability of 3D sensors are
continuously fueling the proposal of novel methods for 3D object detection. In this paper, we …

Wildlife 3D multi-object tracking

M Klasen, V Steinhage - Ecological Informatics, 2022 - Elsevier
The study of wildlife populations and species has gained increased relevance due to
significant endangerment, loss of habitats and world climate change. Using camera traps for …

Understanding Novice's Annotation Process For 3D Semantic Segmentation Task With Human-In-The-Loop

Y Kim, E Lee, Y Lee, U Oh - … of the 29th International Conference on …, 2024 - dl.acm.org
Large-scale 3D point clouds are often used as training data for 3D semantic segmentation,
but the labor-intensive nature of the annotation process challenges the acquisition of …

OpenAnnotate2: Multi-Modal Auto-Annotating for Autonomous Driving

Y Zhou, L Cai, X Cheng, Q Zhang, X Xue… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
The demand for high-quality annotated data has surged in recent years for applications
driven by real-world artificial intelligence, such as autonomous driving and embodied …

ALGPT: Multi-Agent Cooperative Framework for Open-Vocabulary Multi-Modal Auto-Annotating in Autonomous Driving

Y Zhou, X Cheng, Q Zhang, L Wang… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Large Language Models (LLMs) have achieved impressive progress in decision-making
and task automation for intelligent agents. However, multiple agents must cooperate to …

Utilizing Active Machine Learning for Quality Assurance: A case study of virtual car renderings in the automotive industry

P Hemmer, N Kühl, J Schöffer - arxiv preprint arxiv:2110.09023, 2021 - arxiv.org
Computer-generated imagery of car models has become an indispensable part of car
manufacturers' advertisement concepts. They are for instance used in car configurators to …

Interactive 3D Annotation of Objects in Moving Videos from Sparse Multi-view Frames

K Oomori, W Kawabe, F Matulic, T Igarashi… - Proceedings of the ACM …, 2023 - dl.acm.org
Segmenting and determining the 3D bounding boxes of objects of interest in RGB videos is
an important task for a variety of applications such as augmented reality, navigation, and …

[HTML][HTML] FRESH: Fusion-Based 3D Apple Recognition via Estimating Stem Direction Heading

G Son, S Lee, Y Choi - Agriculture, 2024 - mdpi.com
In 3D apple detection, the challenge of direction for apple stem harvesting for agricultural
robotics has not yet been resolved. Addressing the issue of determining the stem direction of …