A survey of image labelling for computer vision applications
Supervised machine learning methods for image analysis require large amounts of labelled
training data to solve computer vision problems. The recent rise of deep learning algorithms …
training data to solve computer vision problems. The recent rise of deep learning algorithms …
Object pose estimation using mid-level visual representations
This work proposes a novel pose estimation model for object categories that can be
effectively transferred to pre-viously unseen environments. The deep convolutional network …
effectively transferred to pre-viously unseen environments. The deep convolutional network …
Survey and systematization of 3D object detection models and methods
Strong demand for autonomous vehicles and the wide availability of 3D sensors are
continuously fueling the proposal of novel methods for 3D object detection. In this paper, we …
continuously fueling the proposal of novel methods for 3D object detection. In this paper, we …
Wildlife 3D multi-object tracking
The study of wildlife populations and species has gained increased relevance due to
significant endangerment, loss of habitats and world climate change. Using camera traps for …
significant endangerment, loss of habitats and world climate change. Using camera traps for …
Understanding Novice's Annotation Process For 3D Semantic Segmentation Task With Human-In-The-Loop
Large-scale 3D point clouds are often used as training data for 3D semantic segmentation,
but the labor-intensive nature of the annotation process challenges the acquisition of …
but the labor-intensive nature of the annotation process challenges the acquisition of …
OpenAnnotate2: Multi-Modal Auto-Annotating for Autonomous Driving
The demand for high-quality annotated data has surged in recent years for applications
driven by real-world artificial intelligence, such as autonomous driving and embodied …
driven by real-world artificial intelligence, such as autonomous driving and embodied …
ALGPT: Multi-Agent Cooperative Framework for Open-Vocabulary Multi-Modal Auto-Annotating in Autonomous Driving
Y Zhou, X Cheng, Q Zhang, L Wang… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Large Language Models (LLMs) have achieved impressive progress in decision-making
and task automation for intelligent agents. However, multiple agents must cooperate to …
and task automation for intelligent agents. However, multiple agents must cooperate to …
Utilizing Active Machine Learning for Quality Assurance: A case study of virtual car renderings in the automotive industry
Computer-generated imagery of car models has become an indispensable part of car
manufacturers' advertisement concepts. They are for instance used in car configurators to …
manufacturers' advertisement concepts. They are for instance used in car configurators to …
Interactive 3D Annotation of Objects in Moving Videos from Sparse Multi-view Frames
Segmenting and determining the 3D bounding boxes of objects of interest in RGB videos is
an important task for a variety of applications such as augmented reality, navigation, and …
an important task for a variety of applications such as augmented reality, navigation, and …
[HTML][HTML] FRESH: Fusion-Based 3D Apple Recognition via Estimating Stem Direction Heading
G Son, S Lee, Y Choi - Agriculture, 2024 - mdpi.com
In 3D apple detection, the challenge of direction for apple stem harvesting for agricultural
robotics has not yet been resolved. Addressing the issue of determining the stem direction of …
robotics has not yet been resolved. Addressing the issue of determining the stem direction of …