Panoptic segmentation: A review

O Elharrouss, S Al-Maadeed, N Subramanian… - arXiv preprint arXiv …, 2021 - arxiv.org
Image segmentation for video analysis plays an essential role in different research fields
such as smart city, healthcare, computer vision and geoscience, and remote sensing …

Transformer-based visual segmentation: A survey

X Li, H Ding, H Yuan, W Zhang, J Pang… - IEEE transactions on …, 2024 - ieeexplore.ieee.org
Visual segmentation seeks to partition images, video frames, or point clouds into multiple
segments or groups. This technique has numerous real-world applications, such as …

KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2D and 3D

Y Liao, J Xie, A Geiger - IEEE Transactions on Pattern Analysis …, 2022 - ieeexplore.ieee.org
For the last few decades, several major subfields of artificial intelligence including computer
vision, graphics, and robotics have progressed largely independently from each other …

HOI4D: A 4D egocentric dataset for category-level human-object interaction

Y Liu, Y Liu, C Jiang, K Lyu, W Wan… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze the
research of category-level human-object interaction. HOI4D consists of 2.4 M RGB-D …

Panoptic nuScenes: A large-scale benchmark for LiDAR panoptic segmentation and tracking

WK Fong, R Mohan, JV Hurtado, L Zhou… - IEEE Robotics and …, 2022 - ieeexplore.ieee.org
Panoptic scene understanding and tracking of dynamic agents are essential for robots and
automated vehicles to navigate in urban environments. As LiDARs provide accurate …

DetZero: Rethinking offboard 3D object detection with long-term sequential point clouds

T Ma, X Yang, H Zhou, X Li, B Shi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing offboard 3D detectors always follow a modular pipeline design to take advantage of
unlimited sequential point clouds. We have found that the full potential of offboard 3D …

Temporal consistent 3D LiDAR representation learning for semantic perception in autonomous driving

L Nunes, L Wiesmann, R Marcuzzi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Semantic perception is a core building block in autonomous driving, since it provides
information about the drivable space and location of other traffic participants. For learning …

SSCBench: A large-scale 3D semantic scene completion benchmark for autonomous driving

Y Li, S Li, X Liu, M Gong, K Li, N Chen… - 2024 IEEE/RSJ …, 2024 - ieeexplore.ieee.org
Monocular scene understanding is a foundational component of autonomous systems.
Within the spectrum of monocular perception topics, one crucial and useful task for holistic …

PolarMOT: How far can geometric relations take us in 3D multi-object tracking?

A Kim, G Brasó, A Ošep, L Leal-Taixé - European Conference on Computer …, 2022 - Springer
Most (3D) multi-object tracking methods rely on appearance-based cues for data
association. By contrast, we investigate how far we can get by only encoding geometric …

Dynamic 3D scene analysis by point cloud accumulation

S Huang, Z Gojcic, J Huang, A Wieser… - European Conference on …, 2022 - Springer
Multi-beam LiDAR sensors, as used on autonomous vehicles and mobile robots, acquire
sequences of 3D range scans (“frames”). Each frame covers the scene sparsely, due to …