Mvimgnet: A large-scale dataset of multi-view images
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …
Synthetic datasets for autonomous driving: A survey
Z Song, Z He, X Li, Q Ma, R Ming, Z Mao… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Autonomous driving techniques have been flourishing in recent years while thirsting for
huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up …
huge amounts of high-quality data. However, it is difficult for real-world datasets to keep up …
Efem: Equivariant neural field expectation maximization for 3d object segmentation without scene supervision
Abstract We introduce Equivariant Neural Field Expectation Maximization (EFEM), a simple,
effective, and robust geometric algorithm that can segment objects in 3D scenes without …
effective, and robust geometric algorithm that can segment objects in 3D scenes without …
Mm-3dscene: 3d scene understanding by customizing masked modeling with informative-preserved reconstruction and self-distilled consistency
Masked Modeling (MM) has demonstrated widespread success in various vision challenges,
by reconstructing masked visual patches. Yet, applying MM for large-scale 3D scenes …
by reconstructing masked visual patches. Yet, applying MM for large-scale 3D scenes …
Sampro3d: Locating sam prompts in 3d for zero-shot scene segmentation
We introduce SAMPro3D for zero-shot 3D indoor scene segmentation. Given the 3D point
cloud and multiple posed 2D frames of 3D scenes, our approach segments 3D scenes by …
cloud and multiple posed 2D frames of 3D scenes, our approach segments 3D scenes by …
Geometric relation-based feature aggregation for 3D small object detection
Point cloud-based 3D small object detection is crucial for autonomous driving and smart
ships. The current 3D object detection mainly relies on object global features derived from …
ships. The current 3D object detection mainly relies on object global features derived from …
Arkitscenerefer: Text-based localization of small objects in diverse real-world 3d indoor scenes
Abstract 3D referring expression comprehension is a task to ground text representations
onto objects in 3D scenes. It is a crucial task for indoor household robots or augmented …
onto objects in 3D scenes. It is a crucial task for indoor household robots or augmented …
DSPDet3D: 3D Small Object Detection with Dynamic Spatial Pruning
In this paper, we propose an efficient feature pruning strategy for 3D small object detection.
Conventional 3D object detection methods struggle on small objects due to the weak …
Conventional 3D object detection methods struggle on small objects due to the weak …
3D Adaptive Structural Convolution Network for Domain-Invariant Point Cloud Recognition
Adapting deep learning networks for point cloud data recognition in self-driving vehicles
faces challenges due to the variability in datasets and sensor technologies, emphasizing the …
faces challenges due to the variability in datasets and sensor technologies, emphasizing the …
Multi-scale deep learning and clustering-based tabletop object instance segmentation for robot manipulation
Z Jiang, Y Xue, Y Zhao, X Huang… - International Journal of …, 2024 - journals.sagepub.com
3D object instance segmentation plays a vital role in various applications such as
autonomous driving, robotics and virtual reality. However, tabletop scenes exhibit diverse …
autonomous driving, robotics and virtual reality. However, tabletop scenes exhibit diverse …