Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d
For the last few decades, several major subfields of artificial intelligence including computer
vision, graphics, and robotics have progressed largely independently from each other …
vision, graphics, and robotics have progressed largely independently from each other …
Monoscene: Monocular 3d semantic scene completion
MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where the dense
geometry and semantics of a scene are inferred from a single monocular RGB image …
geometry and semantics of a scene are inferred from a single monocular RGB image …
Resnest: Split-attention networks
The ability to learn richer network representations generally boosts the performance of deep
learning models. To improve representation-learning in convolutional neural networks, we …
learning models. To improve representation-learning in convolutional neural networks, we …
A dynamic multi-scale voxel flow network for video prediction
The performance of video prediction has been greatly boosted by advanced deep neural
networks. However, most of the current methods suffer from large model sizes and require …
networks. However, most of the current methods suffer from large model sizes and require …
Robustnet: Improving domain generalization in urban-scene segmentation via instance selective whitening
Enhancing the generalization capability of deep neural networks to unseen domains is
crucial for safety-critical applications in the real world such as autonomous driving. To …
crucial for safety-critical applications in the real world such as autonomous driving. To …
Axial-deeplab: Stand-alone axial-attention for panoptic segmentation
Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …
attention has been adopted to augment CNNs with non-local interactions. Recent works …
Object-contextual representations for semantic segmentation
In this paper, we study the context aggregation problem in semantic segmentation.
Motivated by that the label of a pixel is the category of the object that the pixel belongs to, we …
Motivated by that the label of a pixel is the category of the object that the pixel belongs to, we …
Pointpainting: Sequential fusion for 3d object detection
Camera and lidar are important sensor modalities for robotics in general and self-driving
cars in particular. The sensors provide complementary information offering an opportunity for …
cars in particular. The sensors provide complementary information offering an opportunity for …
Panoptic-deeplab: A simple, strong, and fast baseline for bottom-up panoptic segmentation
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …
Dacs: Domain adaptation via cross-domain mixed sampling
Semantic segmentation models based on convolutional neural networks have recently
displayed remarkable performance for a multitude of applications. However, these models …
displayed remarkable performance for a multitude of applications. However, these models …