Review on panoramic imaging and its applications in scene understanding

S Gao, K Yang, H Shi, K Wang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the rapid development of high-speed communication and artificial intelligence
technologies, human perception of real-world scenes is no longer limited to the use of small …

Lidargait: Benchmarking 3d gait recognition with point clouds

C Shen, C Fan, W Wu, R Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video-based gait recognition has achieved impressive results in constrained scenarios.
However, visual cameras neglect human 3D structure information, which limits the feasibility …

Gina-3d: Learning to generate implicit neural assets in the wild

B Shen, X Yan, CR Qi, M Najibi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Modeling the 3D world from sensor data for simulation is a scalable way of develo**
testing and validation environments for robotic learning problems such as autonomous …

Invariant slot attention: Object discovery with slot-centric reference frames

O Biza, S Van Steenkiste, MSM Sajjadi… - arxiv preprint arxiv …, 2023 - arxiv.org
Automatically discovering composable abstractions from raw perceptual data is a long-
standing challenge in machine learning. Recent slot-based neural networks that learn about …

Int2: Interactive trajectory prediction at intersections

Z Yan, P Li, Z Fu, S Xu, Y Shi, X Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Motion forecasting is an important component in autonomous driving systems. One of the
most challenging problems in motion forecasting is interactive trajectory prediction, whose …

Learning to adapt sam for segmenting cross-domain point clouds

X Peng, R Chen, F Qiao, L Kong, Y Liu, Y Sun… - … on Computer Vision, 2024 - Springer
Unsupervised domain adaptation (UDA) in 3D segmentation tasks presents a formidable
challenge, primarily steming from the sparse and unordered nature of point clouds …

XLM for Autonomous Driving Systems: A Comprehensive Review

S Fourati, W Jaafar, N Baccar, S Alfattani - arxiv preprint arxiv:2409.10484, 2024 - arxiv.org
Large Language Models (LLMs) have showcased remarkable proficiency in various
information-processing tasks. These tasks span from extracting data and summarizing …

Open panoramic segmentation

J Zheng, R Liu, Y Chen, K Peng, C Wu, K Yang… - … on Computer Vision, 2024 - Springer
Panoramic images, capturing a 360∘ field of view (FoV), encompass omnidirectional spatial
information crucial for scene understanding. However, it is not only costly to obtain training …

Dynamo-depth: fixing unsupervised depth estimation for dynamical scenes

Y Sun, B Hariharan - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Unsupervised monocular depth estimation techniques have demonstrated encouraging
results but typically assume that the scene is static. These techniques suffer when trained on …

Behind every domain there is a shift: Adapting distortion-aware vision transformers for panoramic semantic segmentation

J Zhang, K Yang, H Shi, S Reiß, K Peng… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
In this paper, we address panoramic semantic segmentation which is under-explored due to
two critical challenges:(1) image distortions and object deformations on panoramas;(2) lack …