Autonomous vehicles enabled by the integration of IoT, edge intelligence, 5G, and blockchain

A Biswas, HC Wang - Sensors, 2023 - mdpi.com
The wave of modernization around us has put the automotive industry on the brink of a
paradigm shift. Leveraging the ever-evolving technologies, vehicles are steadily …

A brief survey on semantic segmentation with deep learning

S Hao, Y Zhou, Y Guo - Neurocomputing, 2020 - Elsevier
Semantic segmentation is a challenging task in computer vision. In recent years, the
performance of semantic segmentation has been greatly improved by using deep learning …

Sequential modeling enables scalable learning for large vision models

Y Bai, X Geng, K Mangalam, A Bar… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce a novel sequential modeling approach which enables learning a Large Vision
Model (LVM) without making use of any linguistic data. To do this we define a common …

Probing the 3d awareness of visual foundation models

M El Banani, A Raj, KK Maninis, A Kar… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recent advances in large-scale pretraining have yielded visual foundation models with
strong capabilities. Not only can recent models generalize to arbitrary images for their …

Mvimgnet: A large-scale dataset of multi-view images

X Yu, M Xu, Y Zhang, H Liu, C Ye… - Proceedings of the …, 2023 - openaccess.thecvf.com
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …

Google scanned objects: A high-quality dataset of 3d scanned household items

L Downs, A Francis, N Koenig, B Kinman… - … on Robotics and …, 2022 - ieeexplore.ieee.org
Interactive 3D simulations have enabled break-throughs in robotics and computer vision, but
simulating the broad diversity of environments needed for deep learning requires large …

Common objects in 3d: Large-scale learning and evaluation of real-life 3d category reconstruction

J Reizenstein, R Shapovalov… - Proceedings of the …, 2021 - openaccess.thecvf.com
Traditional approaches for learning 3D object categories have been predominantly trained
and evaluated on synthetic datasets due to the unavailability of real 3D-annotated category …

Omnidata: A scalable pipeline for making multi-task mid-level vision datasets from 3d scans

A Eftekhar, A Sax, J Malik… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Computer vision now relies on data, but we know surprisingly little about what factors in the
data affect performance. We argue that this stems from the way data is collected. Designing …

Abo: Dataset and benchmarks for real-world 3d object understanding

J Collins, S Goel, K Deng, A Luthra… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed
to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog …

Hypersim: A photorealistic synthetic dataset for holistic indoor scene understanding

M Roberts, J Ramapuram, A Ranjan… - Proceedings of the …, 2021 - openaccess.thecvf.com
For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-
pixel ground truth labels from real images. We address this challenge by introducing …