[HTML][HTML] Generative artificial intelligence in the metaverse era

Z Lv - Cognitive Robotics, 2023 - Elsevier
Generative artificial intelligence (AI) is a form of AI that can autonomously generate new
content, such as text, images, audio, and video. Generative AI provides innovative …

Chaotic image encryption: state-of-the-art, ecosystem, and future roadmap

B Zolfaghari, T Koshiba - Applied System Innovation, 2022 - mdpi.com
Recently, many researchers have been interested in the application of chaos in
cryptography. Specifically, numerous research works have been focusing on chaotic image …

Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …

Monovit: Self-supervised monocular depth estimation with a vision transformer

C Zhao, Y Zhang, M Poggi, F Tosi… - … conference on 3D …, 2022 - ieeexplore.ieee.org
Self-supervised monocular depth estimation is an attractive solution that does not require
hard-to-source depth la-bels for training. Convolutional neural networks (CNNs) have …

NTIRE 2024 challenge on HR depth from images of specular and transparent surfaces

PZ Ramirez, F Tosi, L Di Stefano… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper reports on the NTIRE 2024 challenge on HR Depth From images of Specular and
Transparent surfaces held in conjunction with the New Trends in Image Restoration and …

Nerf-supervised deep stereo

F Tosi, A Tonioni, D De Gregorio… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce a novel framework for training deep stereo networks effortlessly and without
any ground-truth. By leveraging state-of-the-art neural rendering solutions, we generate …

Croco: Self-supervised pre-training for 3d vision tasks by cross-view completion

P Weinzaepfel, V Leroy, T Lucas… - Advances in …, 2022 - proceedings.neurips.cc
Abstract Masked Image Modeling (MIM) has recently been established as a potent pre-
training paradigm. A pretext task is constructed by masking patches in an input image, and …

Deep learning-based depth estimation methods from monocular image and videos: A comprehensive survey

U Rajapaksha, F Sohel, H Laga, D Diepeveen… - ACM Computing …, 2024 - dl.acm.org
Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …

Learning the distribution of errors in stereo matching for joint disparity and uncertainty estimation

L Chen, W Wang, P Mordohai - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We present a new loss function for joint disparity and uncertainty estimation in deep stereo
matching. Our work is motivated by the need for precise uncertainty estimates and the …

Smd-nets: Stereo mixture density networks

F Tosi, Y Liao, C Schmitt… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Despite stereo matching accuracy has greatly improved by deep learning in the last few
years, recovering sharp boundaries and high-resolution outputs efficiently remains …