[HTML][HTML] Generative artificial intelligence in the metaverse era
Z Lv - Cognitive Robotics, 2023 - Elsevier
Generative artificial intelligence (AI) is a form of AI that can autonomously generate new
content, such as text, images, audio, and video. Generative AI provides innovative …
content, such as text, images, audio, and video. Generative AI provides innovative …
Chaotic image encryption: state-of-the-art, ecosystem, and future roadmap
Recently, many researchers have been interested in the application of chaos in
cryptography. Specifically, numerous research works have been focusing on chaotic image …
cryptography. Specifically, numerous research works have been focusing on chaotic image …
Unifying flow, stereo and depth estimation
We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …
Monovit: Self-supervised monocular depth estimation with a vision transformer
Self-supervised monocular depth estimation is an attractive solution that does not require
hard-to-source depth la-bels for training. Convolutional neural networks (CNNs) have …
hard-to-source depth la-bels for training. Convolutional neural networks (CNNs) have …
NTIRE 2024 challenge on HR depth from images of specular and transparent surfaces
This paper reports on the NTIRE 2024 challenge on HR Depth From images of Specular and
Transparent surfaces held in conjunction with the New Trends in Image Restoration and …
Transparent surfaces held in conjunction with the New Trends in Image Restoration and …
Nerf-supervised deep stereo
We introduce a novel framework for training deep stereo networks effortlessly and without
any ground-truth. By leveraging state-of-the-art neural rendering solutions, we generate …
any ground-truth. By leveraging state-of-the-art neural rendering solutions, we generate …
Croco: Self-supervised pre-training for 3d vision tasks by cross-view completion
Abstract Masked Image Modeling (MIM) has recently been established as a potent pre-
training paradigm. A pretext task is constructed by masking patches in an input image, and …
training paradigm. A pretext task is constructed by masking patches in an input image, and …
Deep learning-based depth estimation methods from monocular image and videos: A comprehensive survey
Estimating depth from single RGB images and videos is of widespread interest due to its
applications in many areas, including autonomous driving, 3D reconstruction, digital …
applications in many areas, including autonomous driving, 3D reconstruction, digital …
Learning the distribution of errors in stereo matching for joint disparity and uncertainty estimation
We present a new loss function for joint disparity and uncertainty estimation in deep stereo
matching. Our work is motivated by the need for precise uncertainty estimates and the …
matching. Our work is motivated by the need for precise uncertainty estimates and the …
Smd-nets: Stereo mixture density networks
Despite stereo matching accuracy has greatly improved by deep learning in the last few
years, recovering sharp boundaries and high-resolution outputs efficiently remains …
years, recovering sharp boundaries and high-resolution outputs efficiently remains …