RichDreamer: A generalizable normal-depth diffusion model for detail richness in text-to-3D

L Qiu, G Chen, X Gu, Q Zuo, M Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric
prior and the complex entanglement of materials and lighting in natural images. Existing …

MiDaS v3.1 -- A model zoo for robust monocular relative depth estimation

R Birkl, D Wofk, M Müller - arXiv preprint arXiv:2307.14460, 2023 - arxiv.org
We release MiDaS v3.1 for monocular depth estimation, offering a variety of new models
based on different encoder backbones. This release is motivated by the success of …

ControlRoom3D: Room generation using semantic proxy rooms

J Schult, S Tsai, L Höllein, B Wu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Manually creating 3D environments for AR/VR applications is a complex process requiring
expert knowledge in 3D modeling software. Pioneering works facilitate this process by …

Towards text-guided 3d scene composition

Q Zhang, C Wang, A Siarohin… - Proceedings of the …, 2024 - openaccess.thecvf.com
We are witnessing significant breakthroughs in the technology for generating 3D objects
from text. Existing approaches either leverage large text-to-image models to optimize a 3D …

G3DR: Generative 3D reconstruction in ImageNet

P Reddy, I Elezi, J Deng - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
We introduce a novel 3D generative method Generative 3D Reconstruction (G3DR) in
ImageNet capable of generating diverse and high-quality 3D objects from single images …

[Book][B] SparseGS: Real-time 360° sparse view synthesis using Gaussian splatting

H Xiong - 2024 - search.proquest.com
The problem of novel view synthesis has grown significantly in popularity recently with the
introduction of Neural Radiance Fields (NeRFs) and other implicit scene representation …

IDOL: Unified dual-modal latent diffusion for human-centric joint video-depth generation

Y Zhai, K Lin, L Li, CC Lin, J Wang, Z Yang… - … on Computer Vision, 2024 - Springer
Significant advances have been made in human-centric video generation, yet the joint video-
depth generation problem remains underexplored. Most existing monocular depth …

Exploiting the signal-leak bias in diffusion models

MN Everaert, A Fitsios, M Bocchio… - Proceedings of the …, 2024 - openaccess.thecvf.com
There is a bias in the inference pipeline of most diffusion models. This bias arises from a
signal leak whose distribution deviates from the noise distribution, creating a discrepancy …

Diffusion priors for dynamic view synthesis from monocular videos

C Wang, P Zhuang, A Siarohin, J Cao, G Qian… - arXiv preprint arXiv …, 2024 - arxiv.org
Dynamic novel view synthesis aims to capture the temporal evolution of visual content within
videos. Existing methods struggle to distinguish between motion and structure …

iNVS: Repurposing diffusion inpainters for novel view synthesis

Y Kant, A Siarohin, M Vasilkovsky, RA Guler… - SIGGRAPH Asia 2023 …, 2023 - dl.acm.org
In this paper, we present a method for generating consistent novel views from a single
source image. Our approach focuses on maximizing the reuse of visible pixels from the …