Google 학술 검색

D Danier, M Aygün, C Li, H Bilen… - arxiv preprint arxiv …, 2024 - arxiv.org

Large-scale pre-trained vision models are becoming increasingly prevalent, offering
expressive and generalizable visual representations that benefit various downstream tasks …

저장 인용 관련 학술자료 전체 2개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Z Gu, R Yan, J Lu, P Li, Z Dou, C Si, Z Dong… - arxiv preprint arxiv …, 2025 - arxiv.org

Diffusion models have demonstrated impressive performance in generating high-quality
videos from text prompts or images. However, precise control over the video generation …

저장 인용 관련 학술자료 전체 2개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Exploring Representation-Aligned Latent Space for Better Generation

W Xu, X Yue, Z Wang, Y Teng, W Zhang, X Liu… - arxiv preprint arxiv …, 2025 - arxiv.org

Generative models serve as powerful tools for modeling the real world, with mainstream
diffusion models, particularly those based on the latent diffusion model paradigm, achieving …

저장 인용 관련 학술자료 전체 2개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization

S Dong, S Wang, S Liu, L Cai, Q Fan, J Kannala… - arxiv preprint arxiv …, 2024 - arxiv.org

Visual localization aims to determine the camera pose of a query image relative to a
database of posed images. In recent years, deep neural networks that directly regress …

저장 인용 관련 학술자료 전체 2개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

Relative Pose Estimation through Affine Corrections of Monocular Depth Priors

Y Yu, S Liu, R Pautrat, M Pollefeys… - arxiv preprint arxiv …, 2025 - arxiv.org

Monocular depth estimation (MDE) models have undergone significant advancements over
recent years. Many MDE models aim to predict affine-invariant relative depth from …

저장 인용 관련 학술자료 전체 2개의 버전 HTML 버전

[Free GPT-4]

[PDF] arxiv.org

SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos

Y Liu, S Dong, S Wang, Y Yin, Y Yang, Q Fan… - arxiv preprint arxiv …, 2024 - arxiv.org

In this paper, we introduce\textbf {SLAM3R}, a novel and effective monocular RGB SLAM
system for real-time and high-quality dense 3D reconstruction. SLAM3R provides an end-to …

저장 인용 관련 학술자료 전체 2개의 버전 HTML 버전

알림 만들기

인용

고급 검색

라이브러리에 저장됨

Moge: Unlocking accurate monocular geometry estimation for open-domain images with optimal...

DepthCues: Evaluating Monocular Depth Perception in Large Vision Models

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Exploring Representation-Aligned Latent Space for Better Generation

Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization

Relative Pose Estimation through Affine Corrections of Monocular Depth Priors

SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos