- Academic Search

Cited by 101

MVSplat: Efficient 3D Gaussian splatting from sparse multi-view images

Y Chen, H Xu, C Zheng, B Zhuang, M Pollefeys… - … on Computer Vision, 2024 - Springer

We introduce MVSplat, an efficient model that, given sparse multi-view images as input,
predicts clean feed-forward 3D Gaussians. To accurately localize the Gaussian centers, we …

Cited by 170

Rerender a video: Zero-shot text-guided video-to-video translation

S Yang, Y Zhou, Z Liu, CC Loy - SIGGRAPH Asia 2023 Conference …, 2023 - dl.acm.org

Large text-to-image diffusion models have exhibited impressive proficiency in generating
high-quality images. However, when applying these models to video domain, ensuring …

Cited by 203

Unifying flow, stereo and depth estimation

H Xu, J Zhang, J Cai, H Rezatofighi… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

We present a unified formulation and model for three motion and 3D perception tasks:
optical flow, rectified stereo matching and unrectified stereo depth estimation from posed …

Cited by 100

FlowFormer++: Masked cost volume autoencoding for pretraining optical flow estimation

X Shi, Z Huang, D Li, M Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

FlowFormer introduces a transformer architecture into optical flow estimation and achieves
state-of-the-art performance. The core component of FlowFormer is the transformer-based …

Cited by 128

TAPIR: Tracking any point with per-frame initialization and temporal refinement

C Doersch, Y Yang, M Vecerik… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried
point on any physical surface throughout a video sequence. Our approach employs two …

Cited by 141

NICER-SLAM: Neural implicit scene encoding for RGB SLAM

Z Zhu, S Peng, V Larsson, Z Cui… - … Conference on 3D …, 2024 - ieeexplore.ieee.org

Neural implicit representations have recently become popular in simultaneous localization
and mapping (SLAM), especially in dense visual SLAM. However, existing works either rely …

Cited by 81

A dynamic multi-scale voxel flow network for video prediction

X Hu, Z Huang, A Huang, J Xu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

The performance of video prediction has been greatly boosted by advanced deep neural
networks. However, most of the current methods suffer from large model sizes and require …

Cited by 24

DINO-Tracker: Taming DINO for self-supervised point tracking in a single video

N Tumanyan, A Singer, S Bagon, T Dekel - European Conference on …, 2024 - Springer

We present DINO-Tracker, a new framework for long-term dense tracking in video. The pillar
of our approach is combining test-time training on a single video, with the powerful localized …