Google Tudós

N Karaev, I Rocco, B Graham, N Neverova… - … on Computer Vision, 2024 - Springer

We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Differently from most existing approaches that track points …

Mentés Hivatkozás Idézetek száma: 196 Kapcsolódó cikkek Mind a(z) 2 változat

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] Estimating optical flow: A comprehensive review of the state of the art

A Alfarano, L Maiano, L Papa, I Amerini - Computer Vision and Image …, 2024 - Elsevier

Optical flow estimation is a crucial task in computer vision that provides low-level motion
information. Despite recent advances, real-world applications still present significant …

Mentés Hivatkozás Idézetek száma: 10 Kapcsolódó cikkek

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Tracking everything everywhere all at once

Q Wang, YY Chang, R Cai, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present a new test-time optimization method for estimating dense and long-range motion
from a video sequence. Prior optical flow or particle video tracking algorithms typically …

Mentés Hivatkozás Idézetek száma: 148 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com

We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

Mentés Hivatkozás Idézetek száma: 117 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Perception test: A diagnostic benchmark for multimodal video models

V Patraucean, L Smaira, A Gupta… - Advances in …, 2023 - proceedings.neurips.cc

We propose a novel multimodal video benchmark-the Perception Test-to evaluate the
perception and reasoning skills of pre-trained multimodal models (eg Flamingo, BEiT-3, or …

Mentés Hivatkozás Idézetek száma: 86 Kapcsolódó cikkek Mind a(z) 4 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Tapir: Tracking any point with per-frame initialization and temporal refinement

C Doersch, Y Yang, M Vecerik… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried
point on any physical surface throughout a video sequence. Our approach employs two …

Mentés Hivatkozás Idézetek száma: 129 Kapcsolódó cikkek Mind a(z) 6 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Tap-vid: A benchmark for tracking any point in a video

C Doersch, A Gupta, L Markeeva… - Advances in …, 2022 - proceedings.neurips.cc

Generic motion understanding from video involves not only tracking objects, but also
perceiving how their surfaces deform and move. This information is useful to make …

Mentés Hivatkozás Idézetek száma: 144 Kapcsolódó cikkek Mind a(z) 6 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Scene representation transformer: Geometry-free novel view synthesis through set-latent scene representations

MSM Sajjadi, H Meyer, E Pot… - Proceedings of the …, 2022 - openaccess.thecvf.com

A classical problem in computer vision is to infer a 3D scene representation from few images
that can be used to render novel views at interactive rates. Previous work focuses on …

Mentés Hivatkozás Idézetek száma: 189 Kapcsolódó cikkek Mind a(z) 7 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Object 3dit: Language-guided 3d-aware image editing

O Michel, A Bhattad, E VanderBilt… - Advances in …, 2023 - proceedings.neurips.cc

Existing image editing tools, while powerful, typically disregard the underlying 3D geometry
from which the image is projected. As a result, edits made using these tools may become …

Mentés Hivatkozás Idézetek száma: 30 Kapcsolódó cikkek Mind a(z) 5 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Infinite photorealistic worlds using procedural generation

A Raistrick, L Lipson, Z Ma, L Mei… - Proceedings of the …, 2023 - openaccess.thecvf.com

We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural
world. Infinigen is entirely procedural: every asset, from shape to texture, is generated from …

Mentés Hivatkozás Idézetek száma: 68 Kapcsolódó cikkek Mind a(z) 7 változat HTML-változat

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

Cotracker: It is better to track together

[HTML][HTML] Estimating optical flow: A comprehensive review of the state of the art

Tracking everything everywhere all at once

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Perception test: A diagnostic benchmark for multimodal video models

Tapir: Tracking any point with per-frame initialization and temporal refinement

Tap-vid: A benchmark for tracking any point in a video

Scene representation transformer: Geometry-free novel view synthesis through set-latent scene representations

Object 3dit: Language-guided 3d-aware image editing

Infinite photorealistic worlds using procedural generation