Review on panoramic imaging and its applications in scene understanding

S Gao, K Yang, H Shi, K Wang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the rapid development of high-speed communication and artificial intelligence
technologies, human perception of real-world scenes is no longer limited to the use of small …

Deep learning for fluid velocity field estimation: A review

C Yu, X Bi, Y Fan - Ocean Engineering, 2023 - Elsevier
Deep learning has made tremendous progress in fluid mechanics in recent
years, because of its mighty feature extraction capacity from complicated and massive fluid …

CoTracker: It is better to track together

N Karaev, I Rocco, B Graham, N Neverova… - … on Computer Vision, 2024 - Springer
We introduce CoTracker, a transformer-based model that tracks a large number of 2D points
in long video sequences. Unlike most existing approaches that track points …

Stable video diffusion: Scaling latent video diffusion models to large datasets

A Blattmann, T Dockhorn, S Kulal… - arXiv preprint arXiv …, 2023 - arxiv.org
We present Stable Video Diffusion, a latent video diffusion model for high-resolution,
state-of-the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained …

Pix2Video: Video editing using image diffusion

D Ceylan, CHP Huang, NJ Mitra - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Image diffusion models, trained on massive image collections, have emerged as the most
versatile image generator model in terms of quality and diversity. They support inverting real …

VideoComposer: Compositional video synthesis with motion controllability

X Wang, H Yuan, S Zhang, D Chen… - Advances in …, 2024 - proceedings.neurips.cc
The pursuit of controllability as a higher standard of visual content creation has yielded
remarkable progress in customizable image synthesis. However, achieving controllable …

Iterative geometry encoding volume for stereo matching

G Xu, X Wang, X Ding, X Yang - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Recurrent All-Pairs Field Transforms (RAFT) has shown great potential in
matching tasks. However, all-pairs correlations lack non-local geometry knowledge and …

Drag your GAN: Interactive point-based manipulation on the generative image manifold

X Pan, A Tewari, T Leimkühler, L Liu, A Meka… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Synthesizing visual content that meets users' needs often requires flexible and precise
controllability of the pose, shape, expression, and layout of the generated objects. Existing …

Tracking everything everywhere all at once

Q Wang, YY Chang, R Cai, Z Li… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present a new test-time optimization method for estimating dense and long-range motion
from a video sequence. Prior optical flow or particle video tracking algorithms typically …

SUDS: Scalable urban dynamic scenes

H Turki, JY Zhang, F Ferroni… - Proceedings of the …, 2023 - openaccess.thecvf.com
We extend neural radiance fields (NeRFs) to dynamic large-scale urban scenes. Prior work
tends to reconstruct single video clips of short durations (up to 10 seconds). Two reasons …