Revideo: Remake a video with motion and content control

C Mou, M Cao, X Wang, Z Zhang… - Advances in Neural …, 2025 - proceedings.neurips.cc
Despite significant advancements in video generation and editing using diffusion models,
achieving accurate and localized video editing remains a substantial challenge …

A survey on segment anything model (sam): Vision foundation model meets prompt engineering

C Zhang, FD Puspitasari, S Zheng, C Li, Y Qiao… - arxiv preprint arxiv …, 2023 - arxiv.org
Segment anything model (SAM) developed by Meta AI Research has recently attracted
significant attention. Trained on a large segmentation dataset of over 1 billion masks, SAM is …

Image conductor: Precision control for interactive video synthesis

Y Li, X Wang, Z Zhang, Z Wang, Z Yuan, L **e… - arxiv preprint arxiv …, 2024 - arxiv.org
Filmmaking and animation production often require sophisticated techniques for
coordinating camera transitions and object movements, typically involving labor-intensive …

Motion prompting: Controlling video generation with motion trajectories

D Geng, C Herrmann, J Hur, F Cole, S Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
Motion control is crucial for generating expressive and compelling video content; however,
most existing video generation models rely mainly on text prompts for control, which struggle …

Animateanything: Consistent and controllable animation for video generation

G Lei, C Wang, H Li, R Zhang, Y Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
We present a unified controllable video generation approach AnimateAnything that
facilitates precise and consistent video manipulation across various conditions, including …

Sg-i2v: Self-guided trajectory control in image-to-video generation

K Namekata, S Bahmani, Z Wu, Y Kant… - arxiv preprint arxiv …, 2024 - arxiv.org
Methods for image-to-video generation have achieved impressive, photo-realistic quality.
However, adjusting specific elements in generated videos, such as object motion or camera …

Identifying and solving conditional image leakage in image-to-video diffusion model

M Zhao, H Zhu, C **ang, K Zheng, C Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Diffusion models have obtained substantial progress in image-to-video generation.
However, in this paper, we find that these models tend to generate videos with less motion …

This&that: Language-gesture controlled video generation for robot planning

B Wang, N Sridhar, C Feng, M Van der Merwe… - arxiv preprint arxiv …, 2024 - arxiv.org
We propose a robot learning method for communicating, planning, and executing a wide
range of tasks, dubbed This&That. We achieve robot planning for general tasks by …

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

H Wang, H Ouyang, Q Wang, W Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
The intuitive nature of drag-based interaction has led to its growing adoption for controlling
object trajectories in image-to-video synthesis. Still, existing methods that perform dragging …

Trackgo: A flexible and efficient method for controllable video generation

H Zhou, C Wang, R Nie, J Lin, D Yu, Q Yu… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent years have seen substantial progress in diffusion-based controllable video
generation. However, achieving precise control in complex scenarios, including fine-grained …