Understanding World or Predicting Future? A Comprehensive Survey of World Models
The concept of world models has garnered significant attention due to advancements in
multimodal large language models such as GPT-4 and video generation models such as …
multimodal large language models such as GPT-4 and video generation models such as …
Spaceblender: Creating context-rich collaborative spaces through generative 3d scene blending
There is increased interest in using generative AI to create 3D spaces for Virtual Reality (VR)
applications. However, today's models produce artificial environments, falling short of …
applications. However, today's models produce artificial environments, falling short of …
Crossviewdiff: A cross-view diffusion model for satellite-to-street view synthesis
Satellite-to-street view synthesis aims at generating a realistic street-view image from its
corresponding satellite-view image. Although stable diffusion models have exhibit …
corresponding satellite-view image. Although stable diffusion models have exhibit …
StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation
Recent advances in large reconstruction and generative models have significantly improved
scene reconstruction and novel view generation. However, due to compute limitations, each …
scene reconstruction and novel view generation. However, due to compute limitations, each …
[PDF][PDF] Lifelong Learning of Video Diffusion Models From a Single Video Stream
This work demonstrates that training autoregressive video diffusion models from a single,
continuous video stream is not only possible but remarkably can also be competitive with …
continuous video stream is not only possible but remarkably can also be competitive with …