Sora: A review on background, technology, limitations, and opportunities of large vision models
Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …
model is trained to generate videos of realistic or imaginative scenes from text instructions …
Beyond deep reinforcement learning: A tutorial on generative diffusion models in network optimization
Generative Diffusion Models (GDMs) have emerged as a transformative force in the realm of
Generative Artificial Intelligence (GAI), demonstrating their versatility and efficacy across a …
Generative Artificial Intelligence (GAI), demonstrating their versatility and efficacy across a …
Diffusion policy: Visuomotor policy learning via action diffusion
This paper introduces Diffusion Policy, a new way of generating robot behavior by
representing a robot's visuomotor policy as a conditional denoising diffusion process. We …
representing a robot's visuomotor policy as a conditional denoising diffusion process. We …
Learning universal policies via text-guided video generation
A goal of artificial intelligence is to construct an agent that can solve a wide variety of tasks.
Recent progress in text-guided image synthesis has yielded models with an impressive …
Recent progress in text-guided image synthesis has yielded models with an impressive …
Training diffusion models with reinforcement learning
Diffusion models are a class of flexible generative models trained with an approximation to
the log-likelihood objective. However, most use cases of diffusion models are not concerned …
the log-likelihood objective. However, most use cases of diffusion models are not concerned …
Diffusion policies as an expressive policy class for offline reinforcement learning
Offline reinforcement learning (RL), which aims to learn an optimal policy using a previously
collected static dataset, is an important paradigm of RL. Standard RL methods often perform …
collected static dataset, is an important paradigm of RL. Standard RL methods often perform …
Diffusion model is an effective planner and data synthesizer for multi-task reinforcement learning
Diffusion models have demonstrated highly-expressive generative capabilities in vision and
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …
NLP. Recent studies in reinforcement learning (RL) have shown that diffusion models are …
Foundation models for decision making: Problems, methods, and opportunities
Foundation models pretrained on diverse data at scale have demonstrated extraordinary
capabilities in a wide range of vision and language tasks. When such models are deployed …
capabilities in a wide range of vision and language tasks. When such models are deployed …
Fast sampling of diffusion models via operator learning
Diffusion models have found widespread adoption in various areas. However, their
sampling process is slow because it requires hundreds to thousands of network evaluations …
sampling process is slow because it requires hundreds to thousands of network evaluations …
Playfusion: Skill acquisition via diffusion from language-annotated play
Learning from unstructured and uncurated data has become the dominant paradigm for
generative approaches in language or vision. Such unstructured and unguided behavior …
generative approaches in language or vision. Such unstructured and unguided behavior …