State of the art on diffusion models for visual computing
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …
Evaluation of OpenAI o1: Opportunities and challenges of AGI
This comprehensive study evaluates the performance of OpenAI's o1-preview large
language model across a diverse array of complex reasoning tasks, spanning multiple …
LayoutGPT: Compositional visual planning and generation with large language models
Attaining a high degree of user controllability in visual generation often requires intricate,
fine-grained inputs like layouts. However, such inputs impose a substantial burden on users …
PointOdyssey: A large-scale synthetic dataset for long-term point tracking
We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …
MVImgNet: A large-scale dataset of multi-view images
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of "learning from large-scale data" in computer vision …
🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Massive datasets and high-capacity models have driven many recent advancements in
computer vision and natural language understanding. This work presents a platform to …
Habitat-Matterport 3D dataset (HM3D): 1000 large-scale 3D environments for embodied AI
We present the Habitat-Matterport 3D (HM3D) dataset. HM3D is a large-scale dataset of
1,000 building-scale 3D reconstructions from a diverse set of real-world locations. Each …
Object 3DIT: Language-guided 3D-aware image editing
Existing image editing tools, while powerful, typically disregard the underlying 3D geometry
from which the image is projected. As a result, edits made using these tools may become …
Infinite photorealistic worlds using procedural generation
We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural
world. Infinigen is entirely procedural: every asset, from shape to texture, is generated from …
DiscoScene: Spatially disentangled generative radiance fields for controllable 3D-aware scene synthesis
Existing 3D-aware image synthesis approaches mainly focus on generating a single
canonical object and show limited capacity in composing a complex scene containing a …