State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Evaluation of openai o1: Opportunities and challenges of agi

T Zhong, Z Liu, Y Pan, Y Zhang, Y Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org
This comprehensive study evaluates the performance of OpenAI's o1-preview large
language model across a diverse array of complex reasoning tasks, spanning multiple …

Layoutgpt: Compositional visual planning and generation with large language models

W Feng, W Zhu, T Fu, V Jampani… - Advances in …, 2023 - proceedings.neurips.cc
Attaining a high degree of user controllability in visual generation often requires intricate,
fine-grained inputs like layouts. However, such inputs impose a substantial burden on users …

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

Mvimgnet: A large-scale dataset of multi-view images

X Yu, M Xu, Y Zhang, H Liu, C Ye… - Proceedings of the …, 2023 - openaccess.thecvf.com
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …

🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

M Deitke, E VanderBilt, A Herrasti… - Advances in …, 2022 - proceedings.neurips.cc
Massive datasets and high-capacity models have driven many recent advancements in
computer vision and natural language understanding. This work presents a platform to …

Habitat-matterport 3d dataset (hm3d): 1000 large-scale 3d environments for embodied ai

SK Ramakrishnan, A Gokaslan, E Wijmans… - arxiv preprint arxiv …, 2021 - arxiv.org
We present the Habitat-Matterport 3D (HM3D) dataset. HM3D is a large-scale dataset of
1,000 building-scale 3D reconstructions from a diverse set of real-world locations. Each …

Object 3dit: Language-guided 3d-aware image editing

O Michel, A Bhattad, E VanderBilt… - Advances in …, 2023 - proceedings.neurips.cc
Existing image editing tools, while powerful, typically disregard the underlying 3D geometry
from which the image is projected. As a result, edits made using these tools may become …

Infinite photorealistic worlds using procedural generation

A Raistrick, L Lipson, Z Ma, L Mei… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural
world. Infinigen is entirely procedural: every asset, from shape to texture, is generated from …

Discoscene: Spatially disentangled generative radiance fields for controllable 3d-aware scene synthesis

Y Xu, M Chai, Z Shi, S Peng… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing 3D-aware image synthesis approaches mainly focus on generating a single
canonical object and show limited capacity in composing a complex scene containing a …