State of the art on diffusion models for visual computing

R Po, W Yifan, V Golyanik, K Aberman… - Computer Graphics …, 2024 - Wiley Online Library
The field of visual computing is rapidly advancing due to the emergence of generative
artificial intelligence (AI), which unlocks unprecedented capabilities for the generation …

Evaluation of openai o1: Opportunities and challenges of agi

T Zhong, Z Liu, Y Pan, Y Zhang, Y Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org
This comprehensive study evaluates the performance of OpenAI's o1-preview large
language model across a diverse array of complex reasoning tasks, spanning multiple …

Layoutgpt: Compositional visual planning and generation with large language models

W Feng, W Zhu, T Fu, V Jampani… - Advances in …, 2023 - proceedings.neurips.cc
Attaining a high degree of user controllability in visual generation often requires intricate,
fine-grained inputs like layouts. However, such inputs impose a substantial burden on users …

Pointodyssey: A large-scale synthetic dataset for long-term point tracking

Y Zheng, AW Harley, B Shen… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework,
for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to …

Mvimgnet: A large-scale dataset of multi-view images

X Yu, M Xu, Y Zhang, H Liu, C Ye… - Proceedings of the …, 2023 - openaccess.thecvf.com
Being data-driven is one of the most iconic properties of deep learning algorithms. The birth
of ImageNet drives a remarkable trend of" learning from large-scale data" in computer vision …

🏘️ ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

M Deitke, E VanderBilt, A Herrasti… - Advances in …, 2022 - proceedings.neurips.cc
Massive datasets and high-capacity models have driven many recent advancements in
computer vision and natural language understanding. This work presents a platform to …

Habitat-matterport 3d dataset (hm3d): 1000 large-scale 3d environments for embodied ai

SK Ramakrishnan, A Gokaslan, E Wijmans… - arxiv preprint arxiv …, 2021 - arxiv.org
We present the Habitat-Matterport 3D (HM3D) dataset. HM3D is a large-scale dataset of
1,000 building-scale 3D reconstructions from a diverse set of real-world locations. Each …

Infinite photorealistic worlds using procedural generation

A Raistrick, L Lipson, Z Ma, L Mei… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural
world. Infinigen is entirely procedural: every asset, from shape to texture, is generated from …

Scaling up dynamic human-scene interaction modeling

N Jiang, Z Zhang, H Li, X Ma, Z Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Confronting the challenges of data scarcity and advanced motion synthesis in human-scene
interaction modeling we introduce the TRUMANS dataset alongside a novel HSI motion …

Diffuscene: Denoising diffusion models for generative indoor scene synthesis

J Tang, Y Nie, L Markhasin, A Dai… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present DiffuScene for indoor 3D scene synthesis based on a novel scene configuration
denoising diffusion model. It generates 3D instance properties stored in an unordered object …