Semantic abstraction: Open-world 3d scene understanding from 2d vision-language models

H Ha, S Song - arxiv preprint arxiv:2207.11514, 2022 - arxiv.org
We study open-world 3D scene understanding, a family of tasks that require agents to
reason about their 3D environment with an open-set vocabulary and out-of-domain visual …

Semantic scene completion from a single depth image

S Song, F Yu, A Zeng, AX Chang… - Proceedings of the …, 2017 - openaccess.thecvf.com
This paper focuses on semantic scene completion, a task for producing a complete 3D voxel
representation of volumetric occupancy and semantic labels for a scene from a single-view …

Layoutnet: Reconstructing the 3d room layout from a single rgb image

C Zou, A Colburn, Q Shan… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
We propose an algorithm to predict room layout from a single image that generalizes across
panoramas and perspective images, cuboid layouts and more general layouts (eg" L"-shape …

Marr revisited: 2d-3d alignment via surface normal prediction

A Bansal, B Russell, A Gupta - Proceedings of the IEEE …, 2016 - openaccess.thecvf.com
We introduce an approach that leverages surface normal predictions, along with
appearance cues, to retrieve 3D models for objects depicted in 2D still images from a large …

Learning to parse wireframes in images of man-made environments

K Huang, Y Wang, Z Zhou, T Ding… - Proceedings of the …, 2018 - openaccess.thecvf.com
In this paper, we propose a learning-based approach to the task of automatically extracting
a" wireframe" representation for images of cluttered man-made environments. The wireframe …

Im2cad

H Izadinia, Q Shan, SM Seitz - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Given a single photo of a room and a large database of furniture CAD models, our goal is to
reconstruct a scene that is as similar as possible to the scene depicted in the photograph …

Efficient semantic scene completion network with spatial group convolution

J Zhang, H Zhao, A Yao, Y Chen… - Proceedings of the …, 2018 - openaccess.thecvf.com
Abstract We introduce Spatial Group Convolution (SGC) for accelerating the computation of
3D dense prediction tasks. SGC is orthogonal to group convolution, which works on spatial …

See and think: Disentangling semantic scene completion

S Liu, Y Hu, Y Zeng, Q Tang, B **… - Advances in Neural …, 2018 - proceedings.neurips.cc
Semantic scene completion predicts volumetric occupancy and object category of a 3D
scene, which helps intelligent agents to understand and interact with the surroundings. In …

Cascaded context pyramid for full-resolution 3d semantic scene completion

P Zhang, W Liu, Y Lei, H Lu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Abstract Semantic Scene Completion (SSC) aims to simultaneously predict the volumetric
occupancy and semantic category of a 3D scene. It helps intelligent devices to understand …

A coarse-to-fine indoor layout estimation (cfile) method

Y Ren, S Li, C Chen, CCJ Kuo - Computer Vision–ACCV 2016: 13th Asian …, 2017 - Springer
The task of estimating the spatial layout of cluttered indoor scenes from a single RGB image
is addressed in this work. Existing solutions to this problem largely rely on hand-crafted …