Enhanced computer vision with microsoft kinect sensor: A review

J Han, L Shao, D Xu, J Shotton - IEEE transactions on …, 2013 - ieeexplore.ieee.org
With the invention of the low-cost Microsoft Kinect sensor, high-resolution depth and visual
(RGB) sensing has become available for widespread use. The complementary nature of the …

Methods and datasets on semantic segmentation: A review

H Yu, Z Yang, L Tan, Y Wang, W Sun, M Sun, Y Tang - Neurocomputing, 2018 - Elsevier
Semantic segmentation, also called scene labeling, refers to the process of assigning a
semantic label (eg car, people, and road) to each pixel of an image. It is an essential data …

Matterport3d: Learning from rgb-d data in indoor environments

A Chang, A Dai, T Funkhouser, M Halber… - arxiv preprint arxiv …, 2017 - arxiv.org
Access to large, diverse RGB-D datasets is critical for training RGB-D scene understanding
algorithms. However, existing datasets still cover only a limited number of views or a …

Linknet: Exploiting encoder representations for efficient semantic segmentation

A Chaurasia, E Culurciello - 2017 IEEE visual communications …, 2017 - ieeexplore.ieee.org
Pixel-wise semantic segmentation for visual scene understanding not only needs to be
accurate, but also efficient in order to find any use in real-time application. Existing …

Scannet: Richly-annotated 3d reconstructions of indoor scenes

A Dai, AX Chang, M Savva, M Halber… - Proceedings of the …, 2017 - openaccess.thecvf.com
A key requirement for leveraging supervised deep learning methods is the availability of
large, labeled datasets. Unfortunately, in the context of RGB-D scene understanding, very …

Refinenet: Multi-path refinement networks for high-resolution semantic segmentation

G Lin, A Milan, C Shen, I Reid - Proceedings of the IEEE …, 2017 - openaccess.thecvf.com
Recently, very deep convolutional neural networks (CNNs) have shown outstanding
performance in object recognition and have also been the first choice for dense …

Semantic scene completion from a single depth image

S Song, F Yu, A Zeng, AX Chang… - Proceedings of the …, 2017 - openaccess.thecvf.com
This paper focuses on semantic scene completion, a task for producing a complete 3D voxel
representation of volumetric occupancy and semantic labels for a scene from a single-view …

Deeper depth prediction with fully convolutional residual networks

I Laina, C Rupprecht, V Belagiannis… - … conference on 3D …, 2016 - ieeexplore.ieee.org
This paper addresses the problem of estimating the depth map of a scene given a single
RGB image. We propose a fully convolutional architecture, encompassing residual learning …

3d semantic parsing of large-scale indoor spaces

I Armeni, O Sener, AR Zamir, H Jiang… - Proceedings of the …, 2016 - openaccess.thecvf.com
In this paper, we propose a method for semantic parsing the 3D point cloud of an entire
building using a hierarchical approach: first, the raw data is parsed into semantically …

Segnet: A deep convolutional encoder-decoder architecture for image segmentation

V Badrinarayanan, A Kendall… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
We present a novel and practical deep fully convolutional neural network architecture for
semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine …