State of the art on 3D reconstruction with RGB‐D cameras

M Zollhöfer, P Stotko, A Görlitz… - Computer graphics …, 2018 - Wiley Online Library
The advent of affordable consumer grade RGB‐D cameras has brought about a profound
advancement of visual scene reconstruction methods. Both computer graphics and …

LanguageBind: Extending video-language pretraining to N-modality by language-based semantic alignment

B Zhu, B Lin, M Ning, Y Yan, J Cui, HF Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
The video-language (VL) pretraining has achieved remarkable improvement in multiple
downstream tasks. However, the current VL pretraining framework is hard to extend to …

IntrinsicNeRF: Learning intrinsic neural radiance fields for editable novel view synthesis

W Ye, S Chen, C Bao, H Bao… - Proceedings of the …, 2023 - openaccess.thecvf.com
Existing inverse rendering combined with neural rendering methods can only perform
editable novel view synthesis on object-specific scenes, while we present intrinsic neural …

Self-supervised multi-level face model learning for monocular reconstruction at over 250 Hz

A Tewari, M Zollhöfer, P Garrido… - Proceedings of the …, 2018 - openaccess.thecvf.com
The reconstruction of dense 3D models of face geometry and appearance from a single
image is highly challenging and ill-posed. To constrain the problem, many approaches rely …

Improving video temporal consistency via broad learning system

B Sheng, P Li, R Ali, CLP Chen - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Applying image-based processing methods to original videos on a framewise level breaks
the temporal consistency between consecutive frames. Traditional video temporal …

FML: Face model learning from videos

A Tewari, F Bernard, P Garrido… - Proceedings of the …, 2019 - openaccess.thecvf.com
Monocular image-based 3D reconstruction of faces is a long-standing problem in computer
vision. Since image data is a 2D projection of a 3D face, the resulting depth ambiguity …

An L1 image transform for edge-preserving smoothing and scene-level intrinsic decomposition

S Bi, X Han, Y Yu - ACM Transactions On Graphics (TOG), 2015 - dl.acm.org
Identifying sparse salient structures from dense pixels is a longstanding problem in visual
computing. Solutions to this problem can benefit both image manipulation and …

PIE-Net: Photometric invariant edge guided network for intrinsic image decomposition

P Das, S Karaoglu, T Gevers - Proceedings of the IEEE/CVF …, 2022 - openaccess.thecvf.com
Intrinsic image decomposition is the process of recovering the image formation components
(reflectance and shading) from an image. Previous methods employ either explicit priors to …

Blind video temporal consistency

N Bonneel, J Tompkin, K Sunkavalli, D Sun… - ACM Transactions on …, 2015 - dl.acm.org
Extending image processing techniques to videos is a non-trivial task; applying processing
independently to each video frame often leads to temporal inconsistencies, and explicitly …

Estimating reflectance layer from a single image: Integrating reflectance guidance and shadow/specular aware learning

Y Jin, R Li, W Yang, RT Tan - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
Estimating the reflectance layer from a single image is a challenging task. It becomes more
challenging when the input image contains shadows or specular highlights, which often …