Generative adversarial networks in computer vision: A survey and taxonomy

Z Wang, Q She, TE Ward - ACM Computing Surveys (CSUR), 2021 - dl.acm.org
Generative adversarial networks (GANs) have been extensively studied in the past few
years. Arguably their most significant impact has been in the area of computer vision where …

Making images real again: A comprehensive survey on deep image composition

L Niu, W Cong, L Liu, Y Hong, B Zhang, J Liang… - arXiv preprint arXiv …, 2021 - arxiv.org
As a common image editing operation, image composition aims to combine the foreground
from one image and another background image, resulting in a composite image. However …

ObjectStitch: Object compositing with diffusion model

Y Song, Z Zhang, Z Lin, S Cohen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Object compositing based on 2D images is a challenging problem since it typically involves
multiple processing stages such as color harmonization, geometry correction and shadow …

Concealed object detection

DP Fan, GP Ji, MM Cheng… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
We present the first systematic study on concealed object detection (COD), which aims to
identify objects that are visually embedded in their background. The high intrinsic similarities …

FocalClick: Towards practical interactive image segmentation

X Chen, Z Zhao, Y Zhang, M Duan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Interactive segmentation allows users to extract target masks by making positive/negative
clicks. Although explored by many previous works, there is still a gap between academic …

ViTMatte: Boosting image matting with pre-trained plain vision transformers

J Yao, X Wang, S Yang, B Wang - Information Fusion, 2024 - Elsevier
Image matting is an inverse fusion process that separates the foreground and background
information by predicting alpha matte for each pixel. Recently, plain vision Transformers …

Head-free lightweight semantic segmentation with linear transformer

B Dong, P Wang, F Wang - Proceedings of the AAAI conference on …, 2023 - ojs.aaai.org
Existing semantic segmentation works have been mainly focused on designing effective
decoders; however, the computational load introduced by the overall structure has long …

Highly accurate dichotomous image segmentation

X Qin, H Dai, X Hu, DP Fan, L Shao… - European Conference on …, 2022 - Springer
We present a systematic study on a new task called dichotomous image segmentation (DIS),
which aims to segment highly accurate objects from natural images. To this end, we …

Robust high-resolution video matting with temporal guidance

S Lin, L Yang, I Saleemi… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We introduce a robust, real-time, high-resolution human video matting method that achieves
new state-of-the-art performance. Our method is much lighter than previous approaches and …

MVSNet: Depth inference for unstructured multi-view stereo

Y Yao, Z Luo, S Li, T Fang… - Proceedings of the …, 2018 - openaccess.thecvf.com
We present an end-to-end deep learning architecture for depth map inference from
multi-view images. In the network, we first extract deep visual image features, and then build the …