CNN architectures for geometric transformation-invariant feature representation in computer vision: a review

A Mumuni, F Mumuni - SN Computer Science, 2021 - Springer
One of the main challenges in machine vision relates to the problem of obtaining robust
representation of visual features that remain unaffected by geometric transformations. This …

Occlusion handling in generic object detection: A review

K Saleh, S Szénási, Z Vámossy - 2021 IEEE 19th World …, 2021 - ieeexplore.ieee.org
The significant power of deep learning networks has led to enormous development in object
detection. Over the last few years, object detector frameworks have achieved tremendous …

3d-aware neural body fitting for occlusion robust 3d human pose estimation

Y Zhang, P Ji, A Wang, J Mei… - Proceedings of the …, 2023 - openaccess.thecvf.com
Regression-based methods for 3D human pose estimation directly predict the 3D pose
parameters from a 2D image using deep networks. While achieving state-of-the-art …

Occluded video instance segmentation: A benchmark

J Qi, Y Gao, Y Hu, X Wang, X Liu, X Bai… - International Journal of …, 2022 - Springer
Can our video understanding systems perceive objects when a heavy occlusion exists in a
scene? To answer this question, we collect a large-scale dataset called OVIS for occluded …

Swapmix: Diagnosing and regularizing the over-reliance on visual context in visual question answering

V Gupta, Z Li, A Kortylewski, C Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract While Visual Question Answering (VQA) has progressed rapidly, previous works
raise concerns about robustness of current VQA models. In this work, we study the …

{PatchCleanser}: Certifiably robust defense against adversarial patches for any image classifier

C **ang, S Mahloujifar, P Mittal - 31st USENIX Security Symposium …, 2022 - usenix.org
The adversarial patch attack against image classification models aims to inject adversarially
crafted pixels within a restricted image region (ie, a patch) for inducing model …

Robust object detection under occlusion with context-aware compositionalnets

A Wang, Y Sun, A Kortylewski… - Proceedings of the …, 2020 - openaccess.thecvf.com
Detecting partially occluded objects is a difficult task. Our experimental results show that
deep learning approaches, such as Faster R-CNN, are not robust at object detection under …

Compositional convolutional neural networks: A robust and interpretable model for object recognition under occlusion

A Kortylewski, Q Liu, A Wang, Y Sun… - International Journal of …, 2021 - Springer
Computer vision systems in real-world applications need to be robust to partial occlusion
while also being explainable. In this work, we show that black-box deep convolutional …

Using latent space regression to analyze and leverage compositionality in gans

L Chai, J Wulff, P Isola - arxiv preprint arxiv:2103.10426, 2021 - arxiv.org
In recent years, Generative Adversarial Networks have become ubiquitous in both research
and public perception, but how GANs convert an unstructured latent code to a high quality …

Deep nets: What have they ever done for vision?

AL Yuille, C Liu - International Journal of Computer Vision, 2021 - Springer
This is an opinion paper about the strengths and weaknesses of Deep Nets for vision. They
are at the heart of the enormous recent progress in artificial intelligence and are of growing …