Partial success in closing the gap between human and machine vision

R Geirhos, K Narayanappa, B Mitzkus… - Advances in …, 2021 - proceedings.neurips.cc
A few years ago, the first CNN surpassed human performance on ImageNet. However, it
soon became clear that machines lack robustness on more challenging test cases, a major …

Towards viewpoint robustness in Bird's Eye View segmentation

T Klinghoffer, J Philion, W Chen… - Proceedings of the …, 2023 - openaccess.thecvf.com
Autonomous vehicles (AV) require that neural networks used for perception be robust to
different viewpoints if they are to be deployed across many types of vehicles without the …

Pug: Photorealistic and semantically controllable synthetic data for representation learning

F Bordes, S Shekhar, M Ibrahim… - Advances in …, 2023 - proceedings.neurips.cc
Synthetic image datasets offer unmatched advantages for designing and evaluating deep
neural networks: they make it possible to (i) render as many data samples as needed,(ii) …

Does Progress On Object Recognition Benchmarks Improve Real-World Generalization?

M Richards, P Kirichenko, D Bouchacourt… - arxiv preprint arxiv …, 2023 - arxiv.org
For more than a decade, researchers have measured progress in object recognition on
ImageNet-based generalization benchmarks such as ImageNet-A,-C, and-R. Recent …

Learning to transform for generalizable instance-wise invariance

U Singhal, C Esteves, A Makadia… - Proceedings of the …, 2023 - openaccess.thecvf.com
Computer vision research has long aimed to build systems that are robust to transformations
found in natural data. Traditionally, this is done using data augmentation or hard-coding …

Does Progress On Object Recognition Benchmarks Improve Generalization on Crowdsourced, Global Data?

M Richards, P Kirichenko, D Bouchacourt… - The Twelfth …, 2023 - openreview.net
For more than a decade, researchers have measured progress in object recognition on the
ImageNet dataset along with its associated generalization benchmarks such as ImageNet-A …

Investigating the nature of 3d generalization in deep neural networks

SA Siddiqui, D Krueger, T Breuel - arxiv preprint arxiv:2304.09358, 2023 - arxiv.org
Visual object recognition systems need to generalize from a set of 2D training views to novel
views. The question of how the human visual system can generalize to novel views has …

On the Ability of Deep Networks to Learn Symmetries from Data: A Neural Kernel Theory

A Perin, S Deny - arxiv preprint arxiv:2412.11521, 2024 - arxiv.org
Symmetries (transformations by group actions) are present in many datasets, and leveraging
them holds significant promise for improving predictions in machine learning. In this work …

Understanding out-of-distribution accuracies through quantifying difficulty of test samples

B Simsek, M Hall, L Sagun - arxiv preprint arxiv:2203.15100, 2022 - arxiv.org
Existing works show that although modern neural networks achieve remarkable
generalization performance on the in-distribution (ID) dataset, the accuracy drops …

A comparison between humans and AI at recognizing objects in unusual poses

N Ollikka, A Abbas, A Perin, M Kilpeläinen… - arxiv preprint arxiv …, 2024 - arxiv.org
Deep learning is closing the gap with human vision on several object recognition
benchmarks. Here we investigate this gap for challenging images where objects are seen in …