Are deep neural networks adequate behavioral models of human visual perception?

FA Wichmann, R Geirhos - Annual review of vision science, 2023 - annualreviews.org
Deep neural networks (DNNs) are machine learning algorithms that have revolutionized
computer vision due to their remarkable successes in tasks like object classification and …

Improving performance of robots using human-inspired approaches: a survey

H Qiao, S Zhong, Z Chen, H Wang - Science China Information Sciences, 2022 - Springer
Realizing high performance of ordinary robots is one of the core problems in robotic
research. Improving the performance of ordinary robots usually relies on the collaborative …

Perceiver: General perception with iterative attention

A Jaegle, F Gimeno, A Brock… - International …, 2021 - proceedings.mlr.press
Biological systems understand the world by simultaneously processing high-dimensional
inputs from modalities as diverse as vision, audition, touch, proprioception, etc. The …

Siamese masked autoencoders

A Gupta, J Wu, J Deng, FF Li - Advances in Neural …, 2023 - proceedings.neurips.cc
Establishing correspondence between images or scenes is a significant challenge in
computer vision, especially given occlusions, viewpoint changes, and varying object …

Efficient neuromorphic signal processing with loihi 2

G Orchard, EP Frady, DBD Rubin… - … IEEE Workshop on …, 2021 - ieeexplore.ieee.org
The biologically inspired spiking neurons used in neuromorphic computing are nonlinear
filters with dynamic state variables—very different from the stateless neuron models used in …

Slowfast networks for video recognition

C Feichtenhofer, H Fan, J Malik… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
We present SlowFast networks for video recognition. Our model involves (i) a Slow pathway,
operating at low frame rate, to capture spatial semantics, and (ii) a Fast pathway, operating …

Visual object tracking: A survey

F Chen, X Wang, Y Zhao, S Lv, X Niu - Computer Vision and Image …, 2022 - Elsevier
Visual object tracking is an important area in computer vision, and many tracking algorithms
have been proposed with promising results. Existing object tracking approaches can be …

Optical flow estimation using a spatial pyramid network

A Ranjan, MJ Black - … of the IEEE conference on computer …, 2017 - openaccess.thecvf.com
We learn to compute optical flow by combining a classical spatial-pyramid formulation with
deep learning. This estimates large motions in a coarse-to-fine approach by war** one …

Appearance-and-relation networks for video classification

L Wang, W Li, W Li, L Van Gool - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Spatiotemporal feature learning in videos is a fundamental problem in computer vision. This
paper presents a new architecture, termed as Appearance-and-Relation Network (ARTNet) …

Deep convolutional models improve predictions of macaque V1 responses to natural images

SA Cadena, GH Denfield, EY Walker… - PLoS computational …, 2019 - journals.plos.org
Despite great efforts over several decades, our best models of primary visual cortex (V1) still
predict spiking activity quite poorly when probed with natural stimuli, highlighting our limited …