Predicting visual fixations

M Kümmerer, M Bethge - Annual Review of Vision Science, 2023 - annualreviews.org
As we navigate and behave in the world, we are constantly deciding, a few times per
second, where to look next. The outcomes of these decisions in response to visual input are …

A comprehensive survey on video saliency detection with auditory information: the audio-visual consistency perceptual is the key!

C Chen, M Song, W Song, L Guo… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video saliency detection (VSD) aims at fast locating the most attractive
objects/things/patterns in a given video clip. Existing VSD-related works have mainly relied …

TranSalNet: Towards perceptually relevant visual saliency prediction

J Lou, H Lin, D Marshall, D Saupe, H Liu - Neurocomputing, 2022 - Elsevier
Convolutional neural networks (CNNs) have significantly advanced computational
modelling for saliency prediction. However, accurately simulating the mechanisms of visual …

DeepGaze IIE: Calibrated prediction in and out-of-domain for state-of-the-art saliency modeling

A Linardos, M Kümmerer, O Press… - Proceedings of the …, 2021 - openaccess.thecvf.com
Since 2014 transfer learning has become the key driver for the improvement of spatial
saliency prediction; however, with stagnant progress in the last 3-5 years. We conduct a …

Towards end-to-end video-based eye-tracking

S Park, E Aksan, X Zhang, O Hilliges - … , Glasgow, UK, August 23–28, 2020 …, 2020 - Springer
Estimating eye-gaze from images alone is a challenging task, in large parts due to
unobservable person-specific factors. Achieving high accuracy typically requires labeled data …

Video saliency forecasting transformer

C Ma, H Sun, Y Rao, J Zhou, J Lu - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video saliency prediction (VSP) aims to imitate eye fixations of humans. However, the
potential of this task has not been fully exploited since existing VSP methods only focus on …

ViNet: Pushing the limits of visual modality for audio-visual saliency prediction

S Jain, P Yarlagadda, S Jyoti, S Karthik… - 2021 IEEE/RSJ …, 2021 - ieeexplore.ieee.org
We propose the ViNet architecture for audio-visual saliency prediction. ViNet is a fully
convolutional encoder-decoder architecture. The encoder uses visual features from a …

Automatic probe movement guidance for freehand obstetric ultrasound

R Droste, L Drukker, AT Papageorghiou… - … Image Computing and …, 2020 - Springer
We present the first system that provides real-time probe movement guidance for acquiring
standard planes in routine freehand obstetric ultrasound scanning. Such a system can …

Transformer-based multi-scale feature integration network for video saliency prediction

X Zhou, S Wu, R Shi, B Zheng, S Wang… - … on Circuits and …, 2023 - ieeexplore.ieee.org
Most cutting-edge video saliency prediction models rely on spatiotemporal features
extracted by 3D convolutions due to its local contextual cues acquirement ability. However …

UEyes: Understanding visual saliency across user interface types

Y Jiang, LA Leiva, H Rezazadegan Tavakoli… - Proceedings of the …, 2023 - dl.acm.org
While user interfaces (UIs) display elements such as images and text in a grid-based layout,
UI types differ significantly in the number of elements and how they are displayed. For …