CNN-based encoder-decoder networks for salient object detection: A comprehensive review and recent advances

Y Ji, H Zhang, Z Zhang, M Liu - Information Sciences, 2021 - Elsevier
Convolutional neural network (CNN)-based encoder-decoder models have profoundly
inspired recent works in the field of salient object detection (SOD). With the rapid …

Few-shot object detection: A survey

S Antonelli, D Avola, L Cinque, D Crisostomi… - ACM Computing …, 2022 - dl.acm.org
Deep learning approaches have recently raised the bar in many fields, from Natural
Language Processing to Computer Vision, by leveraging large amounts of data. However …

Attention, please! A survey of neural attention models in deep learning

A de Santana Correia, EL Colombini - Artificial Intelligence Review, 2022 - Springer
In humans, Attention is a core property of all perceptual and cognitive operations. Given our
limited ability to process competing sources, attention mechanisms select, modulate, and …

A hybrid deep learning model with 1DCNN-LSTM-Attention networks for short-term traffic flow prediction

K Wang, C Ma, Y Qiao, X Lu, W Hao, S Dong - Physica A: Statistical …, 2021 - Elsevier
With the rapid development of social economy, the traffic volume of urban roads has raised
significantly, which has led to increasingly serious urban traffic congestion problems, and …

How much position information do convolutional neural networks encode?

MA Islam, S Jia, NDB Bruce - arxiv preprint arxiv:2001.08248, 2020 - arxiv.org
In contrast to fully connected networks, Convolutional Neural Networks (CNNs) achieve
efficiency by learning weights associated with local filters with a finite spatial extent. An …

Predicting the driver's focus of attention: the dr (eye) ve project

A Palazzi, D Abati, F Solera… - IEEE transactions on …, 2018 - ieeexplore.ieee.org
In this work we aim to predict the driver's focus of attention. The goal is to estimate what a
person would pay attention to while driving, and which part of the scene around the vehicle …

Unified image and video saliency modeling

R Droste, J Jiao, JA Noble - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Visual saliency modeling for images and videos is treated as two independent tasks in
recent computer vision literature. While image saliency modeling is a well-studied problem …

[HTML][HTML] Contextual encoder–decoder network for visual saliency prediction

A Kroner, M Senden, K Driessens, R Goebel - Neural Networks, 2020 - Elsevier
Predicting salient regions in natural images requires the detection of objects that are present
in a scene. To develop robust representations for this challenging task, high-level visual …

Salient object detection driven by fixation prediction

W Wang, J Shen, X Dong… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Research in visual saliency has been focused on two major types of models namely fixation
prediction and salient object detection. The relationship between the two, however, has …

A multimodal saliency model for videos with high audio-visual correspondence

X Min, G Zhai, J Zhou, XP Zhang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Audio information has been bypassed by most of current visual attention prediction studies.
However, sound could have influence on visual attention and such influence has been …