Matnet: Motion-attentive transition network for zero-shot video object segmentation

T Zhou, J Li, S Wang, R Tao… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
In this paper, we present a novel end-to-end learning neural network, ie, MATNet, for zero-
shot video object segmentation (ZVOS). Motivated by the human visual attention behavior …

A comprehensive survey on video saliency detection with auditory information: The audio-visual consistency perceptual is the key!

C Chen, M Song, W Song, L Guo… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video saliency detection (VSD) aims at fast locating the most attractive
objects/things/patterns in a given video clip. Existing VSD-related works have mainly relied …

Asynchronous spatio-temporal memory network for continuous event-based object detection

J Li, J Li, L Zhu, X **ang, T Huang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Event cameras, offering extremely high temporal resolution and high dynamic range, have
brought a new perspective to addressing common object detection challenges (eg, motion …

[HTML][HTML] Review of visual saliency prediction: Development process from neurobiological basis to deep models

F Yan, C Chen, P **ao, S Qi, Z Wang, R **ao - Applied Sciences, 2021 - mdpi.com
The human attention mechanism can be understood and simulated by closely associating
the saliency prediction task to neuroscience and psychology. Furthermore, saliency …

Unified image and video saliency modeling

R Droste, J Jiao, JA Noble - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
Visual saliency modeling for images and videos is treated as two independent tasks in
recent computer vision literature. While image saliency modeling is a well-studied problem …

Multi-scale spatiotemporal feature fusion network for video saliency prediction

Y Zhang, T Zhang, C Wu, R Tao - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Recently, video saliency prediction has attracted increasing attention, yet the improvement
of its accuracy is still subject to the insufficient use of multi-scale spatiotemporal features. To …

Lidar-based online 3d video object detection with graph-based message passing and spatiotemporal transformer attention

J Yin, J Shen, C Guan, D Zhou… - Proceedings of the …, 2020 - openaccess.thecvf.com
Existing LiDAR-based 3D object detectors usually focus on the single-frame detection, while
ignoring the spatiotemporal information in consecutive point cloud frames. In this paper, we …

DADA: Driver attention prediction in driving accident scenarios

J Fang, D Yan, J Qiao, J Xue… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Driver attention prediction is becoming an essential research problem in human-like driving
systems. This work makes an attempt to predict the driver attention in driving accident …

Transformer-based multi-scale feature integration network for video saliency prediction

X Zhou, S Wu, R Shi, B Zheng, S Wang… - … on Circuits and …, 2023 - ieeexplore.ieee.org
Most cutting-edge video saliency prediction models rely on spatiotemporal features
extracted by 3D convolutions due to its local contextual cues acquirement ability. However …

Video saliency forecasting transformer

C Ma, H Sun, Y Rao, J Zhou, J Lu - IEEE transactions on circuits …, 2022 - ieeexplore.ieee.org
Video saliency prediction (VSP) aims to imitate eye fixations of humans. However, the
potential of this task has not been fully exploited since existing VSP methods only focus on …