A comprehensive survey on video saliency detection with auditory information: the audio-visual consistency perceptual is the key!

C Chen, M Song, W Song, L Guo… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Video saliency detection (VSD) aims at fast locating the most attractive
objects/things/patterns in a given video clip. Existing VSD-related works have mainly relied …

Stavis: Spatio-temporal audiovisual saliency network

A Tsiami, P Koutras, P Maragos - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
We introduce STAViS, a spatio-temporal audiovisual saliency network that combines spatio-
temporal visual and auditory information in order to efficiently address the problem of …

A novel un-supervised burst time dependent plasticity learning approach for biologically pattern recognition networks

M Amiri, AH Jafari, B Makkiabadi, S Nazari… - Information …, 2023 - Elsevier
Bio-inspired computing is an appropriate platform for develo** artificial intelligent
machines based on the behavioral and functional principles of the brain. Bio-inspired …

Listen to look into the future: Audio-visual egocentric gaze anticipation

B Lai, F Ryan, W Jia, M Liu, JM Rehg - European Conference on Computer …, 2024 - Springer
Egocentric gaze anticipation serves as a key building block for the emerging capability of
Augmented Reality. Notably, gaze behavior is driven by both visual cues and audio signals …

Recognizing intertwined patterns using a network of spiking pattern recognition platforms

M Amiri, AH Jafari, B Makkiabadi, S Nazari - Scientific Reports, 2022 - nature.com
Artificial intelligence computing adapted from biology is a suitable platform for the
development of intelligent machines by imitating the functional mechanisms of the nervous …

Neuromorphic circuit based on the un-supervised learning of biologically inspired spiking neural network for pattern recognition

S Nazari, A Keyanfar, MM Van Hulle - Engineering Applications of Artificial …, 2022 - Elsevier
One of the most sophisticated platforms for hosting intelligent systems is bio-inspired. This
study proposes pattern recognition hardware using a biologically inspired Spiking Neural …

A novel lightweight audio-visual saliency model for videos

D Zhu, X Shao, Q Zhou, X Min, G Zhai… - ACM Transactions on …, 2023 - dl.acm.org
Audio information has not been considered an important factor in visual attention models
regardless of many psychological studies that have shown the importance of audio …

Spiking pattern recognition using informative signal of image and unsupervised biologically plausible learning

S Nazari - Neurocomputing, 2019 - Elsevier
The recent progress of low-power neuromorphic hardware provides exceptional conditions
for applications where their focus is more on saving power. However, the design of spiking …

Audio–visual collaborative representation learning for dynamic saliency prediction

H Ning, B Zhao, Z Hu, L He, E Pei - Knowledge-Based Systems, 2022 - Elsevier
Abstract The Dynamic Saliency Prediction (DSP) task simulates the human selective
attention mechanism to perceive a dynamic scene, which is significant and imperative in …

A developmental model of audio-visual attention (MAVA) for bimodal language learning in infants and robots

R Bergoin, S Boucenna, R D'urso, D Cohen, A Pitti - Scientific Reports, 2024 - nature.com
A social individual needs to effectively manage the amount of complex information in his or
her environment relative to his or her own purpose to obtain relevant information. This paper …