Xinyuan Qian
Associate Professor, University of Science and Technology Beijing, China
Verified email at nus.edu.sg
Title
Cited by
Year
Is someone speaking? exploring long-term temporal features for audio-visual active speaker detection
R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li
Proceedings of the 29th ACM international conference on multimedia, 3927-3935, 2021
Cited by: 192; Year: 2021
Seeing what you said: Talking face generation guided by a lip reading expert
J Wang, X Qian, M Zhang, RT Tan, H Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 86; Year: 2023
Multi-speaker tracking from an audio–visual sensing device
X Qian, A Brutti, O Lanz, M Omologo, A Cavallaro
IEEE Transactions on Multimedia 21 (10), 2576-2588, 2019
Cited by: 65; Year: 2019
Multi-target DoA estimation with an audio-visual fusion mechanism
X Qian, M Madhavi, Z Pan, J Wang, H Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 49; Year: 2021
A time-frequency attention module for neural speech enhancement
Q Zhang, X Qian, Z Ni, A Nicolson, E Ambikairajah, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 462-475, 2022
Cited by: 39; Year: 2022
3D audio-visual speaker tracking with an adaptive particle filter
X Qian, A Brutti, M Omologo, A Cavallaro
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 39; Year: 2017
Audio-visual cross-attention network for robotic speaker tracking
X Qian, Z Wang, J Wang, G Guan, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 550-562, 2022
Cited by: 36; Year: 2022
Mamba in speech: Towards an alternative to self-attention
X Zhang, Q Zhang, H Liu, T Xiao, X Qian, B Ahmed, E Ambikairajah, H Li, ...
arXiv preprint arXiv:2405.12609, 2024
Cited by: 34; Year: 2024
Audio-visual tracking of concurrent speakers
X Qian, A Brutti, O Lanz, M Omologo, A Cavallaro
IEEE Transactions on Multimedia 24, 942-954, 2021
Cited by: 34; Year: 2021
Speaker extraction with co-speech gestures cue
Z Pan, X Qian, H Li
IEEE Signal Processing Letters 29, 1467-1471, 2022
Cited by: 28; Year: 2022
L³ F-TOUCH: A Wireless GelSight With Decoupled Tactile and Three-Axis Force Sensing
W Li, M Wang, J Li, Y Su, DK Jha, X Qian, K Althoefer, H Liu
IEEE Robotics and Automation Letters 8 (8), 5148-5155, 2023
Cited by: 25; Year: 2023
3D mouth tracking from a compact microphone array co-located with a camera
X Qian, A Xompero, A Cavallaro, A Brutti, O Lanz, M Omologo
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 24; Year: 2018
GCC-PHAT with speech-oriented attention for robotic sound source localization
J Wang, X Qian, Z Pan, M Zhang, H Li
2021 IEEE International Conference on Robotics and Automation (ICRA), 5876-5883, 2021
Cited by: 17; Year: 2021
Predict-and-update network: Audio-visual speech recognition inspired by human speech perception
J Wang, X Qian, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 15; Year: 2024
Deep audio-visual beamforming for speaker localization
X Qian, Q Zhang, G Guan, W Xue
IEEE Signal Processing Letters 29, 1132-1136, 2022
Cited by: 14; Year: 2022
Speech-oriented sparse attention denoising for voice user interface toward industry 5.0
H Zhu, Q Zhang, P Gao, X Qian
IEEE Transactions on Industrial Informatics 19 (2), 2151-2160, 2022
Cited by: 13; Year: 2022
Neural-free attention for monaural speech enhancement toward voice user interface for consumer electronics
M Chen, Q Zhang, Q Song, X Qian, R Guo, M Wang, D Chen
IEEE Transactions on Consumer Electronics 69 (4), 765-774, 2023
Cited by: 10; Year: 2023
A miniaturised camera-based multi-modal tactile sensor
K Althoefer, Y Ling, W Li, X Qian, WW Lee, P Qi
2023 IEEE International Conference on Robotics and Automation (ICRA), 12570 …, 2023
Cited by: 8; Year: 2023
Iterative sound source localization for unknown number of sources
Y Fu, M Ge, H Yin, X Qian, L Wang, G Zhang, J Dang
arXiv preprint arXiv:2206.12273, 2022
Cited by: 8; Year: 2022
Audio-visual temporal forgery detection using embedding-level fusion and multi-dimensional contrastive loss
M Liu, J Wang, X Qian, H Li
IEEE Transactions on Circuits and Systems for Video Technology 34 (8), 6937-6948, 2023
Cited by: 7; Year: 2023