Xinyuan Qian

Cited by

	All	Since 2020
Citations	837	805
h-index	14	14
i10-index	17	17

Citations per year

Year	Citations
2017	3
2018	5
2019	21
2020	21
2021	34
2022	93
2023	201
2024	402
2025	54

Public access

16 articles

7 articles

available

not available

Based on funding mandates

Co-authors

Haizhou Li - The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, Singapore - Verified email at u.nus.edu
Jiadong Wang - Technical University of Munich; National University of Singapore - Verified email at tum.de
Pan Zexu - Alibaba; MERL; National University of Singapore - Verified email at u.nus.edu
Tao Ruijie - Zoom, NUS - Verified email at u.nus.edu
Alessio Brutti - FBK - Verified email at fbk.eu
Andrea Cavallaro - Director, Idiap Research Institute; Professor, EPFL - Verified email at epfl.ch
Oswald Lanz - Free University of Bozen-Bolzano - Verified email at inf.unibz.it
Alessio Xompero - Postdoctoral Research Assistant, Queen Mary University of London - Verified email at qmul.ac.uk
Wei Xue - HKUST - Verified email at ust.hk
Hao Tang - Peking University | CMU | ETH Zurich | University of Oxford | University of Trento | IIAI - Verified email at pku.edu.cn
Maurizio Omologo - Principal Applied Scientist, Amazon Alexa, Italy and USA

Xinyuan Qian

Associate Professor, University of Science and Technology Beijing, China

Verified email at nus.edu.sg - Homepage

speech processing, multimedia, human robot interaction


Title	Cited by	Year
Is someone speaking? exploring long-term temporal features for audio-visual active speaker detection; R Tao, Z Pan, RK Das, X Qian, MZ Shou, H Li; Proceedings of the 29th ACM international conference on multimedia, 3927-3935, 2021	192	2021
Seeing what you said: Talking face generation guided by a lip reading expert; J Wang, X Qian, M Zhang, RT Tan, H Li; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	86	2023
Multi-speaker tracking from an audio–visual sensing device; X Qian, A Brutti, O Lanz, M Omologo, A Cavallaro; IEEE Transactions on Multimedia 21 (10), 2576-2588, 2019	65	2019
Multi-target DoA estimation with an audio-visual fusion mechanism; X Qian, M Madhavi, Z Pan, J Wang, H Li; ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	49	2021
A time-frequency attention module for neural speech enhancement; Q Zhang, X Qian, Z Ni, A Nicolson, E Ambikairajah, H Li; IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 462-475, 2022	39	2022
3D audio-visual speaker tracking with an adaptive particle filter; X Qian, A Brutti, M Omologo, A Cavallaro; 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017	39	2017
Audio-visual cross-attention network for robotic speaker tracking; X Qian, Z Wang, J Wang, G Guan, H Li; IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 550-562, 2022	36	2022
Mamba in speech: Towards an alternative to self-attention; X Zhang, Q Zhang, H Liu, T Xiao, X Qian, B Ahmed, E Ambikairajah, H Li, ...; arXiv preprint arXiv:2405.12609, 2024	34	2024
Audio-visual tracking of concurrent speakers; X Qian, A Brutti, O Lanz, M Omologo, A Cavallaro; IEEE Transactions on Multimedia 24, 942-954, 2021	34	2021
Speaker extraction with co-speech gestures cue; Z Pan, X Qian, H Li; IEEE Signal Processing Letters 29, 1467-1471, 2022	28	2022
L³ F-TOUCH: A Wireless GelSight With Decoupled Tactile and Three-Axis Force Sensing; W Li, M Wang, J Li, Y Su, DK Jha, X Qian, K Althoefer, H Liu; IEEE Robotics and Automation Letters 8 (8), 5148-5155, 2023	25	2023
3D mouth tracking from a compact microphone array co-located with a camera; X Qian, A Xompero, A Cavallaro, A Brutti, O Lanz, M Omologo; 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	24	2018
GCC-PHAT with speech-oriented attention for robotic sound source localization; J Wang, X Qian, Z Pan, M Zhang, H Li; 2021 IEEE International Conference on Robotics and Automation (ICRA), 5876-5883, 2021	17	2021
Predict-and-update network: Audio-visual speech recognition inspired by human speech perception; J Wang, X Qian, H Li; IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024	15	2024
Deep audio-visual beamforming for speaker localization; X Qian, Q Zhang, G Guan, W Xue; IEEE Signal Processing Letters 29, 1132-1136, 2022	14	2022
Speech-oriented sparse attention denoising for voice user interface toward industry 5.0; H Zhu, Q Zhang, P Gao, X Qian; IEEE Transactions on Industrial Informatics 19 (2), 2151-2160, 2022	13	2022
Neural-free attention for monaural speech enhancement toward voice user interface for consumer electronics; M Chen, Q Zhang, Q Song, X Qian, R Guo, M Wang, D Chen; IEEE Transactions on Consumer Electronics 69 (4), 765-774, 2023	10	2023
A miniaturised camera-based multi-modal tactile sensor; K Althoefer, Y Ling, W Li, X Qian, WW Lee, P Qi; 2023 IEEE International Conference on Robotics and Automation (ICRA), 12570 …, 2023	8	2023
Iterative sound source localization for unknown number of sources; Y Fu, M Ge, H Yin, X Qian, L Wang, G Zhang, J Dang; arXiv preprint arXiv:2206.12273, 2022	8	2022
Audio-visual temporal forgery detection using embedding-level fusion and multi-dimensional contrastive loss; M Liu, J Wang, X Qian, H Li; IEEE Transactions on Circuits and Systems for Video Technology 34 (8), 6937-6948, 2023	7	2023
