팔로우
Ming Cheng | 程铭
Ming Cheng | 程铭
Wuhan University | Duke Kunshan University
whu.edu.cn의 이메일 확인됨
제목
인용
인용
연도
RWF-2000: An Open Large Scale Video Database for Violence Detection
M Cheng, K Cai, M Li
2020 25th International Conference on Pattern Recognition (ICPR), 4183-4190, 2021
2592021
Target-Speaker Voice Activity Detection via Sequence-to-Sequence Prediction
M Cheng, W Wang, Y Zhang, X Qin, M Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
362023
Computer-Aided Autism Spectrum Disorder Diagnosis with Behavior Signal Processing
M Cheng, Y Zhang, Y Xie, Y Pan, X Li, W Liu, C Yu, D Zhang, Y Xing, ...
IEEE Transactions on Affective Computing 14 (4), 2982-3000, 2023
172023
The DKU Audio-Visual Wake Word Spotting System for the 2021 MISP Challenge
M Cheng, H Wang, Y Wang, M Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
142022
Voxblink: A Large Scale Speaker Verification Dataset on Camera
Y Lin, X Qin, G Zhao, M Cheng, N Jiang, H Wu, M Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
122024
The DKU Post-Challenge Audio-Visual Wake Word Spotting System for The 2021 MISP Challenge: Deep Analysis
H Wang, M Cheng, Q Fu, M Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
102023
The DKU-DukeECE Diarization System for the VoxCeleb Speaker Recognition Challenge 2022
W Wang, X Qin, M Cheng, Y Zhang, K Wang, M Li
arXiv preprint arXiv:2210.01677, 2022
92022
Multi-Input Multi-Output Target-Speaker Voice Activity Detection for Unified, Flexible, and Robust Audio-Visual Speaker Diarization
M Cheng, M Li
arXiv preprint arXiv:2401.08052, 2024
82024
The WHU-Alibaba Audio-Visual Speaker Diarization System for the MISP 2022 Challenge
M Cheng, H Wang, Z Wang, Q Fu, M Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
82023
The DKU-MSXF Diarization System for the Voxceleb Speaker Recognition Challenge 2023
M Cheng, W Wang, X Qin, Y Lin, N Jiang, G Zhao, M Li
National Conference on Man-Machine Speech Communication, 330-337, 2023
72023
Voxblink2: A 100k+ speaker recognition corpus and the open-set speaker-identification benchmark
Y Lin, M Cheng, F Zhang, Y Gao, S Zhang, M Li
arXiv preprint arXiv:2407.11510, 2024
62024
Responsive Social Smile: A Machine Learning based Multimodal Behavior Assessment Framework towards Early Stage Autism Screening
Y Pan, K Cai, M Cheng, X Zou, M Li
2020 25th International Conference on Pattern Recognition (ICPR), 2240-2247, 2021
62021
Efficient Personal Voice Activity Detection with Wake Word Reference Speech
B Zeng, M Cheng, Y Tian, H Liu, M Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
42024
Robust Wake Word Spotting With Frame-Level Cross-Modal Attention Based Audio-Visual Conformer
H Wang, M Cheng, Q Fu, M Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
32024
Assessing The Social Skills of Children with Autism Spectrum Disorder via Language-Image Pre-training Models
W Liu, M Cheng, Y Pan, L Yuan, S Hu, M Li, S Zeng
22023
Joint Inference of Speaker Diarization and ASR with Multi-Stage Information Sharing
W Wang, D Cai, M Cheng, M Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
12024
Cross-modal Assisted Training for Abnormal Event Recognition in Elevators
X Chen, X Gong, M Cheng, Q Deng, M Li
Proceedings of the 2021 International Conference on Multimodal Interaction …, 2021
12021
Sequence-to-Sequence Neural Diarization with Automatic Speaker Detection and Representation
M Cheng, Y Lin, M Li
arXiv preprint arXiv:2411.13849, 2024
2024
A Multimodal Dynamic Neural Network for Call for Help Recognition in Elevators
R Ju, H Chu, Y Wang, Q Deng, M Cheng, M Li
Companion Publication of the 2021 International Conference on Multimodal …, 2021
2021
현재 시스템이 작동되지 않습니다. 나중에 다시 시도해 주세요.
학술자료 1–19