Wenetspeech: A 10000+ hours multi-domain mandarin corpus for speech recognition B Zhang, H Lv, P Guo, Q Shao, C Yang, L Xie, X Xu, H Bu, X Chen, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 221 | 2022 |
Wenet 2.0: More productive end-to-end speech recognition toolkit B Zhang, D Wu, Z Peng, X Song, Z Yao, H Lv, L Xie, C Yang, F Pan, J Niu arXiv preprint arXiv:2203.15455, 2022 | 100 | 2022 |
Espresso: A fast end-to-end neural speech recognition toolkit Y Wang, T Chen, H Xu, S Ding, H Lv, Y Shao, N Peng, L Xie, S Watanabe, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 89 | 2019 |
Wake word detection with streaming transformers Y Wang, H Lv, D Povey, L Xie, S Khudanpur ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 43 | 2021 |
Wake word detection with alignment-free lattice-free MMI Y Wang, H Lv, D Povey, L Xie, S Khudanpur arXiv preprint arXiv:2005.08347, 2020 | 24 | 2020 |
The NNI Query-by-Example System for MediaEval 2014. P Yang, H Xu, X Xiao, L Xie, CC Leung, H Chen, J Yu, H Lv, L Wang, ... MediaEval, 2014 | 24 | 2014 |
The NNI Query-by-Example System for MediaEval 2015. J Hou, CCL Van Tung Pham, CC Leung, L Wang, H Xu, H Lv, L Xie, Z Fu, ... MediaEval, 2015 | 22 | 2015 |
Toward High-Performance Language-Independent Query-by-Example Spoken Term Detection for MediaEval 2015: Post-Evaluation Analysis. CC Leung, L Wang, H Xu, J Hou, Van Tung Pham, H Lv, L Xie, X Xiao, ... INTERSPEECH, 3703-3707, 2016 | 19 | 2016 |
Language independent query-by-example spoken term detection using n-best phone sequences and partial matching H Xu, P Yang, X Xiao, L Xie, CC Leung, H Chen, J Yu, H Lv, L Wang, ... 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 19 | 2015 |
Approximate search of audio queries by using DTW with phone time boundary and data augmentation H Xu, J Hou, X Xiao, CC Leung, L Wang, H Lv, L Xie, B Ma, ES Chng, H Li 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 16 | 2016 |
Acoustic Modeling from Frequency Domain Representations of Speech. P Ghahremani, H Hadian, H Lv, D Povey, S Khudanpur Interspeech, 1596-1600, 2018 | 13 | 2018 |
W-Infer-polation: Approximate reasoning via integrating weighted fuzzy rule inference and interpolation H Lv, F Li, C Shang, Q Shen Knowledge-Based Systems 258, 109995, 2022 | 7 | 2022 |
Context-aware RNNLM rescoring for conversational speech recognition K Wei, P Guo, H Lv, Z Tu, L Xie 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021 | 6 | 2021 |
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation K Wei, B Li, H Lv, Q Lu, N Jiang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 4 | 2024 |
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR P Guo, X Chang, H Lv, S Watanabe, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | 2 | 2024 |
An asynchronous WFST-based decoder for automatic speech recognition H Lv, Z Chen, H Xu, D Povey, L Xie, S Khudanpur ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 2 | 2021 |
LET-Decoder: A WFST-Based Lazy-Evaluation Token-Group Decoder With Exact Lattice Generation H Lv, D Povey, M Yarmohammadi, K Li, Y Wang, L Xie, S Khudanpur IEEE Signal Processing Letters 28, 703-707, 2021 | 2 | 2021 |
Minimizing sequential confusion error in speech command recognition Z Yang, H Lv, X Wang, A Zhang, L Xie arXiv preprint arXiv:2207.01261, 2022 | 1 | 2022 |
Incremental Lattice Determinization for WFST Decoders Z Chen, M Yarmohammadi, H Xu, H Lv, L Xie, D Povey, S Khudanpur 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2019 | 1 | 2019 |
Light but fruitful: enhanced fuzzy inference via weight-guided selection of rules with attribute weights F Li, H Lv, Q Shen International Journal of Systems Science, 1-13, 2024 | | 2024 |