VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research X Wang, J Wu, J Chen, L Li, YF Wang, WY Wang ICCV 2019, 2019 | 612 | 2019 |
Meta multi-task learning for sequence modeling J Chen, X Qiu, P Liu, X Huang AAAI 2018, 2018 | 110 | 2018 |
Fused acoustic and text encoding for multimodal bilingual pretraining and speech translation R Zheng, J Chen, M Ma, L Huang ICML 2021, 2021 | 72 | 2021 |
A³T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing H Bai, R Zheng, J Chen, M Ma, X Li, L Huang International Conference on Machine Learning, 1399-1411, 2022 | 47 | 2022 |
Dropattention: A regularization method for fully-connected self-attention networks L Zehui, P Liu, L Huang, J Chen, X Qiu, X Huang arXiv preprint arXiv:1907.11065, 2019 | 43 | 2019 |
Same representation, different attentions: Shareable sentence representation learning from multiple tasks R Zheng, J Chen, X Qiu IJCAI 2018, 2018 | 34 | 2018 |
SpecRec: An Alternative Solution for Improving End-to-End Speech-to-Text Translation via Spectrogram Reconstruction J Chen, M Ma, R Zheng, L Huang Proc. Interspeech 2021, 2232-2236, 2021 | 31 | 2021 |
Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR J Chen, M Ma, R Zheng, L Huang Findings of ACL-21 (short), 2021 | 29 | 2021 |
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit H Zhang, T Yuan, J Chen, X Li, R Zheng, Y Huang, X Chen, E Gong, ... NAACL-2022 Demo Track (Best Demo Award), 2022 | 26 | 2022 |
Improving simultaneous translation by incorporating pseudo-references with fewer reorderings J Chen, R Zheng, A Kita, M Ma, L Huang EMNLP 2021, 2020 | 21 | 2020 |
Exploring shared structures and hierarchies for multiple NLP tasks J Chen, K Chen, X Chen, X Qiu, X Huang arXiv preprint arXiv:1808.07658, 2018 | 19 | 2018 |
Token-level serialized output training for joint streaming ASR and ST leveraging textual alignments S Papi, P Wang, J Chen, J Xue, J Li, Y Gaur 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 10 | 2023 |
Diarist: Streaming Speech Translation with Speaker Diarization M Yang, N Kanda, X Wang, J Chen, P Wang, J Xue, J Li, T Yoshioka ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 5 | 2024 |
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation S Papi, P Wang, J Chen, J Xue, N Kanda, J Li, Y Gaur ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach J Chen, J Xue, P Wang, J Pan, J Li 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 3 | 2023 |
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation P Wang, J Xue, J Li, J Chen, AS Subramanian arXiv preprint arXiv:2406.10276, 2024 | | 2024 |
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech X Fan, C Pang, T Yuan, H Bai, R Zheng, P Zhu, S Wang, J Chen, Z Chen, ... arXiv preprint arXiv:2211.03545, 2022 | | 2022 |
Data-Driven Adaptive Simultaneous Machine Translation G Xun, M Ma, Y Bian, X Cai, J Huang, R Zheng, J Chen, J Yuan, K Church, ... arXiv preprint arXiv:2204.12672, 2022 | | 2022 |