Deep speech 2: End-to-end speech recognition in english and mandarin D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International conference on machine learning, 173-182, 2016 | 3913 | 2016 |
Cbnet: A composite backbone network architecture for object detection T Liang, X Chu, Y Liu, Y Wang, Z Tang, W Chu, J Chen, H Ling IEEE Transactions on Image Processing 31, 6893-6906, 2022 | 209 | 2022 |
Deployed end-to-end speech recognition B Catanzaro, J Chen, M Chrzanowski, E Elsen, J Engel, C Fougner, ... US Patent 10,319,374, 2019 | 136 | 2019 |
End-to-end speech recognition B Catanzaro, J Chen, M Chrzanowski, E Elsen, J Engel, C Fougner, ... US Patent 10,332,509, 2019 | 111 | 2019 |
Cmua-watermark: A cross-model universal adversarial watermark for combating deepfakes H Huang, Y Wang, Z Chen, Y Zhang, Y Li, Z Tang, W Chu, J Chen, W Lin, ... Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 989-997, 2022 | 103 | 2022 |
DKDFN: Domain knowledge-guided deep collaborative fusion network for multimodal unitemporal remote sensing land cover classification Y Li, Y Zhou, Y Zhang, L Zhong, J Wang, J Chen ISPRS Journal of Photogrammetry and Remote Sensing 186, 170-189, 2022 | 100 | 2022 |
Skysense: A multi-modal remote sensing foundation model towards universal interpretation for earth observation imagery X Guo, J Lao, B Dang, Y Zhang, L Yu, L Ru, L Zhong, Z Huang, K Wu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 93 | 2024 |
Lpsnet: A lightweight solution for fast panoptic segmentation W Hong, Q Guo, W Zhang, J Chen, W Chu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 38 | 2021 |
Siman: Exploring self-supervised representation learning of scene text via similarity-aware normalization C Luo, L Jin, J Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 37 | 2022 |
Matchvie: Exploiting match relevancy between entities for visual information extraction G Tang, L Xie, L Jin, J Wang, J Chen, Z Xu, Q Wang, Y Wu, H Li arXiv preprint arXiv:2106.12940, 2021 | 35 | 2021 |
Gilbert: Generative vision-language pre-training for image-text retrieval W Hong, K Ji, J Liu, J Wang, J Chen, W Chu Proceedings of the 44th International ACM SIGIR Conference on Research and …, 2021 | 34 | 2021 |
Farmland parcel mapping in mountain areas using time-series SAR data and VHR optical images W Liu, J Wang, J Luo, Z Wu, J Chen, Y Zhou, Y Sun, Z Shen, N Xu, ... Remote Sensing 12 (22), 3733, 2020 | 34 | 2020 |
Hierarchical memory learning for fine-grained scene graph generation Y Deng, Y Li, Y Zhang, X Xiang, J Wang, J Chen, J Ma European Conference on Computer Vision, 266-283, 2022 | 26 | 2022 |
Cret: Cross-modal retrieval transformer for efficient text-video retrieval K Ji, J Liu, W Hong, L Zhong, J Wang, J Chen, W Chu Proceedings of the 45th international ACM SIGIR conference on research and …, 2022 | 17 | 2022 |
Training object detectors from scratch: An empirical study in the era of vision transformer W Hong, J Lao, W Ren, J Wang, J Chen, W Chu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 16 | 2022 |
Variational connectionist temporal classification L Chao, J Chen, W Chu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 13 | 2020 |
Simultaneously short-and long-term temporal modeling for semi-supervised video semantic segmentation J Lao, W Hong, X Guo, Y Zhang, J Wang, J Chen, W Chu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 11 | 2023 |
Animate-x: Universal character image animation with enhanced motion representation S Tan, B Gong, X Wang, S Zhang, D Zheng, R Zheng, K Zheng, J Chen, ... arXiv preprint arXiv:2410.10306, 2024 | 4 | 2024 |
Fine-grained Pseudo Labels for Scene Text Recognition X Li, X Chen, Z Huang, L Xie, J Chen, M Yang Proceedings of the 31st ACM International Conference on Multimedia, 5786-5795, 2023 | 3 | 2023 |
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models W Li, M Fang, C Zou, B Gong, R Zheng, M Wang, J Chen, M Yang European Conference on Computer Vision, 110-126, 2024 | 1 | 2024 |