Ernie-vil: Knowledge enhanced vision-language representations through scene graphs F Yu, J Tang, W Yin, Y Sun, H Tian, H Wu, H Wang Proceedings of the AAAI conference on artificial intelligence 35 (4), 3208-3216, 2021 | 411 | 2021 |
Ernie-vilg 2.0: Improving text-to-image diffusion model with knowledge-enhanced mixture-of-denoising-experts Z Feng, Z Zhang, X Yu, Y Fang, L Li, X Chen, Y Lu, J Liu, W Yin, S Feng, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 126 | 2023 |
Ernie-layout: Layout knowledge enhanced pre-training for visually-rich document understanding Q Peng, Y Pan, W Wang, B Luo, Z Zhang, Z Huang, T Hu, W Yin, Y Chen, ... arXiv preprint arXiv:2210.06155, 2022 | 79 | 2022 |
Ernie-vilg: Unified generative pre-training for bidirectional vision-language generation H Zhang, W Yin, Y Fang, L Li, B Duan, Z Wu, Y Sun, H Tian, H Wu, ... arXiv preprint arXiv:2112.15283, 2021 | 61 | 2021 |
Mmlayout: multi-grained multimodal transformer for document understanding W Wang, Z Huang, B Luo, Q Chen, Q Peng, Y Pan, W Yin, S Feng, Y Sun, ... Proceedings of the 30th ACM International Conference on Multimedia, 4877-4886, 2022 | 19 | 2022 |
Ernie-vil 2.0: Multi-view contrastive learning for image-text pre-training B Shan, W Yin, Y Sun, H Tian, H Wu, H Wang arXiv preprint arXiv:2209.15270, 2022 | 19 | 2022 |
Alpha at semeval-2021 task 6: Transformer based propaganda classification Z Feng, J Tang, J Liu, W Yin, S Feng, Y Sun, L Chen Proceedings of the 15th International Workshop on Semantic Evaluation …, 2021 | 17 | 2021 |
ERNIE-ViL: Knowledge enhanced vision-language representations through scene graphs Y Fei, T Jiji, Y Weichong, S Yu, T Hao, W Hua, W Haifeng Proceedings of the AAAI Conference on Artificial Intelligence 35, 3208-3216, 2021 | 10 | 2021 |
Ernie-unix2: A unified cross-lingual cross-modal framework for understanding and generation B Shan, Y Han, W Yin, S Wang, Y Sun, H Tian, H Wu, H Wang arXiv preprint arXiv:2211.04861, 2022 | 5 | 2022 |
A novel multi-view object class detection framework for document image content analysis W Yin, T Lu, F Su 2013 12th International Conference on Document Analysis and Recognition …, 2013 | 2 | 2013 |
Method for generating target object, electronic device, and storage medium LI Yukun, H Zhang, YIN Weichong, X Dongling, Y Sun, H Tian US Patent App. 17/835,717, 2022 | 1 | 2022 |
Orthogonal Finetuning for Direct Preference Optimization C Yang, R Jia, N Gu, Z Lin, S Chen, C Pang, W Yin, Y Sun, H Wu, ... arXiv preprint arXiv:2409.14836, 2024 | | 2024 |
Multi-modal pre-training model acquisition method, electronic device and storage medium F Yu, T Jiji, YIN Weichong, Y Sun, H Tian, H Wu, H Wang US Patent 11,928,432, 2024 | | 2024 |
Method and apparatus for processing document image, and electronic device W Wang, Z Huang, B Luo, Q Peng, YIN Weichong, F Shikun, S Huang, ... US Patent App. 18/181,800, 2023 | | 2023 |