mPLUG-Owl: Modularization empowers large language models with multimodality Q Ye, H Xu, G Xu, J Ye, M Yan, Y Zhou, J Wang, A Hu, P Shi, Y Shi, C Li, ... arXiv preprint arXiv:2304.14178, 2023 | 844 | 2023 |
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections C Li, H Xu, J Tian, W Wang, M Yan, B Bi, J Ye, ... EMNLP, 2022 | 169* | 2022 |
UReader: Universal OCR-free visually-situated language understanding with multimodal large language model J Ye, A Hu, H Xu, Q Ye, M Yan, G Xu, C Li, J Tian, Q Qian, J Zhang, Q Jin, ... arXiv preprint arXiv:2310.05126, 2023 | 120 | 2023 |
ECNU at SemEval-2017 Task 1: Leverage kernel-based traditional NLP features and neural networks to build a universal model for multilingual and cross-lingual semantic textual … J Tian, Z Zhou, M Lan, Y Wu Proceedings of the 11th International Workshop on Semantic Evaluation …, 2017 | 120* | 2017 |
mPLUG-DocOwl: Modularized multimodal large language model for document understanding J Ye, A Hu, H Xu, Q Ye, M Yan, Y Dan, C Zhao, G Xu, C Li, J Tian, Q Qi, ... arXiv preprint arXiv:2307.02499, 2023 | 114 | 2023 |
SentiX: A sentiment-aware pre-trained model for cross-domain sentiment analysis J Zhou, J Tian, R Wang, Y Wu, W Xiao, L He Proceedings of the 28th International Conference on Computational …, 2020 | 102 | 2020 |
Shifting more attention to visual backbone: Query-modulated refinement networks for end-to-end visual grounding J Ye, J Tian, M Yan, X Yang, X Wang, J Zhang, L He, X Lin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 76 | 2022 |
Multi-domain dialogue acts and response co-generation K Wang, J Tian, R Wang, X Quan, J Yu arXiv preprint arXiv:2004.12363, 2020 | 65 | 2020 |
WikiDiverse: A Multimodal Entity Linking Dataset with Diversified Contextual Topics and Entity Types X Wang, J Tian, M Gui, Z Li, R Wang, M Yan, L Chen, Y Xiao Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022 | 56* | 2022 |
A multi-task learning approach for improving product title compression with user search log data J Wang, J Tian, L Qiu, S Li, J Lang, L Si, M Lan Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018 | 49 | 2018 |
CAT-MNER: multimodal named entity recognition with knowledge-refined cross-modal attention X Wang, J Ye, Z Li, J Tian, Y Jiang, M Yan, J Zhang, Y Xiao 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | 47 | 2022 |
Sentiment-aware multimodal pre-training for multimodal sentiment analysis J Ye, J Zhou, J Tian, R Wang, J Zhou, T Gui, Q Zhang, X Huang Knowledge-Based Systems 258, 110021, 2022 | 44 | 2022 |
PromptMNER: Prompt-Based Entity-Related Visual Clue Extraction and Integration for Multimodal Named Entity Recognition X Wang, J Tian, M Gui, Z Li, J Ye, M Yan, Y Xiao Database Systems for Advanced Applications: 27th International Conference …, 2022 | 32* | 2022 |
MinD at SemEval-2021 Task 6: Propaganda detection using transfer learning and multimodal fusion J Tian, M Gui, C Li, M Yan, W Xiao Proceedings of the 15th International Workshop on Semantic Evaluation …, 2021 | 22 | 2021 |
Attention optimization for abstractive document summarization M Gui, J Tian, R Wang, Z Yang arXiv preprint arXiv:1910.11491, 2019 | 22 | 2019 |
ECNU at SemEval-2016 Task 1: Leveraging word embedding from macro and micro views to boost performance for semantic textual similarity J Tian, M Lan Proceedings of the 10th International Workshop on Semantic Evaluation …, 2016 | 16 | 2016 |
ECNU: using traditional similarity measurements and word embedding for semantic textual similarity estimation J Zhao, M Lan, J Tian Proceedings of the 9th International Workshop on Semantic Evaluation …, 2015 | 13 | 2015 |
Achieving Human Parity on Visual Question Answering M Yan, H Xu, C Li, J Tian, B Bi, W Wang, X Xu, J Zhang, S Huang, ... ACM Transactions on Information Systems 41 (3), 1-40, 2023 | 10 | 2023 |
ChatPLUG: Open-domain generative dialogue system with internet-augmented instruction tuning for digital human J Tian, H Chen, G Xu, M Yan, X Gao, J Zhang, C Li, J Liu, W Xu, H Xu, ... arXiv preprint arXiv:2304.07849, 2023 | 9 | 2023 |
Grid-VLP: Revisiting grid features for vision-language pre-training M Yan, H Xu, C Li, B Bi, J Tian, M Gui, W Wang arXiv preprint arXiv:2108.09479, 2021 | 9 | 2021 |