Jingqun Tang

Cited by

	All	Since 2020
Citations	389	389
h-index	10	10
i10-index	11	11

240

120

180

202220232024202510 60 235 84

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Xiang BaiHuazhong University of Science and Technology (HUST)Verified email at hust.edu.cn
Hao LIUResearcher, ByteDance; Previously TencentVerified email at bytedance.com
Hao FengPh.D., University of Science and Technology of China; Researcher, ByteDanceVerified email at mail.ustc.edu.cn
Bing-hong WuByteDance, BaiduVerified email at bytedance.com
Jinghui LuByteDance Inc., School of Computer Science, University College DublinVerified email at bytedance.com
Yuan XieFull Professor, School of Computer Science and Technology, East China Normal UniversityVerified email at ia.ac.cn
Wengang ZhouProfessor, EEIS Department, University of Science and Technology of ChinaVerified email at ustc.edu.cn
Mingxin HuangSouth China University of TechnologyVerified email at mail.scut.edu.cn
Lianwen JinProfessor of Electronic and Information Engineering, South China University of TechnologyVerified email at scut.edu.cn
Jiaxin ZhangVerified email at mail.scut.edu.cn
Dahua LinThe Chinese University of Hong KongVerified email at ie.cuhk.edu.hk
Chunhua ShenZhejiang UniversityVerified email at zju.edu.cn
Dimitrios KanoulasProfessor in Robotics and AI, UKRI FLF, University College London (UCL), Archimedes/Athena RCVerified email at ucl.ac.uk

Jingqun Tang

ByteDance Inc.

Verified email at bytedance.com

Computer Vision Document Understanding MLLM


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Few could be better than all: Feature sampling and grouping for scene text detection J Tang, W Zhang, H Liu, MK Yang, B Jiang, G Hu, X Bai Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	100	2022
Spts v2: single-point scene text spotting Y Liu, J Zhang, D Peng, M Huang, X Wang, J Tang, C Huang, D Lin, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023	51	2023
Docpedia: Unleashing the power of large multimodal model in the frequency domain for versatile document understanding H Feng, Q Liu, H Liu, J Tang, W Zhou, H Li, C Huang Science China Information Sciences 2024, 2023	42	2023
Unidoc: A universal large multimodal model for simultaneous text detection, recognition, spotting and understanding H Feng, Z Wang, J Tang, J Lu, W Zhou, H Li, C Huang arXiv preprint arXiv:2308.11592, 2023	36	2023
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering J Tang, Q Liu, Y Ye, J Lu, S Wei, C Lin, W Li, MFFB Mahmood, H Feng, ... arXiv preprint arXiv:2405.11985, 2024	25	2024
You can even annotate text with voice: Transcription-only-supervised text spotting J Tang, S Qiao, B Cui, Y Ma, S Zhang, D Kanoulas Proceedings of the 30th ACM International Conference on Multimedia, 4154-4163, 2022	22	2022
TextSquare: Scaling up Text-Centric Visual Instruction Tuning J Tang, C Lin, Z Zhao, S Wei, B Wu, Q Liu, H Feng, Y Li, S Wang, L Liao, ... arXiv preprint arXiv:2404.12803, 2024	20	2024
Optimal boxes: boosting end-to-end scene text recognition by adjusting annotated bounding boxes via reinforcement learning J Tang, W Qian, L Song, X Dong, L Li, X Bai European Conference on Computer Vision, 233-248, 2022	17	2022
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer Z Zhao, J Tang, B Wu, C Lin, H Liu, Z Zhang, X Tan, C Huang, Y Xie Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	15	2023
Tabpedia: Towards comprehensive visual table understanding with concept synergy W Zhao, H Feng, Q Liu, J Tang, S Wei, B Wu, L Liao, Y Ye, H Liu, W Zhou, ... arXiv preprint arXiv:2406.01326 (NeurIPS 2024), 2024	11	2024
Character recognition competition for street view shop signs J Tang, W Du, B Wang, W Zhou, S Mei, T Xue, X Xu, H Zhang National Science Review 10 (6), nwad141, 2023	10	2023
Harmonizing Visual Text Comprehension and Generation Z Zhao, J Tang, B Wu, C Lin, S Wei, H Liu, X Tan, Z Zhang, C Huang, ... arXiv preprint arXiv:2407.16364 (NeurIPS 2024), 2024	9	2024
Cell-cell contact-driven EphB1 cis-and trans-signalings regulate cancer stem cells enrichment after chemotherapy L Wang, Q Peng, Y Xie, N Yin, J Xu, A Chen, J Yi, W Shi, J Tang, J Xiang Cell Death & Disease 13 (11), 980, 2022	9	2022
Pargo: Bridging vision-language with partial and global views AL Wang, B Shan, W Shi, KY Lin, X Fei, G Tang, L Liao, J Tang, C Huang, ... arXiv preprint arXiv:2408.12928 (AAAI 2025), 2024	7	2024
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance W Sun, B Cui, J Tang, XM Dong arXiv preprint arXiv:2412.12974 (AAAI 2025), 2024	6	2024
A bounding box is worth one token: Interleaving layout and text in a large language model for document understanding J Lu, H Yu, Y Wang, Y Ye, J Tang, Z Yang, B Wu, Q Liu, H Feng, H Wang, ... arXiv preprint arXiv:2407.01976, 2024	5	2024
Mctbench: Multimodal cognition towards text-rich visual scenes benchmark B Shan, X Fei, W Shi, AL Wang, G Tang, L Liao, J Tang, X Bai, C Huang arXiv preprint arXiv:2410.11538, 2024	4	2024
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning L Fu, B Yang, Z Kuang, J Song, Y Li, L Zhu, Q Luo, X Wang, H Lu, ... arXiv preprint arXiv:2501.00321, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors