Towards robust visual information extraction in real world: new dataset and novel solution J Wang, C Liu, L Jin, G Tang, J Zhang, S Zhang, Q Wang, Y Wu, M Cai Proceedings of the AAAI Conference on Artificial Intelligence 35 (4), 2738-2745, 2021 | 83 | 2021 |
High-performance all-polymer photodetectors via a thick photoactive layer strategy Z Zhong, K Li, J Zhang, L Ying, R Xie, G Yu, F Huang, Y Cao ACS applied materials & interfaces 11 (15), 14208-14214, 2019 | 65 | 2019 |
High-detectivity organic photodetectors based on a thick-film photoactive layer using a conjugated polymer containing a naphtho [1, 2-c: 5, 6-c] bis [1, 2, 5] thiadiazole unit Z Zeng, Z Zhong, W Zhong, J Zhang, L Ying, G Yu, F Huang, Y Cao Journal of Materials Chemistry C 7 (20), 6070-6076, 2019 | 38 | 2019 |
M6Doc: a large-scale multi-format, multi-type, multi-layout, multi-language, multi-annotation category dataset for modern document layout analysis H Cheng, P Zhang, S Wu, J Zhang, Q Zhu, Z Xie, J Li, K Ding, L Jin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 29 | 2023 |
Forgery-free signature verification with stroke-aware cycle-consistent generative adversarial network J Jiang, S Lai, L Jin, Y Zhu, J Zhang, B Chen Neurocomputing 507, 345-357, 2022 | 22 | 2022 |
Marior: Margin removal and iterative content rectification for document dewarping in the wild J Zhang, C Luo, L Jin, F Guo, K Ding arXiv preprint arXiv:2207.11515, 2022 | 20 | 2022 |
SaHAN: Scale-aware hierarchical attention network for scene text recognition J Zhang, C Luo, L Jin, T Wang, Z Li, W Zhou Pattern Recognition Letters 136, 205-211, 2020 | 14 | 2020 |
Looking from a higher-level perspective: Attention and recognition enhanced multi-scale scene text segmentation Y Ren, J Zhang, B Chen, X Zhang, L Jin Proceedings of the Asian Conference on Computer Vision, 3138-3154, 2022 | 13 | 2022 |
Scene table structure recognition with segmentation collaboration and alignment H Wang, Y Xue, J Zhang, L Jin Pattern Recognition Letters 165, 146-153, 2023 | 9 | 2023 |
Appearance enhancement for camera-captured document images in the wild J Zhang, L Liang, K Ding, F Guo, L Jin IEEE Transactions on Artificial Intelligence 5 (5), 2319-2330, 2023 | 8 | 2023 |
Dockylin: A large multimodal model for visual document understanding with efficient visual slimming J Zhang, W Yang, S Lai, Z Xie, L Jin arXiv preprint arXiv:2406.19101, 2024 | 7 | 2024 |
UPOCR: Towards unified pixel-level ocr interface D Peng, Z Yang, J Zhang, C Liu, Y Shi, K Ding, F Guo, L Jin Forty-first International Conference on Machine Learning, 2024 | 6 | 2024 |
DocAligner: Annotating real-world photographic document images by simply taking pictures J Zhang, B Chen, H Cheng, F Guo, K Ding, L Jin arXiv preprint arXiv:2306.05749, 2023 | 6 | 2023 |
White polymer light-emitting diodes with ultra-large color shifts for pulse-width-modulation applications J Zhang, F Peng, Z Zhong, L Ying, Y Cao Journal of Materials Chemistry C 7 (34), 10567-10573, 2019 | 6 | 2019 |
Docres: a generalist model toward unifying document image restoration tasks J Zhang, D Peng, C Liu, P Zhang, L Jin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 5 | 2024 |
Complex table structure recognition in the wild using transformer and identity matrix-based augmentation B Chen, D Peng, J Zhang, Y Ren, L Jin International Conference on Frontiers in Handwriting Recognition, 545-561, 2022 | 5 | 2022 |
Textsrnet: scene text super-resolution based on contour prior and atrous convolution J Ma, L Jin, J Zhang, J Jiang, Y Xue, M He 2022 26th International Conference on Pattern Recognition (ICPR), 3252-3258, 2022 | 1 | 2022 |
QT-TextSR: Enhancing scene text image super-resolution via efficient interaction with text recognition using a Query-aware Transformer C Liu, Q Jiang, D Peng, Y Kong, J Zhang, L Xiong, J Duan, C Sun, L Jin Neurocomputing 620, 129241, 2025 | | 2025 |
Smaller But Better: Unifying Layout Generation with Smaller Large Language Models P Zhang, J Zhang, J Cao, H Li, L Jin International Journal of Computer Vision, 1-27, 2025 | | 2025 |
Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs H Li, J Zhang, W Liao, D Peng, K Ding, L Jin arXiv preprint arXiv:2501.19036, 2025 | | 2025 |