On the hidden mystery of ocr in large multimodal models Y Liu, Z Li, H Li, W Yu, M Huang, D Peng, M Liu, M Chen, C Li, L Jin, X Bai arXiv preprint arXiv:2305.07895, 2023 | 176 | 2023 |
Detecting heads using feature refine net and cascaded multi-scale architecture D Peng, Z Sun, Z Chen, Z Cai, L Xie, L Jin 2018 24th International Conference on Pattern Recognition (ICPR), 2528-2533, 2018 | 87 | 2018 |
SPTS: Single-point text spotting D Peng, X Wang, Y Liu, J Zhang, M Huang, S Lai, J Li, S Zhu, D Lin, ... Proceedings of the 30th ACM International Conference on Multimedia, 4272-4281, 2022 | 63 | 2022 |
SPTS v2: single-point scene text spotting Y Liu, J Zhang, D Peng, M Huang, X Wang, J Tang, C Huang, D Lin, ... IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (12 …, 2023 | 53 | 2023 |
Exploring OCR capabilities of GPT-4V (ision): A quantitative and in-depth evaluation Y Shi, D Peng, W Liao, Z Lin, X Chen, C Liu, Y Zhang, L Jin arXiv preprint arXiv:2310.16809, 2023 | 53 | 2023 |
Recognition of handwritten Chinese text by segmentation: a segment-annotation-free approach D Peng, L Jin, W Ma, C Xie, H Zhang, S Zhu, J Li IEEE Transactions on Multimedia 25, 2368-2381, 2022 | 52 | 2022 |
Revisiting scene text recognition: A data perspective Q Jiang, J Wang, D Peng, C Liu, L Jin Proceedings of the IEEE/CVF international conference on computer vision …, 2023 | 43 | 2023 |
SLOGAN: handwriting style synthesis for arbitrary-length and out-of-vocabulary text C Luo, Y Zhu, L Jin, Z Li, D Peng IEEE transactions on neural networks and learning systems 34 (11), 8503-8515, 2022 | 33 | 2022 |
A fast and accurate fully convolutional network for end-to-end handwritten Chinese text segmentation and recognition D Peng, L Jin, Y Wu, Z Wang, M Cai 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019 | 33 | 2019 |
Estextspotter: Towards better scene text spotting with explicit synergy in transformer M Huang, J Zhang, D Peng, H Lu, C Huang, Y Liu, X Bai, L Jin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 32 | 2023 |
FontDiffuser: One-shot font generation via denoising diffusion with multi-scale content aggregation and style contrastive learning Z Yang, D Peng, Y Kong, Y Zhang, C Yao, L Jin Proceedings of the AAAI conference on artificial intelligence 38 (7), 6603-6611, 2024 | 31 | 2024 |
PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition D Peng, L Jin, Y Liu, C Luo, S Lai International Journal of Computer Vision, 1-23, 2022 | 31 | 2022 |
Zero-shot Chinese text recognition via matching class embedding Y Huang, L Jin, D Peng Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021 | 25 | 2021 |
Towards robust tampered text detection in document image: New dataset and new solution C Qu, C Liu, Y Liu, X Chen, D Peng, F Guo, L Jin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 22 | 2023 |
Implicit feature alignment: learn to convert text recognizer to text spotter T Wang, Y Zhu, L Jin, D Peng, Z Li, M He, Y Wang, C Luo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 15 | 2021 |
Towards fast, accurate and compact online handwritten Chinese text recognition D Peng, C Xie, H Li, L Jin, Z Xie, K Ding, Y Huang, Y Wu Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021 | 10 | 2021 |
Scale mapping and dynamic re-detecting in dense head detection Z Sun, D Peng, Z Cai, Z Chen, L Jin 2018 25th IEEE International Conference on Image Processing (ICIP), 1902-1906, 2018 | 10 | 2018 |
SideNet: Learning representations from interactive side information for zero-shot Chinese character recognition Z Li, Y Huang, D Peng, M He, L Jin Pattern Recognition 148, 110208, 2024 | 9 | 2024 |
ViTEraser: Harnessing the power of vision transformers for scene text removal with segmim pretraining D Peng, C Liu, Y Liu, L Jin Proceedings of the AAAI Conference on Artificial Intelligence 38 (5), 4468-4477, 2024 | 9 | 2024 |
M5HisDoc: A Large-scale Multi-style Chinese Historical Document Analysis Benchmark Y Shi, C Liu, D Peng, C Jian, J Huang, L Jin Advances in Neural Information Processing Systems 36, 2024 | 7 | 2024 |