SEMv2: Table separation line detection based on instance segmentation Z Zhang, P Hu, J Ma, J Du, J Zhang, H Zhu, B Yin, B Yin, C Liu Pattern Recognition 149, 110279, 2024 | 15* | 2024 |
HRDoc: Dataset and Baseline Method Toward Hierarchical Reconstruction of Document Structures J Ma, J Du, P Hu, Z Zhang, J Zhang, H Zhu, C Liu AAAI 2023, 2023 | 13 | 2023 |
Multimodal tree decoder for table of contents extraction in document images P Hu, Z Zhang, J Zhang, J Du, J Wu 2022 26th international conference on pattern recognition (ICPR), 1756-1762, 2022 | 13 | 2022 |
Generate, transform, and clean: the role of GANs and transformers in palm leaf manuscript generation and enhancement N Thuon, J Du, Z Zhang, J Ma, P Hu International Journal on Document Analysis and Recognition (IJDAR) 27 (3 …, 2024 | 4 | 2024 |
Exploring audio-visual information fusion for sound event localization and detection in low-resource realistic scenarios Y Jiang, Q Wang, J Du, M Hu, P Hu, Z Liu, S Cheng, Z Nian, Y Dong, ... 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024 | 4 | 2024 |
Count, decompose and correct: A new approach to handwritten Chinese character error correction P Hu, J Ma, Z Zhang, J Du, J Zhang Pattern Recognition 160, 111110, 2025 | 3* | 2025 |
Hierarchical audio-visual information fusion with multi-label joint decoding for mer 2023 H Wang, Y Xi, H Chen, J Du, Y Song, Q Wang, H Zhou, C Wang, J Ma, ... Proceedings of the 31st ACM International Conference on Multimedia, 9531-9535, 2023 | 3 | 2023 |
USTC-iFLYTEK at DocILE: A Multi-modal Approach Using Domain-specific GraphDoc. Y Wang, J Du, J Ma, P Hu, Z Zhang, J Zhang CLEF (Working Notes), 598-610, 2023 | 3 | 2023 |
Semv3: A fast and robust approach to table separation line detection C Qin, Z Zhang, P Hu, C Liu, J Ma, J Du arXiv preprint arXiv:2405.11862, 2024 | 2 | 2024 |
Group, contrast and recognize: a self-supervised method for chinese character recognition X Jiang, J Du, P Hu, M Xue, J Ma, J Wu, J Zhang International Conference on Document Analysis and Recognition, 411-427, 2023 | 2 | 2023 |
Bidirectional trained tree-structured decoder for handwritten mathematical expression recognition H Cheng, C Liu, P Hu, Z Zhang, J Ma, J Du arXiv preprint arXiv:2401.00435, 2023 | 1 | 2023 |
Skeleton and Font Generation Network for Zero-shot Chinese Character Generation M Xue, J Du, Z Zhang, J Ma, Q Chang, P Hu, J Zhang, Y Hu arXiv preprint arXiv:2501.08062, 2025 | | 2025 |
RFL: Simplifying Chemical Structure Recognition with Ring-Free Language Q Chang, M Chen, C Pi, P Hu, Z Zhang, J Ma, J Du, B Yin, J Hu arXiv preprint arXiv:2412.07594, 2024 | | 2024 |
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation H Cheng, L Lin, C Liu, P Xia, P Hu, J Ma, J Du, J Pan arXiv preprint arXiv:2410.13726, 2024 | | 2024 |
See then Tell: Enhancing Key Information Extraction with Vision Grounding S Liu, Z Zhang, P Hu, J Ma, J Du, Q Wang, J Zhang, C Liu arXiv preprint arXiv:2409.19573, 2024 | | 2024 |
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition Z Zhang, S Liu, P Hu, J Ma, J Du, J Zhang, Y Hu arXiv preprint arXiv:2409.13148, 2024 | | 2024 |
DocMamba: Efficient Document Pre-training with State Space Model P Hu, Z Zhang, J Ma, S Liu, J Du, J Zhang arXiv preprint arXiv:2409.11887, 2024 | | 2024 |
ICDAR 2024 Competition on Recognition of Chemical Structures M Chen, H Wu, Q Chang, H Cheng, J Ma, P Hu, Z Zhang, C Liu, C Pi, J Hu, ... International Conference on Document Analysis and Recognition, 397-409, 2024 | | 2024 |
Radical Similarity Based Model Optimization and Post-correction for Chinese Character Recognition Z Han, J Du, M Xue, J Ma, P Hu, Z Zhang International Conference on Document Analysis and Recognition, 152-168, 2024 | | 2024 |
Maths: Multimodal Transformer-Based Human-Readable Solver Y Pan, Z Zhang, J Ma, P Hu, J Du, Q Wang, J Zhang, D Liu, S Wei 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024 | | 2024 |