Urmăriți
Zhenrong Zhang
Zhenrong Zhang
Adresă de e-mail confirmată pe mail.ustc.edu.cn
Titlu
Citat de
Citat de
Anul
Split, embed and merge: An accurate table structure recognizer
Z Zhang, J Zhang, J Du, F Wang
Pattern Recognition 126, 108565, 2022
582022
Multimodal pre-training based on graph attention network for document understanding
Z Zhang, J Ma, J Du, L Wang, J Zhang
IEEE Transactions on Multimedia 25, 6743-6755, 2022
472022
SEMv2: Table Separation Line Detection Based on Instance Segmentation
Z Zhang, P Hu, J Ma, J Du, J Zhang, H Zhu, B Yin, B Yin, C Liu
Pattern Recognition, 2023
15*2023
Hrdoc: Dataset and baseline method toward hierarchical reconstruction of document structures
J Ma, J Du, P Hu, Z Zhang, J Zhang, H Zhu, C Liu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1870-1877, 2023
132023
Multimodal tree decoder for table of contents extraction in document images
P Hu, Z Zhang, J Zhang, J Du, J Wu
2022 26th international conference on pattern recognition (ICPR), 1756-1762, 2022
132022
Quality-aware masked diffusion transformer for enhanced music generation
C Li, R Wang, L Liu, J Du, Y Sun, Z Guo, Z Zhang, Y Jiang
arXiv e-prints, arXiv: 2405.15863, 2024
72024
Generate, transform, and clean: the role of GANs and transformers in palm leaf manuscript generation and enhancement
N Thuon, J Du, Z Zhang, J Ma, P Hu
International Journal on Document Analysis and Recognition (IJDAR) 27 (3 …, 2024
42024
Count, decode and fetch: A new approach to handwritten chinese character error correction
P Hu, J Ma, Z Zhang, J Du, J Zhang
arXiv preprint arXiv:2307.16253, 2023
32023
USTC-iFLYTEK at DocILE: A Multi-modal Approach Using Domain-specific GraphDoc.
Y Wang, J Du, J Ma, P Hu, Z Zhang, J Zhang
CLEF (Working Notes), 598-610, 2023
32023
Accurate oriented instance segmentation in aerial images
ZR Zhang, J Du
Image and Graphics: 11th International Conference, ICIG 2021, Haikou, China …, 2021
32021
Semv3: A fast and robust approach to table separation line detection
C Qin, Z Zhang, P Hu, C Liu, J Ma, J Du
arXiv preprint arXiv:2405.11862, 2024
22024
Bidirectional trained tree-structured decoder for handwritten mathematical expression recognition
H Cheng, C Liu, P Hu, Z Zhang, J Ma, J Du
arXiv preprint arXiv:2401.00435, 2023
12023
Scene Text Recognition with Self-supervised Contrastive Predictive Coding
X Jiang, J Zhang, J Du, Z Zhang, J Wu
2022 26th International Conference on Pattern Recognition (ICPR), 1514-1521, 2022
12022
SRFUND: A Multi-Granularity Hierarchical Structure Reconstruction Benchmark in Form Understanding
J Ma, Y Wang, C Liu, J Du, Y Hu, Z Zhang, P Hu, Q Wang, J Zhang
Advances in Neural Information Processing Systems 37, 112411-112432, 2025
2025
See then Tell: Enhancing Key Information Extraction with Vision Grounding
S Liu, Z Zhang, P Hu, J Ma, J Du, Q Wang, J Zhang, C Liu
arXiv preprint arXiv:2409.19573, 2024
2024
UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition
Z Zhang, S Liu, P Hu, J Ma, J Du, J Zhang, Y Hu
Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
2024
DocMamba: Efficient Document Pre-training with State Space Model
P Hu, Z Zhang, J Ma, S Liu, J Du, J Zhang
arXiv preprint arXiv:2409.11887, 2024
2024
Viewing Writing as Video: Optical Flow based Multi-Modal Handwritten Mathematical Expression Recognition
H Cheng, J Du, P Hu, J Ma, Z Zhang, M Xue
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
2024
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–18