Damo-yolo: A report on real-time object detection design X Xu, Y Jiang, W Chen, Y Huang, Y Zhang, X Sun arXiv preprint arXiv:2211.15444, 2022 | 224 | 2022 |
ICDAR 2019 competition on table detection and recognition (cTDaR) L Gao, Y Huang, H Déjean, JL Meunier, Q Yan, Y Fang, F Kleber, E Lang 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019 | 198 | 2019 |
A YOLO-based table detection method Y Huang, Q Yan, Y Li, Y Chen, X Wang, L Gao, Z Tang 2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019 | 101 | 2019 |
Data-juicer: A one-stop data processing system for large language models D Chen, Y Huang, Z Ma, H Chen, X Pan, C Ge, D Gao, Y Xie, Z Liu, J Gao, ... Companion of the 2024 International Conference on Management of Data, 120-134, 2024 | 41 | 2024 |
Deepmad: Mathematical architecture design for deep convolutional neural network X Shen, Y Wang, M Lin, Y Huang, H Tang, X Sun, Y Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 36 | 2023 |
A GAN-based feature generator for table detection Y Li, L Gao, Z Tang, Q Yan, Y Huang 2019 International conference on document analysis and recognition (ICDAR …, 2019 | 33 | 2019 |
Enhancing multimodal large language models with vision detection models: An empirical study Q Jiao, D Chen, Y Huang, Y Li, Y Shen arXiv preprint arXiv:2401.17981, 2024 | 18 | 2024 |
Rethinking table structure recognition using sequence labeling methods Y Li, Y Huang, Z Zhu, L Pan, Y Huang, L Du, Z Tang, L Gao Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021 | 13 | 2021 |
Img-diff: Contrastive data synthesis for multimodal large language models Q Jiao, D Chen, Y Huang, Y Li, Y Shen arXiv preprint arXiv:2408.04594, 2024 | 9 | 2024 |
Icdar 2019 competition on table detection and recognition (ctdar) H Déjean, JL Meunier, L Gao, Y Huang, Y Fang, F Kleber, EM Lang (No Title), 2019 | 6 | 2019 |
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective Z Qin, D Chen, W Zhang, L Yao, Y Huang, B Ding, Y Li, S Deng arXiv preprint arXiv:2407.08583, 2024 | 5 | 2024 |
Data-juicer sandbox: A comprehensive suite for multimodal data-model co-development D Chen, H Wang, Y Huang, C Ge, Y Li, B Ding, J Zhou arXiv preprint arXiv:2407.11784, 2024 | 4 | 2024 |
NTable: a dataset for camera-based table detection Z Zhu, L Gao, Y Li, Y Huang, L Du, N Lu, X Wang Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021 | 4 | 2021 |
Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for Foundation Models D Chen, Y Huang, X Pan, N Jiang, H Wang, C Ge, Y Chen, W Zhang, ... arXiv preprint arXiv:2501.14755, 2024 | | 2024 |