Folgen
Yilun Huang
Yilun Huang
Alibaba Group; Peking University
Bestätigte E-Mail-Adresse bei pku.edu.cn - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Damo-yolo: A report on real-time object detection design
X Xu, Y Jiang, W Chen, Y Huang, Y Zhang, X Sun
arXiv preprint arXiv:2211.15444, 2022
2242022
ICDAR 2019 competition on table detection and recognition (cTDaR)
L Gao, Y Huang, H Déjean, JL Meunier, Q Yan, Y Fang, F Kleber, E Lang
2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019
1982019
A YOLO-based table detection method
Y Huang, Q Yan, Y Li, Y Chen, X Wang, L Gao, Z Tang
2019 International Conference on Document Analysis and Recognition (ICDAR …, 2019
1012019
Data-juicer: A one-stop data processing system for large language models
D Chen, Y Huang, Z Ma, H Chen, X Pan, C Ge, D Gao, Y Xie, Z Liu, J Gao, ...
Companion of the 2024 International Conference on Management of Data, 120-134, 2024
412024
Deepmad: Mathematical architecture design for deep convolutional neural network
X Shen, Y Wang, M Lin, Y Huang, H Tang, X Sun, Y Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
362023
A GAN-based feature generator for table detection
Y Li, L Gao, Z Tang, Q Yan, Y Huang
2019 International conference on document analysis and recognition (ICDAR …, 2019
332019
Enhancing multimodal large language models with vision detection models: An empirical study
Q Jiao, D Chen, Y Huang, Y Li, Y Shen
arXiv preprint arXiv:2401.17981, 2024
182024
Rethinking table structure recognition using sequence labeling methods
Y Li, Y Huang, Z Zhu, L Pan, Y Huang, L Du, Z Tang, L Gao
Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021
132021
Img-diff: Contrastive data synthesis for multimodal large language models
Q Jiao, D Chen, Y Huang, Y Li, Y Shen
arXiv preprint arXiv:2408.04594, 2024
92024
Icdar 2019 competition on table detection and recognition (ctdar)
H Déjean, JL Meunier, L Gao, Y Huang, Y Fang, F Kleber, EM Lang
(No Title), 2019
62019
The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective
Z Qin, D Chen, W Zhang, L Yao, Y Huang, B Ding, Y Li, S Deng
arXiv preprint arXiv:2407.08583, 2024
52024
Data-juicer sandbox: A comprehensive suite for multimodal data-model co-development
D Chen, H Wang, Y Huang, C Ge, Y Li, B Ding, J Zhou
arXiv preprint arXiv:2407.11784, 2024
42024
NTable: a dataset for camera-based table detection
Z Zhu, L Gao, Y Li, Y Huang, L Du, N Lu, X Wang
Document Analysis and Recognition–ICDAR 2021: 16th International Conference …, 2021
42021
Data-Juicer 2.0: Cloud-Scale Adaptive Data Processing for Foundation Models
D Chen, Y Huang, X Pan, N Jiang, H Wang, C Ge, Y Chen, W Zhang, ...
arXiv preprint arXiv:2501.14755, 2024
2024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–14