What is wrong with scene text recognition model comparisons? dataset and model analysis J Baek, G Kim, J Lee, S Park, D Han, S Yun, SJ Oh, H Lee
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
695 2019 OCR-Free Document Understanding Transformer G Kim, T Hong, M Yim, JY Nam, J Park, J Yim, W Hwang, S Yun, D Han, ...
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
422 * 2022 Cost-effective End-to-end Information Extraction for Semi-structured Document Images W Hwang, H Lee, J Yim, G Kim, M Seo
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
30 2021 Prometheus-Vision: Vision-language model as a judge for fine-grained evaluation S Lee, S Kim, SH Park, G Kim, M Seo
ACL 2024 Findings, 2024
22 2024 Graph embedding with shifted inner product similarity and its improved approximation capability A Okuno, G Kim, H Shimodaira
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
12 2019 Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models G Kim, H Lee, D Kim, H Jung, S Park, Y Kim, S Yun, T Kil, B Lee, S Park
Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023
9 * 2023 Representation learning with weighted inner product for universal approximation of general similarities G Kim, A Okuno, K Fukui, H Shimodaira
Proceedings of the Twenty-Eighth International Joint Conference on …, 2019
9 2019 On text localization in end-to-end OCR-Free document understanding transformer without text localization supervision G Kim, S Yokoo, S Seo, A Osanai, Y Okamoto, Y Baek
International Conference on Document Analysis and Recognition, 215-232, 2023
7 2023 Word-like character n-gram embedding G Kim, K Fukui, H Shimodaira
Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User …, 2018
7 2018 HyperCLOVA X Technical Report HyperCLOVA
arXiv preprint arXiv:2404.01954, 2024
6 * 2024 Segmentation-free Compositional -gram Embedding G Kim, K Fukui, H Shimodaira
Proceedings of the 2019 Conference of the North American Chapter of the …, 2019
4 2019 SCOB: Universal Text Understanding via Character-wise Supervised Contrastive Learning with Online Text Rendering for Bridging Domain Gap D Kim, Y Kim, DH Kim, Y Lim, G Kim, T Kil
2023 IEEE/CVF International Conference on Computer Vision (ICCV), 2023
2 2023 Revisiting Additive Compositionality: AND, OR and NOT Operations with Word Embeddings M Naito, S Yokoi, G Kim, H Shimodaira
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021
2 2021 On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning G Kim, M Seo
EMNLP 2024, 2024
1 2024 CREPE: Coordinate-Aware End-to-End Document Parser Y Okamoto, Y Baek, G Kim, R Nakao, DH Kim, MB Yim, S Park, B Lee
International Conference on Document Analysis and Recognition, 3-20, 2024
1 2024 On Web-based Visual Corpus Construction for Visual Document Understanding D Kim, T Hong, M Yim, Y Kim, G Kim
Proceedings of the International Conference on Document Analysis and …, 2023
1 2023 Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model S Park, G Kim, J Lee, J Cha, JH Kim, H Lee
Proceedings of the 28th International Conference on Computational …, 2020
1 2020 How Does Vision-Language Adaptation Impact the Safety of Vision Language Models? S Lee, G Kim, J Kim, H Lee, H Chang, SH Park, M Seo
To appear at ICLR 2025, 2024
2024 Semi-Structured Query Grounding for Document-Oriented Databases with Deep Retrieval and Its Application to Receipt and POI Matching G Kim, W Hwang, M Seo, S Park
Proceedings of the AAAI-22 Workshop on Knowledge Discovery from Unstructured …, 2022
2022 Stochastic Neighbor Embedding of Multimodal Relational Data for Image-Text Simultaneous Visualization M Mizutani, A Okuno, G Kim, H Shimodaira
arXiv preprint arXiv:2005.00670, 2020
2020