ChartQA: A benchmark for question answering about charts with visual and logical reasoning A Masry, DX Long, JQ Tan, S Joty, E Hoque ACL 2022 Findings, 2022 | 497 | 2022 |
Chart-to-Text: A large-scale benchmark for chart summarization S Kantharaj, RTK Leong, X Lin, A Masry, M Thakkar, E Hoque, S Joty ACL 2022, 2022 | 131 | 2022 |
Chain FL: Decentralized federated machine learning via blockchain C Korkmaz, HE Kocas, A Uysal, A Masry, O Ozkasap, B Akgun 2020 Second international conference on blockchain computing and …, 2020 | 93 | 2020 |
UniChart: A universal vision-language pretrained model for chart comprehension and reasoning A Masry, P Kavehzadeh, XL Do, E Hoque, S Joty EMNLP 2023, 2023 | 91 | 2023 |
Chart question answering: State of the art and future directions E Hoque, P Kavehzadeh, A Masry EuroVis 2022 41 (3), 555-572, 2022 | 43 | 2022 |
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning A Masry, M Shahmohammadi, MR Parvez, E Hoque, S Joty ACL 2024 Findings, 2024 | 25 | 2024 |
Integrating image data extraction and table parsing methods for chart question answering A Masry, E Hoque ChartQA Workshop @ CVPR 2021 [Best Paper Award], 1-5, 2021 | 14 | 2021 |
Do LLMs work on charts? Designing few-shot prompts for chart question answering and summarization XL Do, M Hassanpour, A Masry, P Kavehzadeh, E Hoque, S Joty arXiv preprint arXiv:2312.10610, 2023 | 10 | 2023 |
ChartGemma: Visual instruction-tuning for chart reasoning in the wild A Masry, M Thakkar, A Bajaj, A Kartha, E Hoque, S Joty arXiv preprint arXiv:2407.04172, 2024 | 8 | 2024 |
Are large vision language models up to the challenge of chart comprehension and reasoning? An extensive investigation into the capabilities and limitations of LVLMs MS Islam, R Rahman, A Masry, MTR Laskar, MT Nayeem, E Hoque arXiv preprint arXiv:2406.00257, 2024 | 8 | 2024 |
LongFin: A Multimodal Document Understanding Model for Long Financial Domain Documents A Masry, A Hajian AIFinSI Workshop @ AAAI 2024, 2024 | 3 | 2024 |
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding A Masry, JA Rodriguez, T Zhang, S Wang, C Wang, A Feizi, AK Suresh, ... arXiv preprint arXiv:2502.01341, 2025 | | 2025 |
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks J Rodriguez, X Jian, SS Panigrahi, T Zhang, A Feizi, A Puri, A Kalkunte, ... arXiv preprint arXiv:2412.04626, 2024 | | 2024 |
Chart Question Answering with Visual and Logical Reasoning A Masry Master's Thesis, 2022 | | 2022 |
ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models A Masry, E Hoque | | |