Παρακολούθηση
Ahmed Masry
Ahmed Masry
Graduate Student, York University
Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα yorku.ca
Τίτλος
Παρατίθεται από
Παρατίθεται από
Έτος
Chartqa: A benchmark for question answering about charts with visual and logical reasoning
A Masry, DX Long, JQ Tan, S Joty, E Hoque
ACL 2022 Findings, 2022
4972022
Chart-to-text: A large-scale benchmark for chart summarization
S Kantharaj, RTK Leong, X Lin, A Masry, M Thakkar, E Hoque, S Joty
ACL 2022, 2022
1312022
Chain fl: Decentralized federated machine learning via blockchain
C Korkmaz, HE Kocas, A Uysal, A Masry, O Ozkasap, B Akgun
2020 Second international conference on blockchain computing and …, 2020
932020
Unichart: A universal vision-language pretrained model for chart comprehension and reasoning
A Masry, P Kavehzadeh, XL Do, E Hoque, S Joty
EMNLP 2023, 2023
912023
Chart question answering: State of the art and future directions
E Hoque, P Kavehzadeh, A Masry
EuroVis 2022 41 (3), 555-572, 2022
432022
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning
A Masry, M Shahmohammadi, MR Parvez, E Hoque, S Joty
ACL 2024 Findings, 2024
252024
Integrating image data extraction and table parsing methods for chart question answering
A Masry, E Hoque
ChartQA Workshop @ CVPR 2021 [Best Paper Award], 1-5, 2021
142021
Do llms work on charts? designing few-shot prompts for chart question answering and summarization
XL Do, M Hassanpour, A Masry, P Kavehzadeh, E Hoque, S Joty
arXiv preprint arXiv:2312.10610, 2023
102023
Chartgemma: Visual instruction-tuning for chart reasoning in the wild
A Masry, M Thakkar, A Bajaj, A Kartha, E Hoque, S Joty
arXiv preprint arXiv:2407.04172, 2024
82024
Are large vision language models up to the challenge of chart comprehension and reasoning? an extensive investigation into the capabilities and limitations of lvlms
MS Islam, R Rahman, A Masry, MTR Laskar, MT Nayeem, E Hoque
arXiv preprint arXiv:2406.00257, 2024
82024
LongFin: A Multimodal Document Understanding Model for Long Financial Domain Documents
A Masry, A Hajian
AIFinSI Workshop @ AAAI 2024, 2024
32024
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
A Masry, JA Rodriguez, T Zhang, S Wang, C Wang, A Feizi, AK Suresh, ...
arXiv preprint arXiv:2502.01341, 2025
2025
BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
J Rodriguez, X Jian, SS Panigrahi, T Zhang, A Feizi, A Puri, A Kalkunte, ...
arXiv preprint arXiv:2412.04626, 2024
2024
Chart Question Answering with Visual and Logical Reasoning
A Masry
Master's Thesis, 2022
2022
ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models
A Masry, E Hoque
Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.
Άρθρα 1–15