Local Interpretations for Explainable Natural Language Processing: A Survey S Luo, H Ivison, C Han, J Poon ACM Computing Survey, 2024 | 50 | 2024 |
PDFVQA: A New Dataset for Real-World VQA on PDF Documents Y Ding, S Luo, H Chung, SC Han ECML PKDD 2023, 2023 | 21* | 2023 |
VICTR: Visual Information Captured Text Representation for Text-to-Vision Multimodal Tasks C Han, S Long, S Luo, K Wang, J Poon COLING 2020 (Best Area Paper Award), 3107-3117, 2020 | 16* | 2020 |
SceneGATE: Scene-Graph based co-Attention networks for TExt visual question answering F Cao, S Luo, F Nunez, Z Wen, J Poon, SC Han Robotics 12 (4), 114, 2023 | 9 | 2023 |
Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis S Luo, Y Ding, S Long, SC Han, J Poon COLING 2022, 2022 | 8 | 2022 |
Siqu Long, Josiah Poon, and Soyeon Caren Han. 2022. Doc-GCN: Heterogeneous Graph Convolutional Networks for Document Layout Analysis S Luo, Y Ding Proceedings of the 29th International Conference on Computational …, 0 | 8* | |
PDF-MVQA: A Dataset for Multimodal Information Retrieval in PDF-based Visual Question Answering Y Ding, K Ren, J Huang, S Luo, SC Han IJCAI 2024, 2024 | 5 | 2024 |
REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering S Luo, SC Han, K Sun, J Poon ICONIP 2020 (Best Paper Award), 520-532, 2020 | 5 | 2020 |
MMVQA: A comprehensive dataset for investigating multipage multimodal information retrieval in pdf-based visual question answering Y Ding, K Ren, J Huang, S Luo, SC Han Proceedings of the Thirty-Third International Joint Conference on Artificial …, 2024 | 3 | 2024 |
3M-Health: Multimodal Multi-Teacher Knowledge Distillation for Mental Health Detection RC Cabral, S Luo, J Poon, SC Han Proceedings of the 33rd ACM International Conference on Information and …, 2024 | 1 | 2024 |
Multimodal Commonsense Knowledge Distillation for Visual Question Answering S Yang, S Luo, SC Han arXiv preprint arXiv:2411.02722, 2024 | | 2024 |
'No'Matters: Out-of-Distribution Detection in Multimodality Long Dialogue R Gao, X Wu, S Luo, C Han, F Liu arXiv preprint arXiv:2410.23883, 2024 | | 2024 |
Workshop on Document Intelligence Understanding SC Han, Y Ding, S Luo, J Poon, HG Yoon, Z Huang, P Duuring, ... CIKM 2023, 2023 | | 2023 |
PiggyBack: Pretrained Visual Question Answering Environment for Backing up Non-deep Learning Professionals Z Zhang, S Luo, J Chen, S Lai, S Long, H Chung, SC Han WSDM 2023, 1152-1155, 2023 | | 2023 |
Towards Multi-modal Interpretation and Explanation S Luo | | 2023 |