Enhanced chart understanding in vision and language task via cross-modal pre-training on plot table pairs

M Zhou, YR Fung, L Chen, C Thomas, H Ji… - arxiv preprint arxiv …, 2023 - arxiv.org
Building cross-model intelligence that can understand charts and communicate the salient
information hidden behind them is an appealing challenge in the vision and language (V+ L) …

Are large language models good fact checkers: A preliminary study

H Cao, L Wei, M Chen, W Zhou, S Hu - arxiv preprint arxiv:2311.17355, 2023 - arxiv.org
Recently, Large Language Models (LLMs) have drawn significant attention due to their
outstanding reasoning capabilities and extensive knowledge repository, positioning them as …

Familiarity-aware evidence compression for retrieval augmented generation

D Jung, Q Liu, T Huang, B Zhou, M Chen - arxiv preprint arxiv:2409.12468, 2024 - arxiv.org
Retrieval Augmented Generation (RAG) improves large language models (LMs) by
incorporating non-parametric knowledge through evidence retrieval from external sources …

Star: Improving low-resource information extraction by structure-to-text data generation with large language models

MD Ma, X Wang, PN Kung, PJ Brantingham… - … 2023 Workshop on …, 2023 - openreview.net
Information extraction tasks such as event extraction require an in-depth understanding of
the output structure and sub-task dependencies. They heavily rely on task-specific training …

Explore the Way: Exploring Reasoning Path by Bridging Entities for Effective Cross-Document Relation Extraction

J Son, J Kim, J Lim, Y Jang, HS Lim - Findings of the Association …, 2023 - aclanthology.org
Cross-document relation extraction (CodRED) task aims to infer the relation between two
entities mentioned in different documents within a reasoning path. Previous studies have …

Parameter-efficient low-resource dialogue state tracking by prompt tuning

MD Ma, JY Kao, S Gao, A Gupta, D **, T Chung… - arxiv preprint arxiv …, 2023 - arxiv.org
Dialogue state tracking (DST) is an important step in dialogue management to keep track of
users' beliefs. Existing works fine-tune all language model (LM) parameters to tackle the …

The State of Relation Extraction Data Quality: Is Bigger Always Better?

E Cai, B O'Connor - Findings of the Association for Computational …, 2024 - aclanthology.org
Relation extraction (RE) extracts structured tuples of relationships (eg friend, enemy)
between entities (eg Sherlock Holmes, John Watson) from text, with exciting potential …

Harvesting Events from Multiple Sources: Towards a Cross-Document Event Extraction Paradigm

Q Gao, Z Meng, B Li, J Zhou, F Li, C Teng… - arxiv preprint arxiv …, 2024 - arxiv.org
Document-level event extraction aims to extract structured event information from
unstructured text. However, a single document often contains limited event information and …

Inferring from Logits: Exploring Best Practices for Decoding-Free Generative Candidate Selection

MD Ma, Y Ding, Z Huang, J Gao, Y Sun… - arxiv preprint arxiv …, 2025 - arxiv.org
Generative Language Models rely on autoregressive decoding to produce the output
sequence token by token. Many tasks such as preference optimization, require the model to …

STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models

MD Ma, X Wang, PN Kung, PJ Brantingham… - Proceedings of the …, 2024 - ojs.aaai.org
Abstract Information extraction tasks such as event extraction require an in-depth
understanding of the output structure and sub-task dependencies. They heavily rely on task …