Qa dataset explosion: A taxonomy of nlp resources for question answering and reading comprehension

A Rogers, M Gardner, I Augenstein - ACM Computing Surveys, 2023 - dl.acm.org
Alongside huge volumes of research on deep learning models in NLP in the recent years,
there has been much work on benchmark datasets needed to track modeling progress …

Machine reading comprehension: The role of contextualized language models and beyond

Z Zhang, H Zhao, R Wang - arxiv preprint arxiv:2005.06249, 2020 - arxiv.org
Machine reading comprehension (MRC) aims to teach machines to read and comprehend
human languages, which is a long-standing goal of natural language processing (NLP) …

Ragbench: Explainable benchmark for retrieval-augmented generation systems

R Friel, M Belyi, A Sanyal - arxiv preprint arxiv:2407.11005, 2024 - arxiv.org
Retrieval-Augmented Generation (RAG) has become a standard architectural pattern for
incorporating domain-specific knowledge into user-facing chat applications powered by …

MPMQA: multimodal question answering on product manuals

L Zhang, A Hu, J Zhang, S Hu, Q ** - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Visual contents, such as illustrations and images, play a big role in product manual
understanding. Existing Product Manual Question Answering (PMQA) datasets tend to …

A technical question answering system with transfer learning

W Yu, L Wu, Y Deng, R Mahindru, Q Zeng… - Proceedings of the …, 2020 - aclanthology.org
In recent years, the need for community technical question-answering sites has increased
significantly. However, it is often expensive for human experts to provide timely and helpful …

Is Semantic Chunking Worth the Computational Cost?

R Qu, R Tu, F Bao - arxiv preprint arxiv:2410.13070, 2024 - arxiv.org
Recent advances in Retrieval-Augmented Generation (RAG) systems have popularized
semantic chunking, which aims to improve retrieval performance by dividing documents into …

Cheap and good? simple and effective data augmentation for low resource machine reading

H Van, V Yadav, M Surdeanu - … of the 44th International ACM SIGIR …, 2021 - dl.acm.org
We propose a simple and effective strategy for data augmentation for low-resource machine
reading comprehension (MRC). Our approach first pretrains the answer extraction …

Question answering over electronic devices: A new benchmark dataset and a multi-task learning based QA framework

A Nandy, S Sharma, S Maddhashiya… - arxiv preprint arxiv …, 2021 - arxiv.org
Answering questions asked from instructional corpora such as E-manuals, recipe books,
etc., has been far less studied than open-domain factoid context-based question answering …

Multi-domain multilingual question answering

S Ruder, A Sil - Proceedings of the 2021 Conference on Empirical …, 2021 - aclanthology.org
Question answering (QA) is one of the most challenging and impactful tasks in natural
language processing. Most research in QA, however, has focused on the open-domain or …

A neural question answering system for basic questions about subroutines

A Bansal, Z Eberhart, L Wu… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
A question answering (QA) system is a type of conversational AI that generates natural
language answers to questions posed by human users. QA systems often form the …