Exploring the state of the art in legal QA systems

A Abdallah, B Piryani, A Jatowt - Journal of Big Data, 2023 - Springer
Answering questions related to the legal domain is a complex task, primarily due to the
intricate nature and diverse range of legal document systems. Providing an accurate answer …

Bringing order into the realm of Transformer-based language models for artificial intelligence and law

CM Greco, A Tagarelli - Artificial Intelligence and Law, 2024 - Springer
Transformer-based language models (TLMs) have widely been recognized to be a cutting-
edge technology for the successful development of deep-learning-based solutions to …

A survey on large language models for critical societal domains: Finance, healthcare, and law

ZZ Chen, J Ma, X Zhang, N Hao, A Yan… - arxiv preprint arxiv …, 2024 - arxiv.org
In the fast-evolving domain of artificial intelligence, large language models (LLMs) such as
GPT-3 and GPT-4 are revolutionizing the landscapes of finance, healthcare, and law …

VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

XQ Dao, NB Le, TD Vo, XD Phan, BB Ngo… - arxiv preprint arxiv …, 2023 - arxiv.org
The VNHSGE (VietNamese High School Graduation Examination) dataset, developed
exclusively for evaluating large language models (LLMs), is introduced in this article. The …

A survey on data augmentation in large model era

Y Zhou, C Guo, X Wang, Y Chang, Y Wu - arxiv preprint arxiv:2401.15422, 2024 - arxiv.org
Large models, encompassing large language and diffusion models, have shown
exceptional promise in approximating human-level intelligence, garnering significant …

Retrieval-augmented data augmentation for low-resource domain tasks

M Seo, J Baek, J Thorne, SJ Hwang - arxiv preprint arxiv:2402.13482, 2024 - arxiv.org
Despite large successes of recent language models on diverse tasks, they suffer from
severe performance degeneration in low-resource settings with limited training data …

Conditionalqa: A complex reading comprehension dataset with conditional answers

H Sun, WW Cohen, R Salakhutdinov - arxiv preprint arxiv:2110.06884, 2021 - arxiv.org
We describe a Question Answering (QA) dataset that contains complex questions with
conditional answers, ie the answers are only applicable when certain conditions apply. We …

Can llms augment low-resource reading comprehension datasets? opportunities and challenges

V Samuel, H Aynaou, AG Chowdhury… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have demonstrated impressive zero shot performance on a
wide range of NLP tasks, demonstrating the ability to reason and apply commonsense. A …

Breaking down walls of text: How can nlp benefit consumer privacy?

A Ravichander, A Black, T Norton, S Wilson… - Computational …, 2021 - par.nsf.gov
Decomposable tasks are complex and comprise of a hierarchy of sub-tasks. Spoken intent
prediction, for example, combines automatic speech recognition and natural language …