DeepStruct: Pretraining of language models for structure prediction

C Wang, X Liu, Z Chen, H Hong, J Tang… - arxiv preprint arxiv …, 2022 - arxiv.org
We introduce a method for improving the structural understanding abilities of language
models. Unlike previous approaches that finetune the models with task-specific …

Privacy issues in large language models: A survey

S Neel, P Chang - arxiv preprint arxiv:2312.06717, 2023 - arxiv.org
This is the first survey of the active area of AI research that focuses on privacy issues in
Large Language Models (LLMs). Specifically, we focus on work that red-teams models to …

The gem benchmark: Natural language generation, its evaluation and metrics

S Gehrmann, T Adewumi, K Aggarwal… - arxiv preprint arxiv …, 2021 - arxiv.org
We introduce GEM, a living benchmark for natural language Generation (NLG), its
Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving …

Two tales of persona in llms: A survey of role-playing and personalization

YM Tseng, YC Huang, TY Hsiao, WL Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
The concept of persona, originally adopted in dialogue literature, has re-surged as a
promising framework for tailoring large language models (LLMs) to specific context (eg …

Find or classify? dual strategy for slot-value predictions on multi-domain dialog state tracking

JG Zhang, K Hashimoto, CS Wu, Y Wan, PS Yu… - arxiv preprint arxiv …, 2019 - arxiv.org
Dialog state tracking (DST) is a core component in task-oriented dialog systems. Existing
approaches for DST mainly fall into one of two categories, namely, ontology-based and …

Spokenwoz: A large-scale speech-text benchmark for spoken task-oriented dialogue agents

S Si, W Ma, H Gao, Y Wu, TE Lin… - Advances in …, 2023 - proceedings.neurips.cc
Task-oriented dialogue (TOD) models have made significant progress in recent years.
However, previous studies primarily focus on datasets written by annotators, which has …

Multiwoz 2.4: A multi-domain task-oriented dialogue dataset with essential annotation corrections to improve state tracking evaluation

F Ye, J Manotumruksa, E Yilmaz - arxiv preprint arxiv:2104.00773, 2021 - arxiv.org
The MultiWOZ 2.0 dataset has greatly stimulated the research of task-oriented dialogue
systems. However, its state annotations contain substantial noise, which hinders a proper …

Leveraging slot descriptions for zero-shot cross-domain dialogue state tracking

Z Lin, B Liu, S Moon, P Crook, Z Zhou, Z Wang… - arxiv preprint arxiv …, 2021 - arxiv.org
Zero-shot cross-domain dialogue state tracking (DST) enables us to handle task-oriented
dialogue in unseen domains without the expense of collecting in-domain data. In this paper …