Fairness in large language models: A taxonomic survey

Z Chu, Z Wang, W Zhang - ACM SIGKDD explorations newsletter, 2024 - dl.acm.org
Large Language Models (LLMs) have demonstrated remarkable success across various
domains. However, despite their promising performance in numerous real-world …

Multi 3 WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems

S Hu, H Zhou, M Hergul, M Gritta, G Zhang… - Transactions of the …, 2023 - direct.mit.edu
Creating high-quality annotated data for task-oriented dialog (ToD) is known to be
notoriously difficult, and the challenges are amplified when the goal is to create equitable …

Cross-lingual dialogue dataset creation via outline-based generation

O Majewska, E Razumovskaia, EM Ponti… - Transactions of the …, 2023 - direct.mit.edu
Multilingual task-oriented dialogue (ToD) facilitates access to services and information for
many (communities of) speakers. Nevertheless, its potential is not fully realized, as current …

Multi3NLU++: A multilingual, multi-intent, multi-domain dataset for natural language understanding in task-oriented dialogue

N Moghe, E Razumovskaia, L Guillou, I Vulić… - arxiv preprint arxiv …, 2022 - arxiv.org
Task-oriented dialogue (TOD) systems have been widely deployed in many industries as
they deliver more efficient customer support. These systems are typically constructed for a …

Crossing the conversational chasm: A primer on natural language processing for multilingual task-oriented dialogue systems

E Razumovskaia, G Glavas, O Majewska… - Journal of Artificial …, 2022 - jair.org
In task-oriented dialogue (ToD), a user holds a conversation with an artificial agent with the
aim of completing a concrete task. Although this technology represents one of the central …

Cross-lingual extreme summarization of scholarly documents

S Takeshita, T Green, N Friedrich, K Eckert… - International journal on …, 2024 - Springer
The number of scientific publications nowadays is rapidly increasing, causing information
overload for researchers and making it hard for scholars to keep up to date with current …

[PDF][PDF] Can demographic factors improve text classification? revisiting demographic adaptation in the age of transformers

CC Hung, A Lauscher, D Hovy… - Findings of the …, 2023 - iris.unibocconi.it
Demographic factors (eg, gender or age) shape our language. Previous work showed that
incorporating demographic factors can consistently improve performance for various NLP …

Survey of cultural awareness in language models: Text and beyond

S Pawar, J Park, J **, A Arora, J Myung… - arxiv preprint arxiv …, 2024 - arxiv.org
Large-scale deployment of large language models (LLMs) in various applications, such as
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …

Cgodial: A large-scale benchmark for chinese goal-oriented dialog evaluation

Y Dai, W He, B Li, Y Wu, Z Cao, Z An, J Sun… - arxiv preprint arxiv …, 2022 - arxiv.org
Practical dialog systems need to deal with various knowledge sources, noisy user
expressions, and the shortage of annotated data. To better solve the above problems, we …

Extrinsic evaluation of machine translation metrics

N Moghe, T Sherborne, M Steedman… - arxiv preprint arxiv …, 2022 - arxiv.org
Automatic machine translation (MT) metrics are widely used to distinguish the translation
qualities of machine translation systems across relatively large test sets (system-level …