doc2dial: A goal-oriented document-grounded dialogue dataset

S Feng, H Wan, C Gunasekara, SS Patel… - arxiv preprint arxiv …, 2020 - arxiv.org
We introduce doc2dial, a new dataset of goal-oriented dialogues that are grounded in the
associated documents. Inspired by how the authors compose documents for guiding end …

Multitask semi-supervised learning for class-imbalanced discourse classification

A Spangher, J May, SR Shiang… - Proceedings of the 2021 …, 2021 - aclanthology.org
As labeling schemas evolve over time, small differences can render datasets following older
schemas unusable. This prevents researchers from building on top of previous annotation …

Doc2Bot: Accessing heterogeneous documents via conversational bots

H Fu, Y Zhang, H Yu, J Sun, F Huang, L Si, Y Li… - arxiv preprint arxiv …, 2022 - arxiv.org
This paper introduces Doc2Bot, a novel dataset for building machines that help users seek
information via conversations. This is of particular interest for companies and organizations …

Connective-lex: A web-based multilingual lexical resource for connectives

M Stede, T Scheffler, A Mendes - Discours. Revue de …, 2019 - journals.openedition.org
In this paper, we present a tangible outcome of the TextLink network: a joint online database
project displaying and linking existing and newly-created lexicons of discourse connectives …

Primary and secondary discourse connectives: definitions and lexicons

L Danlos, K Rysova, M Rysova, M Stede - Dialogue & Discourse, 2018 - journals.uic.edu
Starting from the perspective that discourse structure arises from the presence of coherence
relations, we provide a map of linguistic discourse structuring devices (DRDs), and focus on …

Towards identifying alternative-lexicalization signals of discourse relations

R Knaebel, M Stede - … of the 29th International Conference on …, 2022 - aclanthology.org
The task of shallow discourse parsing in the Penn Discourse Treebank (PDTB) framework
has traditionally been restricted to identifying those relations that are signaled by a …

Doc2dial: a framework for dialogue composition grounded in documents

S Feng, K Fadnis, QV Liao, LA Lastras - … of the AAAI Conference on Artificial …, 2020 - aaai.org
Abstract We introduce Doc2Dial, an end-to-end framework for generating conversational
data grounded in given documents. It takes the documents as input and generates the …

Multi-class categorization of reasons behind mental disturbance in long texts

M Garg - Knowledge-Based Systems, 2023 - Elsevier
Motivated with recent advances in inferring users' mental state in social media posts, we
identify and formulate the problem of finding causal indicators behind mental illness in self …

Measuring forecasting skill from text

S Zong, A Ritter, E Hovy - arxiv preprint arxiv:2006.07425, 2020 - arxiv.org
People vary in their ability to make accurate predictions about the future. Prior studies have
shown that some individuals can predict the outcome of future events with consistently better …

Usage disambiguation of Turkish discourse connectives

K Başıbüyük, D Zeyrek - Language Resources and Evaluation, 2023 - Springer
This paper describes a rule-based approach and a machine learning approach to
disambiguate the discourse usage of Turkish connectives, which not only has single and …