Machine knowledge: Creation and curation of comprehensive knowledge bases

G Weikum, XL Dong, S Razniewski… - … and Trends® in …, 2021 - nowpublishers.com
Equipping machines with comprehensive knowledge of the world's entities and their
relationships has been a longstanding goal of AI. Over the last decade, large-scale …

Toxicity detection: Does context really matter?

J Pavlopoulos, J Sorensen, L Dixon, N Thain… - arxiv preprint arxiv …, 2020 - arxiv.org
Moderation is crucial to promoting healthy on-line discussions. Although several toxicity
detection datasets and models have been published, most of them ignore the context of the …

Trouble on the horizon: Forecasting the derailment of online conversations as they develop

JP Chang, C Danescu-Niculescu-Mizil - arxiv preprint arxiv:1909.01362, 2019 - arxiv.org
Online discussions often derail into toxic exchanges between participants. Recent efforts
mostly focused on detecting antisocial behavior after the fact, by analyzing single comments …

Wikimedia data for AI: a review of Wikimedia datasets for NLP tasks and AI-assisted editing

I Johnson, LA Kaffee, M Redi - arxiv preprint arxiv:2410.08918, 2024 - arxiv.org
Wikimedia content is used extensively by the AI community and within the language
modeling community in particular. In this paper, we provide a review of the different ways in …

Towards systematic monolingual NLP surveys: GenA of Greek NLP

J Bakagianni, K Pouli, M Gavriilidou… - arxiv preprint arxiv …, 2024 - arxiv.org
Natural Language Processing (NLP) research has traditionally been predominantly focused
on English, driven by the availability of resources, the size of the research community, and …

Characterizing online public discussions through patterns of participant interactions

J Zhang, C Danescu-Niculescu-Mizil… - Proceedings of the …, 2018 - dl.acm.org
Public discussions on social media platforms are an intrinsic part of online information
consumption. Characterizing the diverse range of discussions which can arise is crucial for …

Trajectories of blocked community members: Redemption, recidivism and departure

J Chang, C Danescu-Niculescu-Mizil - The world wide web conference, 2019 - dl.acm.org
Community norm violations can impair constructive communication and collaboration online.
As a defense mechanism, community moderators often address such transgressions by …

Preemptive toxic language detection in Wikipedia comments using thread-level context

M Karan, J Šnajder - Proceedings of the Third Workshop on …, 2019 - aclanthology.org
We address the task of automatically detecting toxic content in user generated texts. We focus
on exploring the potential for preemptive moderation, i.e., predicting whether a particular …

I beg to differ: A study of constructive disagreement in online conversations

C De Kock, A Vlachos - arxiv preprint arxiv:2101.10917, 2021 - arxiv.org
Disagreements are pervasive in human communication. In this paper we investigate what
makes disagreement constructive. To this end, we construct WikiDisputes, a corpus of 7 425 …

Considerations for multilingual wikipedia research

I Johnson, E Lescak - arxiv preprint arxiv:2204.02483, 2022 - arxiv.org
English Wikipedia has long been an important data source for much research and natural
language machine learning modeling. The growth of non-English language editions of …