AI alignment: A comprehensive survey

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, so do risks from misalignment. To provide a comprehensive …

A systematic review of hate speech automatic detection using natural language processing

MS Jahan, M Oussalah - Neurocomputing, 2023 - Elsevier
With the multiplication of social media platforms, which offer anonymity, easy access and
online community formation and online debate, the issue of hate speech detection and …

ChatGPT: Jack of all trades, master of none

J Kocoń, I Cichecki, O Kaszyca, M Kochanek, D Szydło… - Information …, 2023 - Elsevier
OpenAI has released the Chat Generative Pre-trained Transformer (ChatGPT) and
revolutionized the approach in artificial intelligence to human-model interaction. The first …

RWKV: Reinventing RNNs for the transformer era

B Peng, E Alcaide, Q Anthony, A Albalak… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have revolutionized almost all natural language processing (NLP) tasks but
suffer from memory and computational complexity that scales quadratically with sequence …

Large language model alignment: A survey

T Shen, R Jin, Y Huang, C Liu, W Dong, Z Guo… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent years have witnessed remarkable progress made in large language models (LLMs).
Such advancements, while garnering significant attention, have concurrently elicited various …

A new generation of Perspective API: Efficient multilingual character-level transformers

A Lees, VQ Tran, Y Tay, J Sorensen, J Gupta… - Proceedings of the 28th …, 2022 - dl.acm.org
On the world wide web, toxic content detectors are a crucial line of defense against
potentially hateful and offensive messages. As such, building highly effective classifiers that …

Challenges in detoxifying language models

J Welbl, A Glaese, J Uesato, S Dathathri… - arXiv preprint arXiv …, 2021 - arxiv.org
Large language models (LM) generate remarkably fluent text and can be efficiently adapted
across NLP tasks. Measuring and guaranteeing the quality of generated text in terms of …

Artificial intelligence alone will not democratise education: On educational inequality, techno-solutionism and inclusive tools

S Bulathwela, M Pérez-Ortiz, C Holloway, M Cukurova… - Sustainability, 2024 - mdpi.com
Artificial Intelligence (AI) in Education claims to have the potential for building personalised
curricula, as well as bringing opportunities for democratising education and creating a …

SemEval-2020 task 12: Multilingual offensive language identification in social media (OffensEval 2020)

M Zampieri, P Nakov, S Rosenthal, P Atanasova… - arXiv preprint arXiv …, 2020 - arxiv.org
We present the results and main findings of SemEval-2020 Task 12 on Multilingual
Offensive Language Identification in Social Media (OffensEval 2020). The task involves …

Algorithmic content moderation: Technical and political challenges in the automation of platform governance

R Gorwa, R Binns, C Katzenbach - Big Data & Society, 2020 - journals.sagepub.com
As government pressure on major technology companies builds, both firms and legislators
are searching for technical solutions to difficult platform governance puzzles such as hate …