[HTML][HTML] Pre-trained transformers: an empirical comparison

S Casola, I Lauriola, A Lavelli - Machine Learning with Applications, 2022 - Elsevier
Pre-trained transformers have rapidly become very popular in the Natural Language
Processing (NLP) community, surpassing the previous state of the art in a wide variety of …

French CrowS-pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than English

A Névéol, Y Dupont, J Bezançon… - Proceedings of the 60th …, 2022 - aclanthology.org
Warning: This paper contains explicit statements of offensive stereotypes which may be
upsetting. Much work on biases in natural language processing has addressed biases …

The role of machine learning in develo** non-magnetic resonance imaging based biomarkers for multiple sclerosis: a systematic review

MZ Hossain, E Daskalaki, A Brüstle… - BMC Medical Informatics …, 2022 - Springer
Background Multiple sclerosis (MS) is a neurological condition whose symptoms, severity,
and progression over time vary enormously among individuals. Ideally, each person living …

Position: Key claims in llm research have a long tail of footnotes

A Rogers, S Luccioni - Forty-first International Conference on …, 2024 - openreview.net
Much of the recent discourse within the ML community has been centered around Large
Language Models (LLMs), their functionality and potential--yet not only do we not have a …

Just What do You Think You're Doing, Dave?'A Checklist for Responsible Data Use in NLP

A Rogers, T Baldwin, K Leins - arxiv preprint arxiv:2109.06598, 2021 - arxiv.org
A key part of the NLP ethics movement is responsible use of data, but exactly what that
means or how it can be best achieved remain unclear. This position paper discusses the …

On the challenges of using black-box apis for toxicity evaluation in research

L Pozzobon, B Ermis, P Lewis, S Hooker - arxiv preprint arxiv:2304.12397, 2023 - arxiv.org
Perception of toxicity evolves over time and often differs between geographies and cultural
backgrounds. Similarly, black-box commercially available APIs for detecting toxicity, such as …

A Metrological Perspective on Reproducibility in NLP*

A Belz - Computational Linguistics, 2022 - direct.mit.edu
Reproducibility has become an increasingly debated topic in NLP and ML over recent years,
but so far, no commonly accepted definitions of even basic terms or concepts have emerged …

[HTML][HTML] The GDPR enforcement fines at glance

J Ruohonen, K Hjerppe - Information Systems, 2022 - Elsevier
Abstract The General Data Protection Regulation (GDPR) came into force in 2018. After this
enforcement, many fines have already been imposed by national data protection authorities …

A survey of semantic relatedness evaluation datasets and procedures

MA Hadj Taieb, T Zesch, M Ben Aouicha - Artificial Intelligence Review, 2020 - Springer
Semantic relatedness between words is a core concept in natural language processing.
While countless approaches have been proposed, measuring which one works best is still a …

Can reproducibility be improved in clinical natural language processing? A study of 7 clinical NLP suites

W Digan, A Névéol, A Neuraz, M Wack… - Journal of the …, 2021 - academic.oup.com
Background The increasing complexity of data streams and computational processes in
modern clinical health information systems makes reproducibility challenging. Clinical …