[HTML][HTML] Pre-trained transformers: an empirical comparison
Pre-trained transformers have rapidly become very popular in the Natural Language
Processing (NLP) community, surpassing the previous state of the art in a wide variety of …
Processing (NLP) community, surpassing the previous state of the art in a wide variety of …
French CrowS-pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than English
Warning: This paper contains explicit statements of offensive stereotypes which may be
upsetting. Much work on biases in natural language processing has addressed biases …
upsetting. Much work on biases in natural language processing has addressed biases …
The role of machine learning in develo** non-magnetic resonance imaging based biomarkers for multiple sclerosis: a systematic review
Background Multiple sclerosis (MS) is a neurological condition whose symptoms, severity,
and progression over time vary enormously among individuals. Ideally, each person living …
and progression over time vary enormously among individuals. Ideally, each person living …
Position: Key claims in llm research have a long tail of footnotes
Much of the recent discourse within the ML community has been centered around Large
Language Models (LLMs), their functionality and potential--yet not only do we not have a …
Language Models (LLMs), their functionality and potential--yet not only do we not have a …
Just What do You Think You're Doing, Dave?'A Checklist for Responsible Data Use in NLP
A key part of the NLP ethics movement is responsible use of data, but exactly what that
means or how it can be best achieved remain unclear. This position paper discusses the …
means or how it can be best achieved remain unclear. This position paper discusses the …
On the challenges of using black-box apis for toxicity evaluation in research
Perception of toxicity evolves over time and often differs between geographies and cultural
backgrounds. Similarly, black-box commercially available APIs for detecting toxicity, such as …
backgrounds. Similarly, black-box commercially available APIs for detecting toxicity, such as …
A Metrological Perspective on Reproducibility in NLP*
A Belz - Computational Linguistics, 2022 - direct.mit.edu
Reproducibility has become an increasingly debated topic in NLP and ML over recent years,
but so far, no commonly accepted definitions of even basic terms or concepts have emerged …
but so far, no commonly accepted definitions of even basic terms or concepts have emerged …
[HTML][HTML] The GDPR enforcement fines at glance
Abstract The General Data Protection Regulation (GDPR) came into force in 2018. After this
enforcement, many fines have already been imposed by national data protection authorities …
enforcement, many fines have already been imposed by national data protection authorities …
A survey of semantic relatedness evaluation datasets and procedures
Semantic relatedness between words is a core concept in natural language processing.
While countless approaches have been proposed, measuring which one works best is still a …
While countless approaches have been proposed, measuring which one works best is still a …
Can reproducibility be improved in clinical natural language processing? A study of 7 clinical NLP suites
Background The increasing complexity of data streams and computational processes in
modern clinical health information systems makes reproducibility challenging. Clinical …
modern clinical health information systems makes reproducibility challenging. Clinical …