On the explainability of natural language processing deep models

JE Zini, M Awad - ACM Computing Surveys, 2022 - dl.acm.org
Despite their success, deep networks are used as black-box models with outputs that are not
easily explainable during the learning and the prediction phases. This lack of interpretability …

Beyond accuracy: Behavioral testing of NLP models with CheckList

MT Ribeiro, T Wu, C Guestrin, S Singh - arxiv preprint arxiv:2005.04118, 2020 - arxiv.org
Although measuring held-out accuracy has been the primary approach to evaluate
generalization, it often overestimates the performance of NLP models, while alternative …

Ten years of BabelNet: A survey

R Navigli, M Bevilacqua, S Conia, D Montagnini… - IJCAI, 2021 - iris.uniroma1.it
The intelligent manipulation of symbolic knowledge has been a long-sought goal of AI.
However, when it comes to Natural Language Processing (NLP), symbols have to be …

A comparative evaluation and analysis of three generations of Distributional Semantic Models

A Lenci, M Sahlgren, P Jeuniaux… - Language resources …, 2022 - Springer
Distributional semantics has deeply changed in the last decades. First, predict models stole
the thunder from traditional count ones, and more recently both of them were replaced in …

CogniVal: A framework for cognitive word embedding evaluation

N Hollenstein, A de la Torre, N Langer… - arxiv preprint arxiv …, 2019 - arxiv.org
An interesting method of evaluating word representations is by how much they reflect the
semantic representations in the human brain. However, most, if not all, previous works only …

LINSPECTOR: Multilingual probing tasks for word representations

GG Şahin, C Vania, I Kuznetsov… - Computational …, 2020 - direct.mit.edu
Despite an ever-growing number of word representation models introduced for a large
number of languages, there is a lack of a standardized technique to provide insights into …

Detecting the target of sarcasm is hard: Really??

P Parameswaran, A Trotman, V Liesaputra… - Information Processing & …, 2021 - Elsevier
Sarcasm target detection (identifying the target of mockery in a sarcastic sentence) is an
emerging field in computational linguistics. Although there has been some research in this …

Improving skip-gram embeddings using BERT

Y Wang, L Cui, Y Zhang - IEEE/ACM Transactions on Audio …, 2021 - ieeexplore.ieee.org
Contextualized embeddings such as BERT and GPT have been shown to give significant
improvement in NLP tasks. On the other hand, static embeddings such as skip-gram and …

Trustworthy social bias measurement

R Bommasani, P Liang - Proceedings of the AAAI/ACM Conference on …, 2024 - ojs.aaai.org
How do we design measures of social bias that we trust? While prior work has introduced
several measures, no measure has gained widespread trust: instead, mounting evidence …

Democracy in context: using a distributional semantic model to study differences in the usage of democracy across languages and countries

S Dahlberg, S Axelsson, S Holmberg - Zeitschrift für Vergleichende …, 2020 - diva-portal.org
Cross-cultural survey research rests upon the assumption that if survey features are kept
constant, data will remain comparable across languages, cultures and countries. Yet …