Methods for interpreting and understanding deep neural networks
This paper provides an entry point to the problem of interpreting a deep neural network
model and explaining its predictions. It is based on a tutorial given at ICASSP 2017. As a …
Post-hoc interpretability for neural NLP: A survey
Neural networks for NLP are becoming increasingly complex and widespread, and there is a
growing concern if these models are responsible to use. Explaining models helps to address …
On the opportunities and risks of foundation models
AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are
trained on broad data at scale and are adaptable to a wide range of downstream tasks. We …
Explainability for large language models: A survey
Large language models (LLMs) have demonstrated impressive capabilities in natural
language processing. However, their internal mechanisms are still unclear and this lack of …
Unmasking Clever Hans predictors and assessing what machines really learn
Current learning machines have successfully solved hard application problems, reaching
high accuracy and displaying seemingly intelligent behavior. Here we apply recent …
Explainable artificial intelligence: A survey
In the last decade, with availability of large datasets and more computing power, machine
learning systems have achieved (super) human performance in a wide variety of tasks …
A survey of the state of explainable AI for natural language processing
Recent years have seen important advances in the quality of state-of-the-art models, but this
has come at the expense of models becoming less interpretable. This survey presents an …
Is attention interpretable?
Attention mechanisms have recently boosted performance on a range of NLP tasks.
Because attention layers explicitly weight input components' representations, it is also often …
Generating natural language adversarial examples through probability weighted word saliency
We address the problem of adversarial attacks on text classification, which is rarely studied
comparing to attacks on image classification. The challenge of this task is to generate …
Linguistic Knowledge and Transferability of Contextual Representations
NF Liu - arXiv preprint arXiv:1903.08855, 2019 - fq.pkwyx.com
Contextual word representations derived from large-scale neural language models are
successful across a diverse set of NLP tasks, suggesting that they encode useful and …