Interpreting deep learning models in natural language processing: A review

X Sun, D Yang, X Li, T Zhang, Y Meng, H Qiu… - arxiv preprint arxiv …, 2021 - arxiv.org
Neural network models have achieved state-of-the-art performances in a wide range of
natural language processing (NLP) tasks. However, a long-standing criticism against neural …

Training data influence analysis and estimation: A survey

Z Hammoudeh, D Lowd - Machine Learning, 2024 - Springer
Good models require good training data. For overparameterized deep models, the causal
relationship between training data and model predictions is increasingly opaque and poorly …

State-of-the-art generalisation research in NLP: a taxonomy and review

D Hupkes, M Giulianelli, V Dankers, M Artetxe… - arxiv preprint arxiv …, 2022 - arxiv.org
The ability to generalise well is one of the primary desiderata of natural language
processing (NLP). Yet, what'good generalisation'entails and how it should be evaluated is …

Fastif: Scalable influence functions for efficient model interpretation and debugging

H Guo, NF Rajani, P Hase, M Bansal… - arxiv preprint arxiv …, 2020 - arxiv.org
Influence functions approximate the" influences" of training data-points for test predictions
and have a wide variety of applications. Despite the popularity, their computational cost …

Model generalization on COVID-19 fake news detection

Y Bang, E Ishii, S Cahyawijaya, Z Ji, P Fung - Combating Online Hostile …, 2021 - Springer
Amid the pandemic COVID-19, the world is facing unprecedented infodemic with the
proliferation of both fake and real information. Considering the problematic consequences …

Add-remove-or-relabel: Practitioner-friendly bias mitigation via influential fairness

B Richardson, P Sattigeri, D Wei… - Proceedings of the …, 2023 - dl.acm.org
Commensurate with the rise in algorithmic bias research, myriad algorithmic bias mitigation
strategies have been proposed in the literature. Nonetheless, many voice concerns about …

IndoRobusta: Towards robustness against diverse code-mixed Indonesian local languages

MF Adilazuarda, S Cahyawijaya, GI Winata… - arxiv preprint arxiv …, 2023 - arxiv.org
Significant progress has been made on Indonesian NLP. Nevertheless, exploration of the
code-mixing phenomenon in Indonesian is limited, despite many languages being …

Greenformers: Improving computation and memory efficiency in transformer models via low-rank approximation

S Cahyawijaya - arxiv preprint arxiv:2108.10808, 2021 - arxiv.org
In this thesis, we introduce Greenformers, a collection of model efficiency methods to
improve the model efficiency of the recently renowned transformer models with a low-rank …

InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks

S Banerjee, M Sarkar, P Saha, B Mathew… - arxiv preprint arxiv …, 2024 - arxiv.org
Recently, influence functions present an apparatus for achieving explainability for deep
neural models by quantifying the perturbation of individual train instances that might impact …

Covid-19 fake news detection using joint doc2vec and text features with PCA

H Mejia, C Chipantiza, J Llumiquinga, IR Amaro… - … on Advanced Research …, 2022 - Springer
With the current pandemic, it is imperative to stay up to date with the news and many
sources contribute to this purpose. However, there is also misinformation and fake news that …