A primer in BERTology: What we know about how BERT works

A Rogers, O Kovaleva, A Rumshisky - Transactions of the Association …, 2021 - direct.mit.edu
Transformer-based models have pushed state of the art in many areas of NLP, but our
understanding of what is behind their success is still limited. This paper is the first survey of …

Gan memory with no forgetting

Y Cong, M Zhao, J Li, S Wang… - Advances in Neural …, 2020 - proceedings.neurips.cc
As a fundamental issue in lifelong learning, catastrophic forgetting is directly caused by
inaccessible historical data; accordingly, if the data (information) were memorized perfectly …

From word types to tokens and back: A survey of approaches to word meaning representation and interpretation

M Apidianaki - Computational Linguistics, 2023 - direct.mit.edu
Vector-based word representation paradigms situate lexical meaning at different levels of
abstraction. Distributional and static embedding models generate a single vector per word …

Joint intent detection and slot filling using weighted finite state transducer and BERT

WA Abro, G Qi, M Aamir, Z Ali - Applied Intelligence, 2022 - Springer
Intent detection and slot filling are the two most essential tasks of natural language
understanding (NLU). Deep neural models have produced impressive results on these …

Transfer fine-tuning of BERT with phrasal paraphrases

Y Arase, J Tsujii - Computer Speech & Language, 2021 - Elsevier
Sentence pair modelling is defined as the task of identifying the semantic interaction
between a sentence pair, ie., paraphrase and textual entailment identification and semantic …

Uncovering trauma in genocide tribunals: An NLP approach using the Genocide Transcript Corpus

M Schirmer, IMO Nolasco, E Mosca, S Xu… - Proceedings of the …, 2023 - dl.acm.org
This paper applies Natural Language Processing (NLP) methods to analyze the exposure to
trauma experienced by witnesses in international criminal tribunals when testifying in court …

Fine-tuning transformers: Vocabulary transfer

V Mosin, I Samenko, B Kozlovskii, A Tikhonov… - Artificial Intelligence, 2023 - Elsevier
Transformers are responsible for the vast majority of recent advances in natural language
processing. The majority of practical natural language processing applications of these …

Offensive language detection in turkish tweets with bert models

A Özberk, İ Çiçekli - 2021 6th international conference on …, 2021 - ieeexplore.ieee.org
As the insulting statements increase on the online platform, these negative statements
create a reaction and disturb the peace of society. Offensive language detection research …

Quad-Faceted Feature-Based Graph Network for Domain-Agnostic Text Classification to Enhance Learning Effectiveness

S Supraja, AWH Khong - IEEE Transactions on Computational …, 2024 - ieeexplore.ieee.org
Enhancing learning effectiveness requires one to define suitable learning outcomes and
align assessment constructs with these outcomes. We present a quad-faceted feature-based …

Large Language Models Spot Phishing Emails with Surprising Accuracy: A Comparative Analysis of Performance

H Patel, U Rehman, F Iqbal - arxiv preprint arxiv:2404.15485, 2024 - arxiv.org
Phishing, a prevalent cybercrime tactic for decades, remains a significant threat in today's
digital world. By leveraging clever social engineering elements and modern technology …