Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
From anecdotal evidence to quantitative evaluation methods: A systematic review on evaluating explainable ai
The rising popularity of explainable artificial intelligence (XAI) to understand high-performing
black boxes raised the question of how to evaluate explanations of machine learning (ML) …
black boxes raised the question of how to evaluate explanations of machine learning (ML) …
Towards generalisable hate speech detection: a review on obstacles and solutions
Hate speech is one type of harmful online content which directly attacks or promotes hate
towards a group or an individual member based on their actual or perceived aspects of …
towards a group or an individual member based on their actual or perceived aspects of …
Interpreting graph neural networks for NLP with differentiable edge masking
Graph neural networks (GNNs) have become a popular approach to integrating structural
inductive biases into NLP models. However, there has been little work on interpreting them …
inductive biases into NLP models. However, there has been little work on interpreting them …
Glue-x: Evaluating natural language understanding models from an out-of-distribution generalization perspective
Pre-trained language models (PLMs) are known to improve the generalization performance
of natural language understanding models by leveraging large amounts of data during the …
of natural language understanding models by leveraging large amounts of data during the …
Self-attention attribution: Interpreting information interactions inside transformer
The great success of Transformer-based models benefits from the powerful multi-head self-
attention mechanism, which learns token dependencies and encodes contextual information …
attention mechanism, which learns token dependencies and encodes contextual information …
Handling bias in toxic speech detection: A survey
Detecting online toxicity has always been a challenge due to its inherent subjectivity. Factors
such as the context, geography, socio-political climate, and background of the producers …
such as the context, geography, socio-political climate, and background of the producers …
Contextualizing hate speech classifiers with post-hoc explanation
Hate speech classifiers trained on imbalanced datasets struggle to determine if group
identifiers like" gay" or" black" are used in offensive or prejudiced ways. Such biases …
identifiers like" gay" or" black" are used in offensive or prejudiced ways. Such biases …
Exposing the limits of zero-shot cross-lingual hate speech detection
D Nozza - Proceedings of the 59th Annual Meeting of the …, 2021 - aclanthology.org
Reducing and counter-acting hate speech on Social Media is a significant concern. Most of
the proposed automatic methods are conducted exclusively on English and very few …
the proposed automatic methods are conducted exclusively on English and very few …
BERT meets shapley: Extending SHAP explanations to transformer-based classifiers
Transformer-based neural networks offer very good classification performance across a
wide range of domains, but do not provide explanations of their predictions. While several …
wide range of domains, but do not provide explanations of their predictions. While several …
Entropy-based attention regularization frees unintended bias mitigation from lists
Natural Language Processing (NLP) models risk overfitting to specific terms in the training
data, thereby reducing their performance, fairness, and generalizability. Eg, neural hate …
data, thereby reducing their performance, fairness, and generalizability. Eg, neural hate …