Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Leace: Perfect linear concept erasure in closed form
N Belrose, D Schneider-Joseph… - Advances in …, 2023 - proceedings.neurips.cc
Abstract Concept erasure aims to remove specified features from a representation. It can
improve fairness (eg preventing a classifier from using gender or race) and interpretability …
improve fairness (eg preventing a classifier from using gender or race) and interpretability …
Spectral editing of activations for large language model alignment
Large language models (LLMs) often exhibit undesirable behaviours, such as generating
untruthful or biased content. Editing their internal representations has been shown to be …
untruthful or biased content. Editing their internal representations has been shown to be …
Log-linear guardedness and its implications
Methods for erasing human-interpretable concepts from neural representations that assume
linearity have been found to be tractable and useful. However, the impact of this removal on …
linearity have been found to be tractable and useful. However, the impact of this removal on …
Gold doesn't always glitter: Spectral removal of linear and nonlinear guarded attribute information
We describe a simple and effective method (Spectral Attribute removaL; SAL) to remove
private or guarded information from neural representations. Our method uses matrix …
private or guarded information from neural representations. Our method uses matrix …
Cross-attention is not enough: Incongruity-aware dynamic hierarchical fusion for multimodal affect recognition
Fusing multiple modalities has proven effective for multimodal information processing.
However, the incongruity between modalities poses a challenge for multimodal fusion …
However, the incongruity between modalities poses a challenge for multimodal fusion …
BERT is not the count: Learning to match mathematical statements with proofs
We introduce a task consisting in matching a proof to a given mathematical statement. The
task fits well within current research on Mathematical Information Retrieval and, more …
task fits well within current research on Mathematical Information Retrieval and, more …
Surgical Feature-Space Decomposition of LLMs: Why, When and How?
Low-rank approximations, of the weight and feature space can enhance the performance of
deep learning models, whether in terms of improving generalization or reducing the latency …
deep learning models, whether in terms of improving generalization or reducing the latency …
Taco: Targeted concept erasure prevents non-linear classifiers from detecting protected attributes
Ensuring fairness in NLP models is crucial, as they often encode sensitive attributes like
gender and ethnicity, leading to biased outcomes. Current concept erasure methods attempt …
gender and ethnicity, leading to biased outcomes. Current concept erasure methods attempt …
Advancing Fairness in Natural Language Processing: From Traditional Methods to Explainability
F Jourdan - arxiv preprint arxiv:2410.12511, 2024 - arxiv.org
The burgeoning field of Natural Language Processing (NLP) stands at a critical juncture
where the integration of fairness within its frameworks has become an imperative. This PhD …
where the integration of fairness within its frameworks has become an imperative. This PhD …
How To Build Competitive Multi-gender Speech Translation Models For Controlling Speaker Gender Translation
When translating from notional gender languages (eg, English) into grammatical gender
languages (eg, Italian), the generated translation requires explicit gender assignments for …
languages (eg, Italian), the generated translation requires explicit gender assignments for …