Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Taxonomy of machine learning safety: A survey and primer
The open-world deployment of Machine Learning (ML) algorithms in safety-critical
applications such as autonomous vehicles needs to address a variety of ML vulnerabilities …
applications such as autonomous vehicles needs to address a variety of ML vulnerabilities …
[PDF][PDF] DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.
Abstract Generative Pre-trained Transformer (GPT) models have exhibited exciting progress
in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the …
in their capabilities, capturing the interest of practitioners and the public alike. Yet, while the …
Smoothllm: Defending large language models against jailbreaking attacks
Despite efforts to align large language models (LLMs) with human intentions, widely-used
LLMs such as GPT, Llama, and Claude are susceptible to jailbreaking attacks, wherein an …
LLMs such as GPT, Llama, and Claude are susceptible to jailbreaking attacks, wherein an …
Surgical fine-tuning improves adaptation to distribution shifts
A common approach to transfer learning under distribution shift is to fine-tune the last few
layers of a pre-trained model, preserving learned features while also adapting to the new …
layers of a pre-trained model, preserving learned features while also adapting to the new …
From admission to discharge: a systematic review of clinical natural language processing along the patient journey
Background Medical text, as part of an electronic health record, is an essential information
source in healthcare. Although natural language processing (NLP) techniques for medical …
source in healthcare. Although natural language processing (NLP) techniques for medical …
Trak: Attributing model behavior at scale
The goal of data attribution is to trace model predictions back to training data. Despite a long
line of work towards this goal, existing approaches to data attribution tend to force users to …
line of work towards this goal, existing approaches to data attribution tend to force users to …
Accuracy on the line: on the strong correlation between out-of-distribution and in-distribution generalization
For machine learning systems to be reliable, we must understand their performance in
unseen, out-of-distribution environments. In this paper, we empirically show that out-of …
unseen, out-of-distribution environments. In this paper, we empirically show that out-of …
[HTML][HTML] Multimodal neurons in artificial neural networks
Gabriel Goh: Research lead. Gabriel Goh first discovered multimodal neurons, sketched out
the project direction and paper outline, and did much of the conceptual and engineering …
the project direction and paper outline, and did much of the conceptual and engineering …
Discover and cure: Concept-aware mitigation of spurious correlation
Deep neural networks often rely on spurious correlations to make predictions, which hinders
generalization beyond training environments. For instance, models that associate cats with …
generalization beyond training environments. For instance, models that associate cats with …
Change is hard: A closer look at subpopulation shift
Machine learning models often perform poorly on subgroups that are underrepresented in
the training data. Yet, little is understood on the variation in mechanisms that cause …
the training data. Yet, little is understood on the variation in mechanisms that cause …