Opportunities and risks of large language models in psychiatry
The integration of large language models (LLMs) into mental healthcare and research
heralds a potentially transformative shift, one offering enhanced access to care, efficient data …
HarmBench: A standardized evaluation framework for automated red teaming and robust refusal
Automated red teaming holds substantial promise for uncovering and mitigating the risks
associated with the malicious use of large language models (LLMs), yet the field lacks a …
Rainbow teaming: Open-ended generation of diverse adversarial prompts
As large language models (LLMs) become increasingly prevalent across many real-world
applications, understanding and enhancing their robustness to adversarial attacks is of …
ArtPrompt: ASCII art-based jailbreak attacks against aligned LLMs
Safety is critical to the usage of large language models (LLMs). Multiple techniques such as
data filtering and supervised fine-tuning have been developed to strengthen LLM safety …
Red-Teaming for generative AI: Silver bullet or security theater?
In response to rising concerns surrounding the safety, security, and trustworthiness of
Generative AI (GenAI) models, practitioners and regulators alike have pointed to AI red …
Privacy in large language models: Attacks, defenses and future directions
The advancement of large language models (LLMs) has significantly enhanced the ability to
effectively tackle various downstream NLP tasks and unify these tasks into generative …
Large language model supply chain: A research agenda
The rapid advancement of large language models (LLMs) has revolutionized artificial
intelligence, introducing unprecedented capabilities in natural language processing and …
Jailbreak attacks and defenses against large language models: A survey
Large Language Models (LLMs) have performed exceptionally in various text-generative
tasks, including question answering, translation, code completion, etc. However, the over …
A safe harbor for AI evaluation and red teaming
Independent evaluation and red teaming are critical for identifying the risks posed by
generative AI systems. However, the terms of service and enforcement strategies used by …
LLM defenses are not robust to multi-turn human jailbreaks yet
Recent large language model (LLM) defenses have greatly improved models' ability to
refuse harmful queries, even when adversarially attacked. However, LLM defenses are …