SafetyPrompts: a systematic review of open datasets for evaluating and improving large language model safety
The last two years have seen a rapid growth in concerns around the safety of large
language models (LLMs). Researchers and practitioners have met these concerns by …
On prompt-driven safeguarding for large language models
Prepending model inputs with safety prompts is a common practice for safeguarding large
language models (LLMs) against queries with harmful intents. However, the underlying …
BiasAsker: Measuring the bias in conversational AI system
Powered by advanced Artificial Intelligence (AI) techniques, conversational AI systems, such
as ChatGPT, and digital assistants like Siri, have been widely deployed in daily life …
ProsocialDialog: A prosocial backbone for conversational agents
Most existing dialogue systems fail to respond properly to potentially unsafe user utterances
by either ignoring or passively agreeing with them. To address this issue, we introduce …
Mirages: On anthropomorphism in dialogue systems
Automated dialogue or conversational systems are anthropomorphised by developers and
personified by users. While a degree of anthropomorphism may be inevitable due to the …
Why so toxic? Measuring and triggering toxic behavior in open-domain chatbots
Chatbots are used in many applications, e.g., automated agents, smart home assistants,
interactive characters in online games, etc. Therefore, it is crucial to ensure they do not …
COLD: A benchmark for Chinese offensive language detection
Offensive language detection is increasingly crucial for maintaining a civilized social media
platform and deploying pre-trained language models. However, this task in Chinese is still …
[BOOK] Foundation models for natural language processing: Pre-trained language models integrating media
G Paaß, S Giesselbach - 2023 - library.oapen.org
This open access book provides a comprehensive overview of the state of the art in research
and applications of Foundation Models and is intended for readers familiar with basic …
[PDF] SafetyKit: First aid for measuring safety in open-domain conversational systems
The social impact of natural language processing and its applications has received
increasing attention. In this position paper, we focus on the problem of safety for end-to-end …
Through the lens of core competency: Survey on evaluation of large language models
From pre-trained language model (PLM) to large language model (LLM), the field of natural
language processing (NLP) has witnessed steep performance gains and wide practical …