SafetyPrompts: a systematic review of open datasets for evaluating and improving large language model safety
The last two years have seen a rapid growth in concerns around the safety of large
language models (LLMs). Researchers and practitioners have met these concerns by …
Towards bidirectional human-AI alignment: A systematic review for clarifications, framework, and future directions
Recent advancements in general-purpose AI have highlighted the importance of guiding AI
systems towards the intended goals, ethical principles, and values of individuals and …
Political compass or spinning arrow? Towards more meaningful evaluations for values and opinions in large language models
Much recent work seeks to evaluate values and opinions in large language models (LLMs)
using multiple-choice surveys and questionnaires. Most of this work is motivated by …
Open problems in technical AI governance
AI progress is creating a growing range of risks and opportunities, but it is often unclear how
they should be navigated. In many cases, the barriers and uncertainties faced are at least …
Conifer: Improving complex constrained instruction-following ability of large language models
H Sun, L Liu, J Li, F Wang, B Dong, R Lin… - arXiv preprint arXiv …, 2024 - arxiv.org
The ability of large language models (LLMs) to follow instructions is crucial to real-world
applications. Despite recent advances, several studies have highlighted that LLMs struggle …
Gender, race, and intersectional bias in resume screening via language model retrieval
Artificial intelligence (AI) hiring tools have revolutionized resume screening, and large
language models (LLMs) have the potential to do the same. However, given the biases …
Beyond static AI evaluations: advancing human interaction evaluations for LLM harms and risks
Model evaluations are central to understanding the safety, risks, and societal impacts of AI
systems. While most real-world AI applications involve human-AI interaction, most current …
Structured chemistry reasoning with large language models
Large Language Models (LLMs) excel in diverse areas, yet struggle with complex scientific
reasoning, especially in the field of chemistry. Different from the simple chemistry tasks (e.g. …
Instruct and extract: Instruction tuning for on-demand information extraction
Large language models with instruction-following capabilities open the door to a wider
group of users. However, when it comes to information extraction, a classic task in natural …
Dolomites: Domain-Specific Long-Form Methodical Tasks
Experts in various fields routinely perform methodical writing tasks to plan, organize, and
report their work. From a clinician writing a differential diagnosis for a patient, to a teacher …