Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Towards bidirectional human-ai alignment: A systematic review for clarifications, framework, and future directions
Recent advancements in general-purpose AI have highlighted the importance of guiding AI
systems towards the intended goals, ethical principles, and values of individuals and …
systems towards the intended goals, ethical principles, and values of individuals and …
Large language model alignment: A survey
Recent years have witnessed remarkable progress made in large language models (LLMs).
Such advancements, while garnering significant attention, have concurrently elicited various …
Such advancements, while garnering significant attention, have concurrently elicited various …
Language-Models-as-a-Service: Overview of a new paradigm and its challenges
Some of the most powerful language models currently are proprietary systems, accessible
only via (typically restrictive) web or software programming interfaces. This is the …
only via (typically restrictive) web or software programming interfaces. This is the …
A survey on knowledge distillation of large language models
In the era of Large Language Models (LLMs), Knowledge Distillation (KD) emerges as a
pivotal methodology for transferring advanced capabilities from leading proprietary LLMs …
pivotal methodology for transferring advanced capabilities from leading proprietary LLMs …
Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback
Large language models (LLMs) are used to generate content for a wide range of tasks, and
are set to reach a growing audience in coming years due to integration in product interfaces …
are set to reach a growing audience in coming years due to integration in product interfaces …
Knowledge of cultural moral norms in large language models
Moral norms vary across cultures. A recent line of work suggests that English large language
models contain human-like moral biases, but these studies typically do not examine moral …
models contain human-like moral biases, but these studies typically do not examine moral …
Aligning large language models through synthetic feedback
Aligning large language models (LLMs) to human values has become increasingly
important as it enables sophisticated steering of LLMs. However, it requires significant …
important as it enables sophisticated steering of LLMs. However, it requires significant …
Interactive natural language processing
Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within
the field of NLP, aimed at addressing limitations in existing frameworks while aligning with …
the field of NLP, aimed at addressing limitations in existing frameworks while aligning with …
Prp: Propagating universal perturbations to attack large language model guard-rails
Large language models (LLMs) are typically aligned to be harmless to humans.
Unfortunately, recent work has shown that such models are susceptible to automated …
Unfortunately, recent work has shown that such models are susceptible to automated …
Ethical reasoning over moral alignment: A case and framework for in-context ethical policies in LLMs
In this position paper, we argue that instead of morally aligning LLMs to specific set of ethical
principles, we should infuse generic ethical reasoning capabilities into them so that they can …
principles, we should infuse generic ethical reasoning capabilities into them so that they can …