SafetyPrompts: a systematic review of open datasets for evaluating and improving large language model safety
The last two years have seen a rapid growth in concerns around the safety of large
language models (LLMs). Researchers and practitioners have met these concerns by …
Culturally aware and adapted NLP: A taxonomy and a survey of the state of the art
The surge of interest in culturally aware and adapted Natural Language Processing (NLP)
has inspired much recent research. However, the lack of common understanding of the …
KMMLU: Measuring massive multitask language understanding in Korean
We propose KMMLU, a new Korean benchmark with 35,030 expert-level multiple-choice
questions across 45 subjects ranging from humanities to STEM. While prior Korean …
CultureBank: An online community-driven knowledge base towards culturally aware language technologies
To enhance language models' cultural awareness, we design a generalizable pipeline to
construct cultural knowledge bases from different online communities on a massive scale …
Social bias evaluation for large language models requires prompt variations
Warning: This paper contains examples of stereotypes and biases. Large Language Models
(LLMs) exhibit considerable social biases, and various studies have tried to evaluate and …
CaLMQA: Exploring culturally specific long-form question answering across 23 languages
S Arora, M Karpinska, HT Chen, I Bhattacharjee… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) are used for long-form question answering (LFQA), which
requires them to generate paragraph-length answers to complex questions. While LFQA has …
Exploring cross-cultural differences in English hate speech annotations: From dataset construction to analysis
Warning: this paper contains content that may be offensive or upsetting. Most hate speech
datasets neglect the cultural diversity within a single language, resulting in a critical …
Survey of cultural awareness in language models: Text and beyond
Large-scale deployment of large language models (LLMs) in various applications, such as
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …
Ask LLMs Directly, "What shapes your bias?": Measuring Social Bias in Large Language Models
Social bias is shaped by the accumulation of social perceptions towards targets across
various demographic identities. To fully understand such social bias in large language …
Do Multilingual Large Language Models Mitigate Stereotype Bias?
While preliminary findings indicate that multilingual LLMs exhibit reduced bias compared to
monolingual ones, a comprehensive understanding of the effect of multilingual training on …