Glitch tokens in large language models: Categorization taxonomy and effective detection
With the expanding application of Large Language Models (LLMs) in various domains, it
becomes imperative to comprehensively investigate their unforeseen behaviors and …
An empirical study on large language models in accuracy and robustness under Chinese industrial scenarios
Recent years have witnessed the rapid development of large language models (LLMs) in
various domains. To better serve the large number of Chinese users, many commercial …
MORTAR: Metamorphic Multi-turn Testing for LLM-based Dialogue Systems
G Guo, A Aleti, N Neelofar… - arXiv preprint arXiv …, 2024 - arxiv.org
With the widespread application of LLM-based dialogue systems in daily life, quality
assurance has become more important than ever. Recent research has successfully …
Combating Missed Recalls in E-commerce Search: A CoT-Prompting Testing Approach
S Wu, Y Hu, Y Wang, J Gu, J Meng, L Fan… - … Proceedings of the …, 2024 - dl.acm.org
Search components in e-commerce apps, often complex AI-based systems, are prone to
bugs that can lead to missed recalls—situations where items that should be listed in search …
SPOLRE: Semantic Preserving Object Layout Reconstruction for Image Captioning System Testing
Image captioning (IC) systems, such as Microsoft Azure Cognitive Service, translate image
content into descriptive language but can generate inaccuracies leading to …
MTAS: A Reference-Free Approach for Evaluating Abstractive Summarization Systems
Abstractive summarization (AS) systems, which aim to generate a text for summarizing
crucial information of the original document, have been widely adopted in recent years …
VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
Z Chang, M Li, J Wang, C Li, Q Wang - arXiv preprint arXiv:2403.02581, 2024 - arxiv.org
Visual entailment (VE) is a multimodal reasoning task consisting of image-sentence pairs
whereby a premise is defined by an image, and a hypothesis is described by a sentence …