Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Colpali: Efficient document retrieval with vision language models
Documents are visually rich structures that convey information through text, but also figures,
page layouts, tables, or even fonts. Since modern retrieval systems mainly rely on the textual …
page layouts, tables, or even fonts. Since modern retrieval systems mainly rely on the textual …
Auxiliary task demands mask the capabilities of smaller language models
Developmental psychologists have argued about when cognitive capacities such as
language understanding or theory of mind emerge. These debates often hinge on the …
language understanding or theory of mind emerge. These debates often hinge on the …
VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation
Existing metrics for evaluating the factuality of long-form text, such as FACTSCORE (Min et
al., 2023) and SAFE (Wei et al., 2024), decompose an input text into" atomic claims" and …
al., 2023) and SAFE (Wei et al., 2024), decompose an input text into" atomic claims" and …
Financemath: Knowledge-intensive math reasoning in finance domains
We introduce FinanceMath, a novel benchmark designed to evaluate LLMs' capabilities in
solving knowledge-intensive math reasoning problems. Compared to prior works, this study …
solving knowledge-intensive math reasoning problems. Compared to prior works, this study …
Fast state restoration in LLM serving with hcache
The growing complexity of LLM usage today, eg, multi-round conversation and retrieval-
augmented generation (RAG), makes contextual states (ie, KV cache) reusable across user …
augmented generation (RAG), makes contextual states (ie, KV cache) reusable across user …
On the Diversity of Synthetic Data and its Impact on Training Large Language Models
The rise of Large Language Models (LLMs) has accentuated the need for diverse, high-
quality pre-training data. Synthetic data emerges as a viable solution to the challenges of …
quality pre-training data. Synthetic data emerges as a viable solution to the challenges of …
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models
Despite the recent breakthroughs achieved by Large Vision Language Models (LVLMs) in
understanding and responding to complex visual-textual contexts, their inherent …
understanding and responding to complex visual-textual contexts, their inherent …
Decoder-only streaming transformer for simultaneous translation
Simultaneous Machine Translation (SiMT) generates translation while reading source
tokens, essentially producing the target prefix based on the source prefix. To achieve good …
tokens, essentially producing the target prefix based on the source prefix. To achieve good …
Evaluating language models as risk scores
Current question-answering benchmarks predominantly focus on accuracy in realizable
prediction tasks. Conditioned on a question and answer-key, does the most likely token …
prediction tasks. Conditioned on a question and answer-key, does the most likely token …
Automated Text Scoring in the Age of Generative AI for the GPU-poor
Current research on generative language models (GLMs) for automated text scoring (ATS)
has focused almost exclusively on querying proprietary models via Application …
has focused almost exclusively on querying proprietary models via Application …