Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Document parsing unveiled: Techniques, challenges, and prospects for structured information extraction
Document parsing is essential for converting unstructured and semi-structured documents-
such as contracts, academic papers, and invoices-into structured, machine-readable data …
such as contracts, academic papers, and invoices-into structured, machine-readable data …
OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation
Retrieval-augmented Generation (RAG) enhances Large Language Models (LLMs) by
integrating external knowledge to reduce hallucinations and incorporate up-to-date …
integrating external knowledge to reduce hallucinations and incorporate up-to-date …
VISA: Retrieval Augmented Generation with Visual Source Attribution
Generation with source attribution is important for enhancing the verifiability of retrieval-
augmented generation (RAG) systems. However, existing approaches in RAG primarily link …
augmented generation (RAG) systems. However, existing approaches in RAG primarily link …
UniCoRN: Unified Commented Retrieval Network with LMMs
Multimodal retrieval methods have limitations in handling complex, compositional queries
that require reasoning about the visual content of both the query and the retrieved entities …
that require reasoning about the visual content of both the query and the retrieved entities …
MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval
Despite the rapidly growing demand for multimodal retrieval, progress in this field remains
severely constrained by a lack of training data. In this paper, we introduce MegaPairs, a …
severely constrained by a lack of training data. In this paper, we introduce MegaPairs, a …
Document Screenshot Retrievers are Vulnerable to Pixel Poisoning Attacks
Recent advancements in dense retrieval have introduced vision-language model (VLM)-
based retrievers, such as DSE and ColPali, which leverage document screenshots …
based retrievers, such as DSE and ColPali, which leverage document screenshots …
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating
Large vision language models (LVLMs) have improved the document understanding
capabilities remarkably, enabling the handling of complex document elements, longer …
capabilities remarkably, enabling the handling of complex document elements, longer …
VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos
Retrieval-Augmented Generation (RAG) has demonstrated remarkable success in
enhancing Large Language Models (LLMs) through external knowledge integration, yet its …
enhancing Large Language Models (LLMs) through external knowledge integration, yet its …
How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey
Y Qi, H Li, Y Song, X Wu, J Luo - arxiv preprint arxiv:2412.08158, 2024 - arxiv.org
The exploration of various vision-language tasks, such as visual captioning, visual question
answering, and visual commonsense reasoning, is an important area in artificial intelligence …
answering, and visual commonsense reasoning, is an important area in artificial intelligence …
An archaeological Catalog Collection Method Based on Large Vision-Language Models
Archaeological catalogs, containing key elements such as artifact images, morphological
descriptions, and excavation information, are essential for studying artifact evolution and …
descriptions, and excavation information, are essential for studying artifact evolution and …