Aya dataset: An open-access collection for multilingual instruction tuning
Datasets are foundational to many breakthroughs in modern artificial intelligence. Many
recent achievements in the space of natural language processing (NLP) can be attributed to …
Grag: Graph retrieval-augmented generation
Naive Retrieval-Augmented Generation (RAG) focuses on individual documents during
retrieval and, as a result, falls short in handling networked documents which are very …
Docfinqa: A long-context financial reasoning dataset
For large language models (LLMs) to be effective in the financial domain--where each
decision can have a significant impact--it is necessary to investigate realistic tasks and data …
Anchor-based large language models
Large language models (LLMs) predominantly employ decoder-only transformer
architectures, necessitating the retention of keys/values information for historical tokens to …
Qlarify: Bridging scholarly abstracts and papers with recursively expandable summaries
As scientific literature has grown exponentially, researchers often rely on paper triaging
strategies such as browsing abstracts before deciding to delve into a paper's full text …
TruthReader: Towards Trustworthy Document Assistant Chatbot with Reliable Attribution
Document assistant chatbots are empowered with extensive capabilities by Large Language
Models (LLMs) and have exhibited significant advancements. However, these systems may …
DocXChain: A powerful open-source toolchain for document parsing and beyond
C Yao - arXiv preprint arXiv:2310.12430, 2023 - arxiv.org
In this report, we introduce DocXChain, a powerful open-source toolchain for document
parsing, which is designed and developed to automatically convert the rich information …
WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
Multimodal document understanding is a challenging task to process and comprehend large
amounts of textual and visual information. Recent advances in Large Language Models …
M-longdoc: A benchmark for multimodal super-long document understanding and a retrieval-aware tuning framework
The ability to understand and answer questions over documents can be useful in many
business and practical applications. However, documents often contain lengthy and diverse …
Fragrel: Exploiting fragment-level relations in the external memory of large language models
To process contexts with unlimited length using Large Language Models (LLMs), recent
studies explore hierarchically managing the long text. Only several text fragments are taken …