Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Neural machine translation for low-resource languages: A survey
Neural Machine Translation (NMT) has seen tremendous growth in the last ten years since
the early 2000s and has already entered a mature phase. While considered the most widely …
the early 2000s and has already entered a mature phase. While considered the most widely …
[HTML][HTML] A survey on recent named entity recognition and relationship extraction techniques on clinical texts
Significant growth in Electronic Health Records (EHR) over the last decade has provided an
abundance of clinical text that is mostly unstructured and untapped. This huge amount of …
abundance of clinical text that is mostly unstructured and untapped. This huge amount of …
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
Language models demonstrate both quantitative improvement and new qualitative
capabilities with increasing scale. Despite their potentially transformative impact, these new …
capabilities with increasing scale. Despite their potentially transformative impact, these new …
Refiner: Reasoning feedback on intermediate representations
Language models (LMs) have recently shown remarkable performance on reasoning tasks
by explicitly generating intermediate inferences, eg, chain-of-thought prompting. However …
by explicitly generating intermediate inferences, eg, chain-of-thought prompting. However …
mgpt: Few-shot learners go multilingual
This paper introduces mGPT, a multilingual variant of GPT-3, pretrained on 61 languages
from 25 linguistically diverse language families using Wikipedia and the C4 Corpus. We …
from 25 linguistically diverse language families using Wikipedia and the C4 Corpus. We …
Comparison of text preprocessing methods
CP Chai - Natural Language Engineering, 2023 - cambridge.org
Text preprocessing is not only an essential step to prepare the corpus for modeling but also
a key area that directly affects the natural language processing (NLP) application results. For …
a key area that directly affects the natural language processing (NLP) application results. For …
Data augmentation using llms: Data perspectives, learning paradigms and challenges
In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has
emerged as a pivotal technique for enhancing model performance by diversifying training …
emerged as a pivotal technique for enhancing model performance by diversifying training …
An empirical survey of data augmentation for limited data learning in NLP
NLP has achieved great progress in the past decade through the use of neural models and
large labeled datasets. The dependence on abundant data prevents NLP models from being …
large labeled datasets. The dependence on abundant data prevents NLP models from being …
Walklm: A uniform language model fine-tuning framework for attributed graph embedding
Graphs are widely used to model interconnected entities and improve downstream
predictions in various real-world applications. However, real-world graphs nowadays are …
predictions in various real-world applications. However, real-world graphs nowadays are …
mmarco: A multilingual version of the ms marco passage ranking dataset
The MS MARCO ranking dataset has been widely used for training deep learning models for
IR tasks, achieving considerable effectiveness on diverse zero-shot scenarios. However, this …
IR tasks, achieving considerable effectiveness on diverse zero-shot scenarios. However, this …