[PDF][PDF] An evaluation framework for plagiarism detection
We present an evaluation framework for plagiarism detection. 1 The framework provides
performance measures that address the specifics of plagiarism detection, and the PAN-PC …
performance measures that address the specifics of plagiarism detection, and the PAN-PC …
Overview of the author identification task at PAN 2014
The author identification task at PAN-2014 focuses on author verification. Similar to PAN-
2013 we are given a set of documents by the same author along with exactly one document …
2013 we are given a set of documents by the same author along with exactly one document …
Overview of the 5th international competition on plagiarism detection
This paper overviews 18 plagiarism detectors that have been evaluated within the fifth
international competition on plagiarism detection at PAN 2013. We report on their …
international competition on plagiarism detection at PAN 2013. We report on their …
[PDF][PDF] Removing boilerplate and duplicate content from web corpora
J Pomikálek - Disertacnı práce, Masarykova univerzita, Fakulta …, 2011 - is.muni.cz
In the recent years, the Web has become a popular source of textual data for linguistic
research. The Web provides an extremely large volume of texts in many languages …
research. The Web provides an extremely large volume of texts in many languages …
Improving the reproducibility of PAN's shared tasks: Plagiarism detection, author identification, and author profiling
This paper reports on the PAN 2014 evaluation lab which hosts three shared tasks on
plagiarism detection, author identification, and author profiling. To improve the …
plagiarism detection, author identification, and author profiling. To improve the …
Wikipedia vandalism detection: Combining natural language, metadata, and reputation features
Wikipedia is an online encyclopedia which anyone can edit. While most edits are
constructive, about 7% are acts of vandalism. Such behavior is characterized by …
constructive, about 7% are acts of vandalism. Such behavior is characterized by …
Plagiarism meets paraphrasing: Insights for the next generation in automatic plagiarism detection
Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little
attention has been paid to its analysis in the framework of automatic plagiarism detection …
attention has been paid to its analysis in the framework of automatic plagiarism detection …
Overview of the 3rd international competition on plagiarism detection
This paper overviews eleven plagiarism detectors that have been developed and evaluated
within PAN'11. We survey the detection approaches developed for the two sub-tasks" …
within PAN'11. We survey the detection approaches developed for the two sub-tasks" …
Plagiarism detection using stopword n‐grams
E Stamatatos - Journal of the American Society for Information …, 2011 - Wiley Online Library
In this paper a novel method for detecting plagiarized passages in document collections is
presented. In contrast to previous work in this field that uses content terms to represent …
presented. In contrast to previous work in this field that uses content terms to represent …
State-of-the-art in detecting academic plagiarism
The problem of academic plagiarism has been present for centuries. Yet, the widespread
dissemination of information technology, including the internet, made plagiarising much …
dissemination of information technology, including the internet, made plagiarising much …