[PDF][PDF] An evaluation framework for plagiarism detection

M Potthast, B Stein, A Barrón-Cedeño… - Coling 2010 …, 2010 - aclanthology.org
We present an evaluation framework for plagiarism detection. 1 The framework provides
performance measures that address the specifics of plagiarism detection, and the PAN-PC …

Overview of the author identification task at PAN 2014

E Stamatatos, W Daelemans, B Verhoeven… - CEUR Workshop …, 2014 - cris.unibo.it
The author identification task at PAN-2014 focuses on author verification. Similar to PAN-
2013 we are given a set of documents by the same author along with exactly one document …

Overview of the 5th international competition on plagiarism detection

M Potthast, M Hagen, T Gollub, M Tippmann… - CLEF Conference on …, 2013 - riunet.upv.es
This paper overviews 18 plagiarism detectors that have been evaluated within the fifth
international competition on plagiarism detection at PAN 2013. We report on their …

[PDF][PDF] Removing boilerplate and duplicate content from web corpora

J Pomikálek - Disertacnı práce, Masarykova univerzita, Fakulta …, 2011 - is.muni.cz
In the recent years, the Web has become a popular source of textual data for linguistic
research. The Web provides an extremely large volume of texts in many languages …

Improving the reproducibility of PAN's shared tasks: Plagiarism detection, author identification, and author profiling

M Potthast, T Gollub, F Rangel, P Rosso… - … , and Interaction: 5th …, 2014 - Springer
This paper reports on the PAN 2014 evaluation lab which hosts three shared tasks on
plagiarism detection, author identification, and author profiling. To improve the …

Wikipedia vandalism detection: Combining natural language, metadata, and reputation features

BT Adler, L De Alfaro, SM Mola-Velasco… - … and Intelligent Text …, 2011 - Springer
Wikipedia is an online encyclopedia which anyone can edit. While most edits are
constructive, about 7% are acts of vandalism. Such behavior is characterized by …

Plagiarism meets paraphrasing: Insights for the next generation in automatic plagiarism detection

A Barrón-Cedeño, M Vila, MA Martí… - Computational …, 2013 - direct.mit.edu
Although paraphrasing is the linguistic mechanism underlying many plagiarism cases, little
attention has been paid to its analysis in the framework of automatic plagiarism detection …

Overview of the 3rd international competition on plagiarism detection

M Potthast, A Eiselt, A Barrón-Cedeño, B Stein… - CEUR workshop …, 2011 - cris.unibo.it
This paper overviews eleven plagiarism detectors that have been developed and evaluated
within PAN'11. We survey the detection approaches developed for the two sub-tasks" …

Plagiarism detection using stopword n‐grams

E Stamatatos - Journal of the American Society for Information …, 2011 - Wiley Online Library
In this paper a novel method for detecting plagiarized passages in document collections is
presented. In contrast to previous work in this field that uses content terms to represent …

State-of-the-art in detecting academic plagiarism

N Meuschke, B Gipp - International Journal for Educational Integrity, 2013 - ojs.unisa.edu.au
The problem of academic plagiarism has been present for centuries. Yet, the widespread
dissemination of information technology, including the internet, made plagiarising much …