February 1, 2007

Plagiarism Detection in arXiv (2007)

Daria Sorokina, Johannes Gehrke, Simeon Warner, Paul Ginsparg

Abstract
We describe a large-scale application of methods for finding plagiarism in research document collections. The methods are applied to a collection of 284,834 documents collected by arXiv.org over a 14-year period, covering a few different research disciplines. The methodology efficiently detects a variety of problematic author behaviors, and heuristics are developed to reduce the number of false positives. The methods are also efficient enough to implement as a real-time submission screen for a collection many times larger.
