February 1, 2007

Plagiarism Detection in arXiv (2007)

Sorokina Daria, Gehrke Johannes, Warner Simeon, Ginsparg Paul

Abstract
We describe a large-scale application of methods for finding plagiarism in research document collections. The methods are applied to a collection of 284,834 documents collected by arXiv.org over a 14 year period, covering a few different research disciplines. The methodology efficiently detects a variety of problematic author behaviors, and heuristics are developed to reduce the number of false positives. The methods are also efficient enough to implement as a real-time submission screen for a collection many times larger>>>

Random Posts


  • A good day for transparency

    Daniel Mietchen Some brief excerpts from today’s news on matters of plagiarized dissertations in Germany: >>> READ MORE>>

  • Research ethics: science faces On Fact and Fraud (Ars Technica)

    By JOHN TIMMER David Goodstein has a unique perspective on scientific fraud, having pursued a successful career in research physics before becoming the provost of Caltech, one of the world's premier research institutions. As an administrator, he helped formulate Caltech's first policy for scien... READ MORE>>

  • How journal editors can detect and deter scientific misconduct?

    Misconduct happens. So what can journal editors do find and prevent it? While we don’t claim to be experts in working on the other side of the fence — eg as editors — Ivan was flattered to be asked by session organizers at the Council of Science Editors to appear on a panel on the subject. He was j... READ MORE>>

  • Turkish testing official embroiled in new academic scandal

    A top educational official already embroiled in a controversy over an alleged cheating scandal on a national exam has been accused of academic plagiarism.According to the new claims, Professor Ali Demir, the chairman of Turkey’s Student Selection and Placement Center, or ÖSYM, plagiarized in an arti... READ MORE>>

  • Koch-Mehrin plagiarism charges thicken

    Silvana Koch-Mehrin, a leading light of Germany's Free Democratic Party and vice president of the European Parliament, has been accused of widely plagiarising her doctoral dissertation. A preliminary report released by the internet platform Vroniplag Wiki says 56 of the 201 pages of the MEP’s disse... READ MORE>>

  • Keeping science fair

    Alex Holcombe Blind justice. A beautiful ideal! That the merits of a case are to be decided without regard to the identities of the parties involved or the size of their bank accounts. This is something science aspires to in evaluating manuscripts for publication. In fact it’s fundamental to the in... READ MORE>>

  • Plagiarism: Can It Be Stopped?

    G. Jay Christensen,  Business Communication Quarterly, published online 13 April 2011 PLAGIARISM CAN BE CONTROLLED, not stopped. The more appropriate question to ask is: What can be done to encourage students to “cheat” correctly by doing the assignment the way it was intended? I have lived my ... READ MORE>>

.

.
.

Popular Posts