February 1, 2007

Plagiarism Detection in arXiv (2007)

Sorokina Daria, Gehrke Johannes, Warner Simeon, Ginsparg Paul

Abstract
We describe a large-scale application of methods for finding plagiarism in research document collections. The methods are applied to a collection of 284,834 documents collected by arXiv.org over a 14 year period, covering a few different research disciplines. The methodology efficiently detects a variety of problematic author behaviors, and heuristics are developed to reduce the number of false positives. The methods are also efficient enough to implement as a real-time submission screen for a collection many times larger>>>

Random Posts


  • Allow me to rephrase, and boost my tally of articles: THE

    Rebecca AttwoodScholars are passing off old work as new to drive up publications counts. Pressure to publish is pushing many academics to plagiarise large volumes of their own work by "dressing up" their old research to appear as if it were new, a study has found.Researchers using text-matching s... READ MORE>>

  • Publish or perish, but at what cost?

    J Clin Invest. 2008 July 1; 118(7): 2368. doi: 10.1172/JCI36371. Ushma S. Neill, Executive EditorThe academic scientific enterprise rewards those with the longest CVs and the most publications. Under pressure to generate voluminous output, scientists often fall prey to double publishing, self plagi... READ MORE>>

  • Repairing research integrity : COMMENTARY: NATURE

    A survey suggests that many research misconduct incidents in the United States go unreported to the Office of Research Integrity. Sandra L. Titus, James A. Wells and Lawrence J. Rhoades say it’s time to change that.>>> READ MORE>>

  • Scientific misconduct: Tip of the iceberg?

    Editor's Summary A survey of US researchers suggests that scientific misconduct is greatly under-reported. The Office of Research Integrity was told of only 201 instances of likely misconduct relating to work funded by the Department of Health and Human Services in three years. Yet extrapolation fr... READ MORE>>

  • EDITORIAL - Research Integrity and Scientific Misconduct

    Anthony J. (Tony) Smith, Editor J Dent Res 87(3):197, 2008 >>> Most institutions have policies and guidelines for research integrity and misconduct, but I wonder how many of us have read these? The fact that some countries have set up organizations to regulate research integrity perhaps re... READ MORE>>

  • How Did Honor Evolve?

    The biology of integrity By David P. BARASH The Chronicle Review,Volume 54, Issue 37, Page B11 P.S.- David P. Barash is an evolutionary biologist, a professor of psychology at the University of Washington, and a frequent Chronicle contributor. He has never had to turn in any honor-code violators b... READ MORE>>

  • The Plagiarism Decision Process: The Role of Pressure and Rationalization

    IEEE TRANSACTIONS ON EDUCATION, VOL. 51, NO. 2, Page(s): 152-156, MAY 2008 Richard H. McCuen Abstract — Plagiarism is more than just the failure to use quotation marks or to cite a paraphrased passage. Dual publishing, self-plagiarism, and ghost authorship are other forms of plagiarism. Plagiari... READ MORE>>

.

.
.

Popular Posts