December 11, 2014

Study of massive preprint archive hints at the geography of plagiarism - ScienceInsider

New analyses of the hundreds of thousands of technical manuscripts submitted to arXiv, the repository of digital preprint articles, are offering some intriguing insights into the consequences—and geography—of scientific plagiarism. It appears that copying text from other papers is more common in some nations than others, but the outcome is generally the same for authors who copy extensively: Their papers don’t get cited much.
Since its founding in 1991, arXiv has become the world's largest venue for sharing findings in physics, math, and other mathematical fields. It publishes hundreds of papers daily and is fast approaching its millionth submission. Anyone can send in a paper, and submissions don’t get full peer review. However, the papers do go through a quality-control process. The final check is a computer program that compares the paper's text with the text of every other paper already published on arXiv. The goal is to flag papers that have a high likelihood of having plagiarized published work.
"Text overlap" is the technical term, and sometimes it turns out to be innocent. For example, a review article might quote generously from a paper the author cites, or the author might recycle and slightly update sentences from their own previous work. The arXiv plagiarism detector gives such papers a pass. "It's a fairly sophisticated machine learning logistic classifier," says arXiv founder Paul Ginsparg, a physicist at Cornell University. "It has special ways of detecting block quotes, italicized text, text in quotation marks, as well as statements of mathematical theorems, to avoid false positives."
Only when there is no obvious reason for an author to have copied significant chunks of text from already published work, particularly if that previous work is not cited and has no overlap in authorship, does the software affix a “flag” to the article, including links to the papers from which it has text overlap. That standard “is much more lenient” than those used by most scientific journals, Ginsparg says.
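The flagging rule described above can be illustrated with a minimal sketch. This is not arXiv's classifier, which Ginsparg describes as a machine learning logistic classifier. It is a simplified stand-in that measures shared word sequences and treats a citation or a shared author as a full excuse. The function names, the 7-word window, and the 20% threshold are all assumptions chosen for illustration.

```python
import re


def word_ngrams(text, n=7):
    """Return the set of overlapping n-word sequences in a text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    # If the text has fewer than n words, the range is empty and so is the set.
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}


def overlap_fraction(new_text, old_text, n=7):
    """Fraction of the new paper's n-grams that also occur in the old paper."""
    new_grams = word_ngrams(new_text, n)
    if not new_grams:
        return 0.0
    shared = new_grams & word_ngrams(old_text, n)
    return len(shared) / len(new_grams)


def should_flag(new_text, old_text, cites_old, shares_author,
                threshold=0.2, n=7):
    """Hypothetical rule: flag heavy overlap that has no obvious excuse.

    Assumption: citing the earlier paper or sharing an author with it
    always excuses the overlap. The real system is more nuanced.
    """
    if cites_old or shares_author:
        return False
    return overlap_fraction(new_text, old_text, n) >= threshold
```

In this sketch a paper that reuses most of an earlier, uncited paper by unrelated authors would be flagged, while the same text in a self-recycled or properly cited paper would pass.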
To explore some of the consequences of "text reuse," Ginsparg and Cornell physics Ph.D. student Daniel Citron compared the text from each of the 757,000 articles submitted to arXiv between 1991 and 2012. The headline from that study, published Monday in the Proceedings of the National Academy of Sciences (PNAS), is that the more text a paper poaches from already published work, the less frequently that paper tends to be cited. (The full paper is also available for free on arXiv.) It also found that text reuse is surprisingly common. After filtering out review articles and legitimate quoting, about one in 16 arXiv authors were found to have copied long phrases and sentences from their own previously published work that add up to about the same amount of text as this entire article. More worryingly, about one out of every 1000 of the submitting authors copied the equivalent of a paragraph's worth of text from other people's papers without citing them.
So where in the world is all this text reuse happening? Conspicuously missing from the PNAS paper is a global map of potential plagiarism. Whenever an author submits a paper to arXiv, the author declares his or her country of residence. So it should be possible to reveal which countries have the highest proportion of plagiarists. The reason no map was included, Ginsparg told ScienceInsider, is that not all the text overlap detected in their study is necessarily plagiarism.
Ginsparg did agree, however, to share arXiv’s flagging data with ScienceInsider. Since 1 August 2011, when arXiv began systematically flagging for text overlap, 106,262 authors from 151 nations have submitted a total of 301,759 articles. (Each paper can have many more co-authors.) Overall, 3.2% (9591) of the papers were flagged. It's not just papers submitted en masse by a few bad apples, either. Those flagged papers came from 6% (6737) of the submitting authors. Put another way, one out of every 16 researchers who have submitted a paper to arXiv since August 2011 has been flagged by the plagiarism detector at least once.
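The percentages above follow directly from the raw counts. A short check, using only the figures quoted in this article (the variable names are illustrative):

```python
# Counts quoted in the article for submissions since 1 August 2011.
papers_total = 301_759
papers_flagged = 9_591
authors_total = 106_262
authors_flagged = 6_737

paper_rate = papers_flagged / papers_total      # share of papers flagged
author_rate = authors_flagged / authors_total   # share of authors flagged
one_in = authors_total / authors_flagged        # "one out of every N" authors

print(f"Papers flagged:  {paper_rate:.1%}")
print(f"Authors flagged: {author_rate:.1%}")
print(f"Roughly one in {one_in:.0f} authors")
```

The author share comes out slightly above 6%, which is why it rounds to one in 16 rather than one in 17.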
The map above, prepared by ScienceInsider, takes a conservative approach. It shows only the incidence of flagged authors for the 57 nations with at least 100 submitted papers, to minimize distortion from small sample sizes. (In Ethiopia, for example, there are only three submitting authors and two of them have been flagged.)
Researchers from countries that submit the lion's share of arXiv papers—the United States, Canada, and a small number of industrialized countries in Europe and Asia—tend to plagiarize less often than researchers elsewhere. For example, more than 20% (38 of 186) of authors who submitted papers from Bulgaria were flagged, more than eight times the proportion from New Zealand (five of 207). In Japan, about 6% (269 of 4759) of submitting authors were flagged, compared with over 15% (164 out of 1054) from Iran.
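The country comparisons can be reproduced from the counts quoted in the article. The sketch below also shows why small samples are left off the map. One assumption to note: ScienceInsider's cutoff is 100 submitted papers per nation, but per-country paper counts are not given here, so a hypothetical cutoff of 100 submitting authors stands in for it.

```python
# (flagged authors, submitting authors) as quoted in the article.
counts = {
    "Bulgaria": (38, 186),
    "New Zealand": (5, 207),
    "Japan": (269, 4759),
    "Iran": (164, 1054),
    "Ethiopia": (2, 3),
}

# Hypothetical stand-in for the article's 100-paper cutoff (an assumption).
MIN_AUTHORS = 100

rates = {country: flagged / total
         for country, (flagged, total) in counts.items()}

# Drop countries whose sample is too small to give a meaningful rate.
kept = {country: rate for country, rate in rates.items()
        if counts[country][1] >= MIN_AUTHORS}

for country, rate in sorted(kept.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{country}: {rate:.1%}")

ratio = rates["Bulgaria"] / rates["New Zealand"]
print(f"Bulgaria vs. New Zealand: {ratio:.1f}x")
```

Without the cutoff, Ethiopia's two flagged authors out of three would top the list at about 67%, which says more about sample size than about the country.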
Such disparities may be due in part to different academic cultures, Ginsparg and Citron say in their PNAS study. They chalk up scientific plagiarism to "differences in academic infrastructure and mentoring, or incentives that emphasize quantity of publication over quality."
*Correction, 11 December, 4:57 p.m.:  The map has been corrected to reflect current national boundaries.
