December 11, 2014

Breaking news and analysis from the world of science policy : Study of massive preprint archive hints at the geography of plagiarism - ScienceInsider

New analyses of the hundreds of thousands of technical manuscripts submitted to arXiv, the repository of digital preprint articles, are offering some intriguing insights into the consequences—and geography—of scientific plagiarism. It appears that copying text from other papers is more common in some nations than others, but the outcome is generally the same for authors who copy extensively: Their papers don’t get cited much.
Since its founding in 1991, arXiv has become the world's largest venue for sharing findings in physics, math, and other mathematical fields. It publishes hundreds of papers daily and is fast approaching its millionth submission. Anyone can send in a paper, and submissions don’t get full peer review. However, the papers do go through a quality-control process. The final check is a computer program that compares the paper's text with the text of every other paper already published on arXiv. The goal is to flag papers that have a high likelihood of having plagiarized published work.
"Text overlap" is the technical term, and sometimes it turns out to be innocent. For example, a review article might quote generously from a paper the author cites, or the author might recycle and slightly update sentences from their own previous work. The arXiv plagiarism detector gives such papers a pass. "It's a fairly sophisticated machine learning logistic classifier," says arXiv founder Paul Ginsparg, a physicist at Cornell University. "It has special ways of detecting block quotes, italicized text, text in quotation marks, as well statements of mathematical theorems, to avoid false positives."
Only when there is no obvious reason for an author to have copied significant chunks of text from already published work—particularly if that previous work is not cited and has no overlap in authorship—does the software affix a “flag” to the article, including links to the papers from which it has text overlap. That standard “is much more lenient" than those used by most scientific journals, Ginsparg says.
To explore some of the consequences of "text reuse," Ginsparg and Cornell physics Ph.D. student Daniel Citron compared the text from each of the 757,000 articles submitted to arXiv between 1991 and 2012. The headline from that study, published Monday in the Proceedings of the National Academy of Sciences (PNAS) is that the more text a paper poaches from already published work, the less frequently that paper tends to be cited. (The full paper is also available for free on arXiv.) It also found that text reuse is surprisingly common. After filtering out review articles and legitimate quoting, about one in 16 arXiv authors were found to have copied long phrases and sentences from their own previously published work that add up to about the same amount of text as this entire article. More worryingly, about one out of every 1000 of the submitting authors copied the equivalent of a paragraph's worth of text from other people's papers without citing them.
So where in the world is all this text reuse happening? Conspicuously missing from the PNAS paper is a global map of potential plagiarism. Whenever an author submits a paper to arXiv, the author declares his or her country of residence. So it should be possible to reveal which countries have the highest proportion of plagiarists. The reason no map was included, Ginsparg told ScienceInsider, is that all the text overlap detected in their study is not necessarily plagiarism.
Ginsparg did agree, however, to share arXiv’s flagging data with ScienceInsider. Since 1 August 2011, when arXiv began systematically flagging for text overlap, 106,262 authors from 151 nations have submitted a total of 301,759 articles. (Each paper can have many more co-authors.) Overall, 3.2% (9591) of the papers were flagged. It's not just papers submitted en masse by a few bad apples, either. Those flagged papers came from 6% (6737) of the submitting authors. Put another way, one out of every 16 researchers who have submitted a paper to arXiv since August 2011 has been flagged by the plagiarism detector at least once.
The map above, prepared by ScienceInsider, takes a conservative approach. It shows only the incidence of flagged authors for the 57 nations with at least 100 submitted papers, to minimize distortion from small sample sizes. (In Ethiopia, for example, there are only three submitting authors and two of them have been flagged.)
Researchers from countries that submit the lion's share of arXiv papers—the United States, Canada, and a small number of industrialized countries in Europe and Asia—tend to plagiarize less often than researchers elsewhere. For example, more than 20% (38 of 186) of authors who submitted papers from Bulgaria were flagged, more than eight times the proportion from New Zealand (five of 207). In Japan, about 6% (269 of 4759) of submitting authors were flagged, compared with over 15% (164 out of 1054) from Iran.
Such disparities may be due in part to different academic cultures, Ginsparg and Citron say in their PNAS study. They chalk up scientific plagiarism to "differences in academic infrastructure and mentoring, or incentives that emphasize quantity of publication over quality."
*Correction, 11 December, 4:57 p.m.:  The map has been corrected to reflect current national boundaries.

October 25, 2014

intihal - Plagiarism in Turkey - Copy, Shake & Paste

Debora Weber-Wulff
I was recently invited to speak at a symposium organized by the Inter-Universities Ethics Platform and held at the Eurasian Institute of the University of Istanbul on October 17, 2014. They kindly organized two interpreters who took turns interpreting the talks given in Turkish for me, and my talk into Turkish for those who had need of it. Apparently, even in academic circles English is not a common language. I will describe the talks as far as I was able to understand them here. The conference was focused on intihal, the Turkish word for plagiarism. 
The deputy rector of the Istanbul University welcomed the 60-70 people present (more would come and go during the course of the day), noting that he himself is the editor of an international journal that tests articles submitted for plagiarism. They reject half of the articles submitted for this reason.
The first speaker was Hasan Yazıcı, a retired professor of rheumatology who sued the Turkish government in the European Court of Human Rights and won. He first described his case, which was recently decided (April 2014) and is available online. Since he was speaking to a room of people who had followed the case more or less closely, he did not go into details, but they are given in the judgement:
In 1997 Yazıcı had informed the Turkish Academy of Sciences that a book by a Turkish professor (I.D.) and the founder and former president of the Higher Education Council of Turkey (YÖK) entitled Mother's Book was basically a plagiarism of the popular US book on rearing children by Dr. Spock, Baby and Childcare. In 2000 Yazıcı  published an article about the plagiarism in the Turkish Journal of Physical Medicine and Rehabilitation and a shortened version in a Turkish daily newspaper.
In the article Yazıcı praised YÖK for establishing a committee to examine the scientific ethics of candidates for associate professorships, and proposed that YÖK start the conversation about plagiarism by asking their founder to apologize for the plagiarism in his book. In response, I.D. filed charges against Yazıcı, stating that this publication violated his personality rights. In the following six years the case wound its way back and forth through the court system, with expert witnesses who were close colleagues of I.D. stating that they found no plagiarism in the book, but that the passages in question were "anonymous" information regarding child health and care and that this was a handbook without bibliography or sources, not a scientific work. Yazıcı was found guilty of defamation because his allegations were thus untrue and fined. Yazıcı challenged the selection of experts, and the Court of Cassation kept referring the case back to the lower courts. Again and again close friends were appointed experts, found no plagiarism, and thus Yazıcı was found to be guilty.
Yazıcı finally gave up on the Turkish courts, paid the fine, but took took his case to the European Court of Human Rights, stating that his right to freedom of expression—here stating that he found the book to be a plagiarism—had been interfered with and that the Turkish courts had not properly dealt with the case. He noted that due to the plagiarism, there was outdated information on baby sleeping positions in the book that had been updated by Dr. Spock in his 1998 edition, but was not changed by I.D. The European court found in its judgement that it is indeed necessary in a democratic society for persons to be able to state value judgements, which are impossible to prove either true or false. However, there must exist a sufficient factual basis, so the court (p. 13), to support the value judgement. In this case, the court found sufficient factual basis for the allegations, and ordered the fine paid by Yazıcı to be refunded and his costs for the court cases to be reimbursed.
Yazıcı made the point in his speech that the extent of plagiarism in a country correlates strongly with a lack of freedom of speech. He sees Turkey in the same league as China on this aspect. He noted that everyone knows about plagiarism, but no one speaks about it.

In order to decrease plagiarism we have to speak about plagiarism. He stated in later discussions that it is imperative that Turkish judges understand what plagiarism is, most particularly because there is a law in Turkey now declaring that plagiarism is a crime punishable by prison, but it is still not clear what exactly constitute plagiarism. 
The second talk on "Plagiarism and Philosophy of Law" was given by Sevtap Metin. She described the Turkish legal situation, in particular the law of intellectual property. She noted that there are many sanctions for plagiarism, for example academics can be cut off from their university jobs or from funding. She also described the process for application for a professorship and noted that the committees are currently not doing their job in vetting the publications provided by the applicants. The reason for this is that if they note a suspicion of plagiarism that they cannot prove, they can be sued for defamation of character by the applicant. This discourages people from looking closely at publication lists. However, with Yazıcı recently winning his case in the EU, it must now be possible to speak freely about plagiarism. Citing Kant's categorical imperative, she feels that we must not plagiarize unless we want everyone to plagiarize. And if we tell our children not to lie, but lie ourselves, they will follow our actions and not our words. 
The third talk was by Mustafa Kıcalıoğlu, a former judge now retired from the Court of Cassation, on "Plagiarism in Turkish Law." He spoke about the problems that occur in plagiarism cases in which personality rights have to be weighed against intellectual property rights. He noted that Ernst Eduard Hirsch, a German legal expert who taught at the University of Ankara, was instrumental in drafting the Turkish Copyright Act. Kıcalıoğlu went into some detail on copyright and intellectual property, I noted in the discussion that plagiarism and violation of copyright are not the same things: there is plagiarism that does not violate copyright law and violations of copyright law that are not plagiarisms. Kıcalıoğlu also discussed another long, drawn out plagiarism case of a business management professor who plagiarized on 65 out of 500 pages in a book. He was demoted from the faculty after YÖK found that he had plagiarized, and he sued YÖK, but lost. This person is now a high government official. The discussion on this talk was quite long and emotional, as many people in the audience wanted to relate a story or call for all academic institutions to take action against plagiarism.
After a lunch and tea break I photographed this fine stature of a dervish before we got into the technical part of the symposium. Altan Gürsel of TechKnowledge, the Turkey and Middle East representatives of iParadigms (the company that markets Turnitin and iThenticate), spoke about that software. He first gave the definition of intihal from the Turkish Wikipedia, showed a few cases of cheating that made the news, and then launched into the standard Turnitin talk. He did note, however, that the reports have to be interpreted by and expert and cannot determine plagiarism, so it appears that my constant repeating of this has at least been understood by the software companies themselves, if not all of the users of such systems. He reported on some new features of Turnitin, for example that now also Excel sheets can be checked, and Google Drive and Dropbox can be used for submitting work. In answering a question, he noted that YÖK now scans all dissertations handed in to Turkish universities with iThenticate, but not those from the past. They are planning on including open access dissertations in the future in their database. 
I gave my standard talk on the "Chances and Limits of Plagiarism Software", noting that software cannot determine plagiarism, it can only indicate possible plagiarism, and that there are many false positives and false negatives. During questions a number of people were perplexed that there were so many plagiarisms documented in doctoral dissertations in Germany, since dissertations need to be original research and Germany has a reputation as having a solid academic tradition. They had only heard about the politicians being forced to resign, and wanted to know what was different in Germany that a politician would actually resign on the basis of plagiarism found in his dissertation. They wanted to know if judges in Germany understand plagiarism. I noted that indeed, they understand plagiarism much better than many universities and persons suing their universities because their doctoral degree have been rescinded. The judgements of the VG Cologne and the VG Düsseldorf are very clear and very exact in their application of law to plagiarism cases, as are the judgements in many other cases. 
After a tea break Tayfun Akgül, a professor of Electrical Engineering at the Technical University of Istanbul and the Ethics and Member Conduct Committee of the IEEE spoke on "Plagiarism in Science." Akgül is also a professional cartoonist, with a lively presentation peppered with cartoons that kept the audience laughing and caused the interpreters to apologize for not being able to translate them. He outlined the IEEE organizations and policies for dealing with scientific misconduct on the part of its members. He spoke at length about the case of Turkish physicists having to retract almost 70 papers from the preprint server arXiv. Nature reported on the case in 2007, the authors complained thereafter that they were just borrowing better English. 
Özgür Kasapçopur, the speaker of the ethics committee of the Istanbul University gave the facts and figures of the committee itself and the cases that it has looked at since it was set up in 2010. They have had 29 cases submitted to the committee, but only determined plagiarism in 3 cases. 
Nuran Yıldırım spoke about YÖK and plagiarism. She is a former prefect who was on the ethical boards of both the University of Istanbul and YÖK. The Higher Education Council was established in 1981. From 1998 plagiarism was added to the cases that are investigated there, as plagiarism is considered a crime that can incur a sanction. However, there was only a 2 year statute of limitations in place. This has been since removed, and all applications for assistant professor need to be investigated by YÖK. If they find plagiarism, they have a process to follow and if plagiarism is the final decision, the person applying for a professorship is removed from the university. However, this harsh sentence has now been changed to "more reasonable punishments", whatever that is. She noted that at small universities it is hard to have only a local hearing, as often the members of the committee to investigate a case are relatives of the accused. She had some fascinating stories, especially from the military universities, including one about a General Prof. Dr. found to have plagiarized. She also noted that people do accuse their rivals of plagiarism just to try and get them out of the way. Her final story was about someone who published a dissertation, and eventually found that all of his tables and data were being used in a paper by someone else. He informed YÖK, and the second researcher defended himself by saying that he had used the same laboratory, the lab must have confused the results and given him the results from the other person instead. YÖK then requested the lab notebooks from both parties, only the author of the dissertation could produce them. Since the journal paper author couldn't find his, he was found guilty of plagiarism.   
In the final round, İlhan İlkılıç, a professor of medical ethics at the University of Istanbul, on leave from the University of Mainz and a member of the German national ethics committee, presented a to-do list that included setting out better definitions of plagiarism and academic misconduct and finding ways of objectively looking at plagiarism without personal hostilities or ideologies getting in the way. Discussion about plagiarism is essential, even if it won't prevent plagiarism or scientific misconduct from happening.   
Sadat Murat, chairman of the Turkish national ethics committee, spoke about their work which is to investigate complaints about state servants. However, exempt from this are low-level state servants, as well as the top-ranking politicians. They only report on violations, however, they cannot sanction. They also try to disseminate ethical culture in Turkey by providing ethics training.  
I especially want to thank the interpreters for their work—any errors here are mine for not paying exact attention, they did a great job permitting me to understand a small portion of what is happening in the area of intihal in Turkey.



TURKEY (45) EDITORIAL (37) NATURE (27) Copy Shake Paste (16) Debora Weber-Wulff (16) Mustafa Saltı (15) İhsan Yılmaz (14) WITHDRAWALS (11) Oktay Aydoğdu (10) Retraction Watch (10) RETRACTION (9) arXiv (9) Ars Technica (8) Murat Korunur (8) Sezgin Aygün (8) TheScientist (7) İsmail Tarhan (7) Hüsnü Baysal (6) Romania (6) CULTURE (5) John Timmer (5) Melis Aygün (5) The Guardian (5) Times Higher Education (5) Ali Havare (4) Astrophys.Sp.Sci. (4) Can Aktaş (4) Germany (4) INSIDE HIGHER ED (4) SCIENTIFIC AMERICAN (4) Serkan Anılır (4) The Chronicle (4) A. Murat Eren (3) BBC (3) David Cyranoski (3) Diederik Stapel (3) Figen Binbay (3) Hungary (3) Irfan Açıkgöz (3) JHEP (3) METU (3) Maxine Clarke (3) The Atlantic (3) The New York Times (3) Yonah Lehrer (3) Yoshiki Sasai (3) Ahmet Yıldırım (2) Aisha Labi (2) Alison Abbott (2) Annette Schavan (2) Chin.Phys.Lett. (2) ChinaDaily (2) DW-World (2) Ginsparg Paul (2) Hürriyet Daily News (2) Jonathan M. Gitlin (2) Mathgen (2) Pakistan (2) Pal Schmitt (2) SAGE (2) Science Magazine (2) ScienceInsider (2) Self - plagiarism (2) Singapore Statement (2) THE LANCET (2) The Jakarta Post (2) The Washington Post (2) Tom Bartlett (2) Turnitin (2) AHRP (1) Adam Marcus (1) Alex Bienkowski (1) Alex Holcombe (1) Alina Mungiu-Pippidi (1) Alok Jha (1) Ars Mathematica (1) Arturo Casadevall (1) Bengü Sezen (1) Blake Stacey (1) CEW Brownbag Discussion (1) Carl Zimmer (1) Chris Leonard (1) CrossRef (1) Dan Ariely (1) Daniel Lemire (1) Daniel Mietchen (1) David Colquhoun (1) David P. Barash (1) Discover Magazin (1) EMBO Reports (1) EUREKA (1) Edward J. Eckel (1) Edyta Zielinska (1) Elsevier (1) Eric A. Powell (1) Fareed Zakaria (1) Ferric C. Fang (1) Forrest M. Mims (1) FoxNews (1) Fred Barbash (1) Freek Vermeulen (1) G. Jay Christensen (1) G. Thomas Couser (1) GRG (1) Gastroenterology and Endoscopy News (1) Gehrke Johannes (1) Geraldine S. Pearson (1) Glenn McGee (1) Haruko Obokata (1) Hilary Spencer (1) Howy Jacobs (1) ICTP (1) Infection and Immunity (1) Ioan Mang (1) Irulandy Ponniah (1) Ivan Pacheco (1) J. T. Trevors (1) Janet D. Stemwedel (1) Janet Raloff (1) Jeffrey Beall (1) Joel Achenbach (1) John Bohannon (1) Jon Marcus (1) Josh Levin (1) K.R. Sreenivasan (1) Karachi University (1) LV Guardian Express (1) Leila Meyer (1) Leonard Berlin (1) Ligia Deca (1) M. H. Saier (1) Mathieu Bouville (1) Miguel Roig (1) Milton Ruiz (1) Mo Costandi (1) Money Morning (1) Murat Civaner (1) NIH (1) Nate Eldredge (1) Nick Hubble (1) Nicola Jones (1) Nigel Hawkes (1) Ottawa Citizen (1) Paul Basken (1) Paul Wouters (1) Peer Muhammad (1) Pete Etchells (1) Peter Woit (1) Phil Baty (1) Phys.Rev.D (1) PlagiPedi (1) Pramana (1) Pál Schmitt (1) R. P. Morrison (1) Richard Gunderman (1) Richard H. McCuen (1) Richard Knox (1) Richard Van Noorden (1) Robert Wright (1) Ronald Piana (1) Scholarly Open Access (1) ScienceDaily (1) Scott Jaschik (1) Setiono Sugiharto (1) Shanghai Daily (1) Simon Harris (1) Slate (1) Sorokina Daria (1) Sujit D Rathod (1) Suzi Gage (1) Ta-Nehisi Coates (1) Taiwan (1) The Chronicle Review (1) The Daily Times (1) The Express Tribune (1) The Local (1) The k2p blog (1) TheGuardian (1) TheSunDaily (1) TheSundayTimes (1) Tia Ghose (1) Tom Spears (1) Toni Feder (1) UNESCO (1) Victor Ponta (1) Warner Simeon (1) YÖK (1) Zahir Ebrahim (1) fake conference (1) fake science (1) iThenticate (1) npr (1) (1) theJOURNAL (1) Çağla Pınar Tunçel (1)

Popular Posts