Duplication Detector

Bearbeiten
 
Recommendations to Avoid Plagiarism on Wikinews

Earwig's Copyvio Detector

Bearbeiten
  • Earwig's Copyvio Detector at tools.wmflabs.org: This tool attempts to detect copyright violations in articles. Simply give the title of the page or ID of the revision you want to check and hit Submit. The tool will search for similar content elsewhere on the web using Yahoo! BOSS and then display a report if a match is found. If you give a URL, it will skip the search engine step and directly display a report comparing the article to that particular webpage, like the Duplication Detector. Specific websites can be excluded from the check (for example, if their content is in the public domain) by being added to the excluded URL list.
  • en:User:EarwigBot/Copyvios/FAQ How? Basically, it searches for chunks of text in the whole web via Yahoo! search (using a Yahoo! BOSS API key paid by the WMF).
  • by Ben Kurtovic (en:User:The Earwig)
  • en:User:Ahecht/Scripts/CV
  • en:User:MusikAnimal/MoreMenu "Page" tab -> Tools -> Copyright vio detector

EranBot/ Turnitin

Bearbeiten
 
Eranbot
  • en:User:EranBot: This is a copy and paste detection bot based on the multi year efforts here WP:Turnitin. (...) All recent edits (currently only medical articles) are scanned, and if it adds enough new text (that wasn't there in the previous and previous-previous revisions) we send it to ithenticate for copyright violation detection. Reverts and edits that were already reverted are skipped. Edits with similar text to external sources are consider as possible copyright violations, and are reported in the report page. (...) Currently it only runs on medical articles as this was the community that was most keen for its development. It is currently in a pilot phase to determine how accurate the results are and what can be done to improve this accuracy further. The results are being placed here WP:MED/Copyright and the bot runs 8 times per day.
  • en:Wikipedia:Turnitin: "Turnitin, a plagiarism detection company, is interested in checking all of English Wikipedia through its algorithms and content database for copyright violations, for free, on an informal, non-exclusive basis. In turn they would like to receive attribution on the off-Wikipedia pages where Turnitin reports are located and have the ability to publicly say that they've collaborated with Wikipedia. Turnitin is willing to adapt their software specifically for Wikipedia, to test its efficacy in a pilot program, and to provide access to their servers on an ongoing basis. The community must discuss this idea and ultimately decide whether or not to pursue it. Please see the Request for Comment to decide if we can run a trial of Turnitin's software."
Example report 20:53, 19 August 2014 (UTC)
Title Diff Editor Source
Cardiology Versionsvergleich I dream of horses
  • en:Wikipedia:Wikipedia_Signpost/2014-09-03/Op-ed:"Automated copy-and-paste detection under trial" By James Heilman: "I'd been trying to find someone to develop such a tool for more than two years; then, at Wikimania in London, I managed to corner a pywikibot programmer, ValHallASW, and convinced him to do a little work. This was followed by meeting a wonderful Israeli instructor from the Sackler School of Medicine Shani Evenstein who knew two incredibly able programmers, User:Eran and User:Ravid ziv. By the end of Wikimania our impromptu team had produced a basic bot – User:EranBot – that does what I'd envisioned. It works by taking all edits over a certain size and running them through Turnitin / iThenticate. Edits that come back positive are listed for human follow-up. Development of this idea began back in March of 2012 by User:Ocaasi and can be seen here."

Jeblad/Gadget-copyvio-check (noWP)

Bearbeiten
 
icon with automatic search link

   

Tineye @commons

Bearbeiten
 
Gadget-tab (ohne Tineye gadget)

GoogleImages @commons

Bearbeiten
 
Gadget-tab (ohne GoogleImages gadget)
 
Google reverse image search

See also

Bearbeiten
  • Elog.io makes it easy to search for information about photographs you see while browsing. Authors, titles and licensing information is available for any photograph part of the Elog.io catalog (currently 22 million images from Wikimedia Commons). (...) The matching algorithm that powers Elog.io is based on Blockhash, a free and open source software implementation of an photograph matching algorithm. (...) For any photograph that's part of Elog.io, you can choose to copy the photo as an HTML fragment that includes the attribution. Perfect for pasting into LibreOffice, Word, Wordpress, or any other application that can manage HTML. For those that don't, we include the option to copy just the attribution string as text as well. Open API: Elog.io provides an open API for developers to make use of.
  • OpenAttribute firefox addon (Nathan R. Yergler) Displays license and attribution information for Creative Commons licensed content. (cf. MozCC)