User:EranBot: Difference between revisions
mNo edit summary |
No edit summary |
||
Line 16: | Line 16: | ||
==How it works== |
==How it works== |
||
#All recent edits to En Wikipedia over a certain size are scanned (that wasn't there in the previous and previous-previous revisions). The text is sent to |
#All recent edits to En Wikipedia over a certain size are scanned (that wasn't there in the previous and previous-previous revisions). The text is sent to plagiarism detection service [[iThenticate]]. |
||
#Edits with similar text to external sources are consider as possible copyright violations, and are reported in the [[User:EranBot/Copyright|medical report page]] and [[User:EranBot/Copyright/rc|Wikipedia wide report page]]. |
#Edits with similar text to external sources are consider as possible copyright violations, and are reported in the [[User:EranBot/Copyright|medical report page]] and [[User:EranBot/Copyright/rc|Wikipedia wide report page]]. |
||
##If the external source is mirror of Wikipedia, it is either removed by |
##If the external source is mirror of Wikipedia, it is either removed by iThenticate itself or afterwards by the bot (based on [[User:EranBot/Copyright/Blacklist|blacklist]]) |
||
##If the source is broken link the bot removes it |
##If the source is broken link the bot removes it |
||
#Each entry in the report page have the following fields: Title of the edited page, Diff with link to the relevant edit diff and page history, Editor, Source - link to report page in |
#Each entry in the report page have the following fields: Title of the edited page, Diff with link to the relevant edit diff and page history, Editor, Source - link to report page in iThenticate (titled "report") and links to possible sources of the edit (titled "compare"), Status - Should be filled manually with TP/FP. The bot adds hints for possible good edits: |
||
#*<span class="success">citation</span> - the added text mention the source. For short text it is OK (in copyright sense) and for long text it is a violation (see also [[Wikipedia:Close paraphrasing]]). |
#*<span class="success">citation</span> - the added text mention the source. For short text it is OK (in copyright sense) and for long text it is a violation (see also [[Wikipedia:Close paraphrasing]]). |
||
#*<span class="success">Mirror?</span> - the added text comes from a source that may be a possible mirror site of Wikipedia. E.g the source seems to be unknown mirror (that doesn't appear in our blacklist, but have attribution to Wikipedia). Editors can add such sites to [[User:EranBot/Copyright/Blacklist|blacklist]], so they don't appear in future. |
#*<span class="success">Mirror?</span> - the added text comes from a source that may be a possible mirror site of Wikipedia. E.g the source seems to be unknown mirror (that doesn't appear in our blacklist, but have attribution to Wikipedia). Editors can add such sites to [[User:EranBot/Copyright/Blacklist|blacklist]], so they don't appear in future. |
||
Line 34: | Line 34: | ||
==Source== |
==Source== |
||
The bot is based on pywikibot and you can find its source code in [https://github.com/valhallasw/plagiabot github]. It is possible to run the bot in other Wikipedia languages, but to run the bot you have to request account for |
The bot is based on pywikibot and you can find its source code in [https://github.com/valhallasw/plagiabot github]. It is possible to run the bot in other Wikipedia languages, but to run the bot you have to request account for iThenticate. |
||
|} |
|} |
||
----- |
----- |
Revision as of 04:19, 25 April 2016
This bot runs on Wikimedia Toolforge. Administrators: If this bot needs to be blocked due to a malfunction, please remember to disable autoblocks so that other Toolforge bots are not affected. |
This user account is a bot operated by Eran (talk). It is used to make repetitive automated or semi-automated edits that would be extremely tedious to do manually, in accordance with the bot policy. The bot is approved and currently active – the relevant request for approval can be seen here. Administrators: if this bot is malfunctioning or causing harm, please block it. |
EranBot | |
---|---|
This is a bot account owned by User:ערן (aka Eran). | |
Copy & Paste detection | |
This is a copy and paste detection bot based on the multi year efforts here WP:Turnitin. How it works
Current stateCurrently it only on all En Wikipedia. It has been a great help for medical articles. Efforts to make it more functional are ongoing. The results are being placed here WP:MED/Copyright and the bot runs 8 times per day. There is NO plan for this bot to make edits to mainspace. There may be potential for it to expand to other languages. The concept has been discussed with the WMF legal team who are happy with it. Who runs the botIt is run by the Hebrew Wikipedia User:Eran. Doc James and User:Ocaasi have been guiding its development. SourceThe bot is based on pywikibot and you can find its source code in github. It is possible to run the bot in other Wikipedia languages, but to run the bot you have to request account for iThenticate. |