Benutzer Diskussion:Stefan Kühn/Check Wikipedia/Archiv/2010/April
New interface for WikiCleaner
Just to show the current look of the window for working on errors reported by Check Wikipedia:
I have also added all the Wikis recognized by Check Wikipedia. --NicoV 15:33, 11. Apr. 2010 (CEST)
- I have just found that the Check Wiki interface is available only if the Experimental functions are available (in the Options). I am releasing a new version without this constraint. --NicoV 19:04, 19. Apr. 2010 (CEST)
HTTP error when db error ?
Hi, from time to time, URLs like http://toolserver.org/~sk/cgi-bin/checkwiki/checkwiki.cgi?project=frwiki&view=bots&id=46&offset=0&limit=25 show an error instead of the list of pages (for example, when the number of connections to the database allowed to you is reached). But the error is just HTML text : would it be possible to return an HTTP error instead ? It would be easier for bots to detect a problem with the request and eventually submit it again a little later. --NicoV 18:48, 1. Apr. 2010 (CEST)
Hi again, is there a possibility to speed up requests like http://toolserver.org/~sk/cgi-bin/checkwiki/checkwiki.cgi?id=48&limit=100&offset=0&project=frwiki&view=bots, for example by not sorting the results ? --NicoV 11:15, 3. Apr. 2010 (CEST)
- Hi again, is it also possible to have an URL for the "done" button that doesn't retrieve the updated list of pages? For a bot, it's not useful and it seems to take a lot of resources on the toolserver. --NicoV 14:10, 24. Apr. 2010 (CEST)
Unable to retrieve pageid if page has been deleted
Hi. I'm using the MediaWiki API to interact with Wikipédia from my tool. Most of the time, I can get the "pageid" with the API but it doesn't work when the page has been deleted. I would like to automaticall simulate a call to your "done" button when I detect that a page has been deleted, but I can't do it because I can't retrieve the pageid through the API. I asked on the mailing list for the API, and Platonides suggested me to ask if you could add a way to remove an error using the pagetitle instead of the pageid. Thanks. --NicoV 01:02, 5. Apr. 2010 (CEST)
- You can just check if the pageid is there or not. This information is already in the response you receive from the API. --Superyetkin 01:32, 6. Apr. 2010 (CEST)
- My problem is not to know if the page still exists on Wikipedia (I'm using the
missing
attribute for that), but, once I have found that a page has been deleted, I want to notify that the page has been done. So, currently, I need the pageid in this situation. --NicoV 12:04, 6. Apr. 2010 (CEST)
- My problem is not to know if the page still exists on Wikipedia (I'm using the
An other option could be to have an other list of pages for bots returning only pageids and not pagetitles : the pageid is enough to retrieve the page informations (title, existing, ...). Would it be possible to add this other list ? --NicoV 14:05, 24. Apr. 2010 (CEST)
- Having a list of pageids would also enable bots to deal properly with pages that have been renamed since the detection: the page has its title modified but keeps it pageid, so bots would have the id of the page where the problem was detected (and not the old name which is now a redirect). --NicoV 17:31, 25. Apr. 2010 (CEST)
Arrows to sort priority pages
Is it possible to add arrows so we can sort tables by error ID or number of errors (To-do) on the priority pages please? – A2 05:15, 6. Apr. 2010 (CEST)
- Is one my to-do-list. :-) -- sk 09:14, 29. Apr. 2010 (CEST)
Deleted pages
Deleted pages in yi wiki are being flagged for errors. Please do not scan deleted pages. --Redaktor0 00:54, 23. Apr. 2010 (CEST)
- When the script scan an article and it found an error, then the script put this article in the database. After some days or if someone set this article as done, the script will scan this article again. If then this article is deleted the script will delete this in the database. The script can not scan every day all articles from the database. So if you find an deleted page, then set this as done in the interface, after that the script scan this in the next run. -- sk 09:05, 29. Apr. 2010 (CEST)
Reference list missing
In yi wiki, the template {{רעפליסטע}} corresponds to {{reflist}} in en; it automatically includes <references/>. --Redaktor0 01:07, 23. Apr. 2010 (CEST)
- , I have insert this template in the script. -- Oksk 09:01, 29. Apr. 2010 (CEST)
Edit button not working with & in titles
The edit button for Flesh & Stone article gives “http://fr.wikipedia.org/w/index.php?title=Flesh & Stone&action=edit” (that open “flesh” disambiguation). It should link to http://fr.wikipedia.org/w/index.php?title=Flesh_%26_Stone&action=edit instead. A2 12:55, 5. Apr. 2010 (CEST)
- Same with a Ż (%C5%BB) and ł (%C5%82) in an another article. A2 06:17, 1. Mai 2010 (CEST)
Swedish Wikisource
We'd like this checker to work for sv.wikisource.org, please. --LA2 07:49, 15. Apr. 2010 (CEST)
- Hello LA2, I will test this next week, when I have holiday. -- sk 06:32, 16. Apr. 2010 (CEST)
- Sounds fine to us. It would also be of help if you included ns:Författare (Author) and ns:Sida (Page) into this - but start with what is the most comfortable for You. -- Lavallen 10:59, 16. Apr. 2010 (CEST)
- How is it going? --LA2 15:12, 23. Apr. 2010 (CEST)
- Until now I don't have work at this. But in the next days I start with this. -- sk 09:14, 29. Apr. 2010 (CEST)
- , I have added this project. -- Oksk 11:22, 29. Apr. 2010 (CEST)
Excellent, thanks! Would it be possible to exclude pages with raw OCR? These contain <pagequality level="1" user="" />, e.g. sv:src:Sida:Bref och skrifvelser af och till Carl von Linné (1910).djvu/375 --LA2 16:51, 1. Mai 2010 (CEST)
- We have some texts (newspaper articles from the 1830s) where a paragraph actually begins with a single equal sign (=), e.g. sv:src:Sida:Post- och Inrikes Tidningar 1836-01-04.djvu/4#Notifikationer.. This is not a broken headline, and it does not mean that the previous headline contained an empty section. Should we have to include these in <nowiki> tags, or could your checks make an exception? --LA2 19:35, 1. Mai 2010 (CEST)
ID 6 on svwikisource
- Defaultsort with Swedish letters åäöÅÄÖ should be accepted! -- Lavallen 20:43, 1. Mai 2010 (CEST)
The letters Å, Ä and Ö (together with å,ä,ö) are members of the swedish alphabet and should be considered as accepted in "Defaultsort:/Standardsortering:". -- Lavallen 11:37, 7. Mai 2010 (CEST)
False positive on frwiki error 37
We have on frwiki, error 37 (missing defaultsort), 114 articles about japanese/chinese alphabet characters (kanjis/sinogrammes) that are detected by the script since about a year but we can't add a correct defaultsort for them. I had a regexp like
- (?<!\|\ )\[\[\:[\p{IsCJKSymbolsandPunctuation}\p{IsCJKUnifiedIdeographs}](|\ \(sinogramme\)|\ \(kanji\))\]\](,\ |}}
to remove them on the wiki output (before). Can these detections (kind of false positives) be removed on the toolserver interface? A2 08:37, 1. Apr. 2010 (CEST)
- This request has been treated by NicoV's Wikicleaner with whitelists for false positives. — A2 18:13, 1. Sep. 2010 (CEST)