Benutzer:Stefan Kühn/Check Wikipedia
The WikiProject Check Wikipedia will help to clean the syntax of Wikipedia and to find some other errors. Here you can discuss new features.
News
[Bearbeiten | Quelltext bearbeiten]- 2013-12-01 - New page at tools.wmflabs.org (now here)
- 2013-01-27 – Some bugfixes, add alswiki, svwiktionary, enwiktionary
- 2012-11-10 – Many changes an checkwiki.cgi (faster, faster, …), new startpage
- 2012-09-21 – Start with elwiki
- 2012-03-28 – Code at GitHub
- 2010-05-01 – New page for "next dumpscan" in the interface
- 2010-04-29 – Start scan of svwikisource
- 2010-04-29 – rewrite the script for updating the interface, not so much deadlocks
- 2010-04-28 – Fix the setting of priorities in every project
- 2010-02-19 – From now the new article and the last change articles will be insert into database.
- 2010-02-07 – Fix CGI-Script
- 2010-01-29 – Write a new script for output of wikipage. I need this for a rebuild of the mainscript.
- 2010-01-29 – Now the statistic data will collect in a database. Later I build a nice output.
- 2010-01-22 – Interface now with translations. Please update your Translation-Page into XHTML. In the future I will not support the Wikipage-Output. The only way is the new interface. In the next days I will insert the statistic feature in the new interface.
- 2010-01-20 – Change the startpage. Now faster, but update only every 15 minutes.
- 2009-10-12 – Please vote!
- 2009-09-01 – New interface – 0.1 Alpha
- 2009-06-20 – I have start the reprogramming of the script. After some hours I had a new version of the script. It is really faster. The pdcwiki was scanned live with the old script 1 minutes. The new script need only 0.18 seconds. In second test with dewiki I have a reducing of 45 minutes. At Sunday I scan all languages of the project an this need only 13 hours. This is really fast. With the old script I need more then 30 hours. But there is a new problem. In some chase the title of the article will don't fit to the error. (For examlpe here). I will fix this tonight.
- 2009-06-14 – I have found a way to reduce the CPU-usage at the Toolserver (Thanks to Kalan). The script will be faster check the live Wikipedia with a better request to the API. This need a big change in the script. I will redesign the script in the next weeks. I have no idea, how much time I need. But after this relaunch the script will really faster. - At this weekend I have not the time to do this, so I reduce the CPU-usage now only with 3 points. First: The every language will start in a alphabetical row at 0:00 UTM (af, ar, cs, cy, de, en, …). So the start time is not fix. Second: All language together need more then 24 hours to scan. So the script start only Monday, Wednesday, Friday. Third: No dump scan in the next weeks. – Sorry for this interruption, but it need time.
- 2009-06-11 – I have stopped all scripts for all languages. The CPU-usage at the Toolserver was too high. I will try to solve the problem at the weekend.
- 2009-05-28 – Start the international project-page in yi
- 2009-05-09 – Big problems at Toolserver with my script. Reason: The new very fast creation of dumps. Every 4-5 days we get so a new dump for every language. In the past it was every 30 days. I stop my cronjobs and will work at this problem. Sorry of the interruption.
- 2009-04-26 – Start the international project-page in eo, id and sk
- 2009-04-22 – Start the international project-page in uk
- 2009-04-12 – Start the international project-page in hu and zh
- 2009-03-16 – Start the international project-page in ar and tr
- 2008-11-16 – Start the international project-page in pdc
- 2008-10-17 – There are big problems at the Toolserver after a power failure.
- 2008-10-04 – Start the international project-page in gd
- 2008-09-24 – Problem with scan. Limitation were set on 20 errors. I start a new scan at 05:20 UTC.
- 2008-09-17 – Start the international project-page in is, fy, ro
- 2008-09-16 – Start the international project-page in cy, no
- 2008-09-14 – Start the international project-page in af, ca, fi, he, la
- 2008-09-13 – Start the international project-page in commons, ja
- 2008-09-07 – Start the international project-page in cs, da, es, it, nds-nl, pl, pt, sv
- 2008-09-06 – Start the international project-page in nl, en, de, nds, fr, nl, ru
Project pages
[Bearbeiten | Quelltext bearbeiten]All languages
[Bearbeiten | Quelltext bearbeiten]- (and more languages to follow)
Translation of a project page
[Bearbeiten | Quelltext bearbeiten]Under ~sk/checkwiki on the Toolserver, you will find for every project a translation text (dewiki/dewiki_translation.txt). Please copy this file at your page of translation (for example in de: Wikipedia:WikiProjekt Syntaxkorrektur/Übersetzung). Then you can translate the text in that file. With the next run of the script, that page will be used for the automatic translation of the project site. If I insert a new error and the script doesn't find the translation at your page of translation, then the new error appears in English.
# Example for frwiki error_005_prio_script=1 END error_005_head_script=Comment not currect end END error_005_desc_script=Found a comment "<!--" with no "-->" end. END error_005_prio_frwiki=-1 END error_005_head_frwiki=Commentaire non fermé END error_005_desc_frwiki=Un commentaire "<!--" sans la balise "-->" a été trouvé. END
Thanks, to all translators.
The script
[Bearbeiten | Quelltext bearbeiten]Download
[Bearbeiten | Quelltext bearbeiten]- The script: checkwiki.pl (GPL)
Data
[Bearbeiten | Quelltext bearbeiten]- The data (Toolserver): ~sk/checkwiki, for example in folder dewiki:
- dewiki_translation.txt = text for translation
- dewiki_output_for_wikipedia.txt = text for the project page
- dewiki_error_list.txt = full list of articles with errors
Last changes at the script
[Bearbeiten | Quelltext bearbeiten]- 2009-05-25 – Split error 7 in error 7 and 83
- 2009-05-23 – New error 82
- 2009-05-21 – New error 81 and table sortable
- 2009-05-07 – New error 79, 80
- 2009-04-12 – Split error 69 (ISBN-Check) in error 69-73
- 2009-04-12 – New error 74, 75, 76
- 2009-04-12 – New error 77, 78
- 2009-04-09 – New error 69 (ISBN-Check).
- 2009-04-05 – New error 68.
- 2009-03-16 – Big change inside the script: Scan is now very fast.
- 2009-03-16 – 9 new errors in the last days, many little changes
- 2009-02-22 – New error (047): Template not correct begin
- 2009-02-22 – New error (046): Square brackets not correct begin
- 2009-02-22 – Fix namespaces, fix code of error 010 and 030
- 2009-02-18 – New error (045): Interwiki double
- 2009-02-18 – New error (044): Headlines with bold
- 2009-02-15 – Merge Templatetiger with Check Wikipedia
- 2009-02-07 – Many changes (better detection of image, categories; now with namespacealias, …)
- 2009-01-29 – Use "Special:RecentChanges" as resource (at the moment: only 4000 Recent Changes)
- 2008-12-28 – Fixing problem with error 037 – now "/" possible
- 2008-12-22 – Insert statistic
- 2008-12-04 – Splitting error 026 (html-elements) in 028, 038, 039, 040, 041, 042, 043
- 2008-12-04 – Fixing problem with error 034 – template programming elements – ifeq:
- 2008-11-29 – Fixing problem with error 028 – image description
- 2008-11-29 – More errors in error 030 – image description
- 2008-11-29 – Fixing problem with error 036 – redirect with ":"
- 2008-11-24 – Fixing problem with error 003 ref without references
- 2008-11-20 – Error 030: update for more images without description ( " ...|thumb]]" )
- 2008-11-20 – New error (036): Redirect not correct ("#REDIRECT = [[Target page]]")
- 2008-11-20 – New error (035): Gallery without description
- 2008-11-18 – New error (034): Template programming element
- 2008-11-16 – Fix "deactivating of error with the translation page"
- 2008-11-15 – Deactivate Error 033 "HTML text style underline"
- 2008-11-01 – Error 003 only in article namespace and include "references group"
- 2008-10-23 – Error 002 activated, only for <br\> or <br.> or <\br>; Please update your translation !
- 2008-10-13 – Error 006 (defaultsort), 030 (image description) and 028 (table end) now only in namespace 0 (article)
- 2008-10-13 – Error 003 now without "listaref" in eswiki
- 2008-10-06 – New error (032) double pipe in link ([[text|text2|text3]])
- 2008-10-04 – Fixing error error 020 (†)
- 2008-10-03 – New articles will be included in the scan
- 2008-10-02 – Exclude all pages .js and .css from scan
- 2008-10-02 – Fixing error 012 (list elements) and 026 (text style elements) – only one error per article
- 2008-10-01 – Fixing error 031 (table elements) – only one error per article
- 2008-10-01 – Defaultsort in ca with ORDENA
- 2008-10-01 – Fixing problem with error 003 ref without references
- 2008-10-01 – Fixing translation of category (category_001=)
- 2008-09-30 – Update error limit from 10000 to 40000
- 2008-09-27 – Fixing this
*[[]]
in en and commons - 2008-09-27 – Fixing he interwiki
- 2008-09-25 – Update error 26 with <font> <u>
- 2008-09-25 – Insert error 31: HTML table elements (<table> ...)
- 2008-09-25 – Fixing interwiki af, he ,is and ja
- 2008-09-24 – Better English
- 2008-09-24 – Fixing problem with priority (unkown, deactivated, top, middle, lowest)! Reason for long run time.
- 2008-09-23 – Translation finish!
- 2008-09-20 – Deactivation of "br"-error and update the number of error from 5000 to 10000, in dewiki without limit (30893 errors)
- 2008-09-19 – Fixing the DEFAULTSORT with special letters for pl and ro
- 2008-09-17 – Fixing the DEFAULTSORT with special letters for nn, no, da and cs
- 2008-09-16 – New error: Image without a description.
- 2008-09-14 – New error: HTML elements <b>, <i> and <p> will be detected
- 2008-09-14 – Fixing the problem with DEFAULTSORT without the {{ }}
- 2008-09-14 – Fixing the problem with very long DEFAULTSORT
- 2008-09-13 – New error: check hierarchy of headlines (nr. 25)
- 2008-09-13 – Fixing the HTML-problem with <ol start=19>
- 2008-09-13 – Fixing the DEFAULTSORT with special letters for sv and fr
- 2008-09-13 – Fixing the problem with double :, like sv:Kategori:USA:s presidenter in sv
- 2008-09-13 – Fixing of the pre problem.
- 2008-09-12 – Fixing of the nowiki problem. Now it will correct work and will be ignored by that check.
- 2008-09-12 – Fixing of the source problem. cs:ALTER will now not have a error.
- 2008-09-12 – Create page for Check Wikipedia with last changes, new and discussion.
- 2008-09-10 – Fixing of the source problem.
Tools
[Bearbeiten | Quelltext bearbeiten]Some tools can be used to help fixing errors detected by Check Wikipedia script. Also see the list of gadgets and user scripts (German).
Tool | Wikis | Detected errors | Bot capabilities | Description |
---|---|---|---|---|
WikiSyntaxTextMod | All | Description | No | Polishes and corrects wiki syntax automatically while editing. |
Auto-Formatter | All | List | No | Adds an Auto-Format button to the wiki editor toolbar. |
WPCleaner | All | List | Yes | WPCleaner is a tool designed to help with various maintenance tasks. It's written in Java. |
AutoWikiBrowser | All | List | Yes | AutoWikiBrowser is a semi-automated editor designed to make tedious repetitive tasks quicker and easier. It's written in C#. |
AutoEd | Description | No |
FAQ
[Bearbeiten | Quelltext bearbeiten]HTML into XHTML or Wikisyntax
[Bearbeiten | Quelltext bearbeiten]See Wikipedia:WikiProjekt HTML5.
Not correct | Correct |
---|---|
<b>testo</b> | '''testo''' |
<i>testo</i> | ''testo'' |
<u>testo</u> | <span style="text-decoration:underline;">testo</span> |
<strike>testo</strike> | <s>testo</s> |
<tt>testo</tt> | <code>testo</code> |
<big>testo</big> | <span style="font-size:larger;">testo</span> |
<b><big>testo</big></b> | <span style="font-size:larger;">'''testo'''</span> |
<center>testo</center> | <div class="center">testo</div> |
<p align="center">testo</p> | <div class="center">testo</div> |
<font color="#224466">testo</font> | <span style="color:#224466;">testo</span> |
<font style="text-decoration:overline">testo</font> | <span style="text-decoration:overline;">testo</span> |
Next features
[Bearbeiten | Quelltext bearbeiten]To-do-list
[Bearbeiten | Quelltext bearbeiten]- change errors
- error 3 – allowed <references></references>
- error 37 exclude chinese and japanese characters (here) like en:囝
- error 37 template:lifetime (here) also it:Template:Bio
- errors 10, 46, 43 and 47 with section where this error is
- for error 60 output with linenumber
- error 36 with tabulator and
line break - error 54 – after line break math, code …
- error 63 – not in user signatuers
- error 69 – no detect "ISBN-10:", "ISBN-13:", "(ISBN-10)", "(ISBN-13)" most before or after a ISBN
- eowiki error 6 and 37: "DEFAŬLTORDIGO" ĉ, ĝ, ĥ, ĵ, ŝ, ŭ and also Ĉ, Ĝ, Ĥ, Ĵ, Ŝ, Ŭ
- error 84 – only pre-Block de:Dekker-Algorithmus
- error 30 – only 60px or bigger images
- new error
- endless tag like <poem> or <ref>
- Plainlinks in article namespace <span class="plainlinks">(here)
- thumbs with forced size: [[File:Foo.jpg|thumb|250px|Foo]]
- Pipe in external link [http:/www.wikipedia.org|Wikipedia]
- Definition ; '''name''' : definition no bold!
- detection for page titles that contains characters outside Basic Multilingual Plane of Unicode; "𩺊" (U+29E8A), which was once redirected to ja:アラ, is prohibited for use in page titles in Japanese Wikipedia because it is outside BMP (from U+0000 to U+FFFF).
- detect excessive boldface text, more then 10 pairs of '''
- DEFAULTSORTs which have a sortkey identical to the page's name (these can be removed).
- Pages where every category has an identical sortkey but no DEFAULTSORT (the individual sortkeys can be removed and a DEFAULTSORT should be added).
- ºC and °C, Wikipedia:Bots/Anfragen/Archiv/2009-1#Nummernzeichen statt Gradzeichen
- double defaultsort in one article
- references with name and not double; Example here
- category with space in front or behinde
"category: test "
- new interface
- 10 or 25 errors set as done from one page.
- other
problem with "+" for example [[GTK+|GIMP-Toolkit]]- translation phrase "This error was found *** times".
- translation phrase "This output was limited to *** article."
- translation phrase for statistic
Your wish-list
[Bearbeiten | Quelltext bearbeiten]Please write your wish in English or German at the site: Benutzer Diskussion:Stefan Kühn/Check Wikipedia. Thanks! -- sk 22:11, 12. Sep. 2008 (CEST)