en.wikipedia.org

Wikipedia:Bots/Requests for approval/WikiCleanerBot 18 - Wikipedia

The following discussion is an archived debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA. The result of the discussion was  Approved.

Operator: NicoV (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)

Time filed: 13:40, Friday, June 12, 2020 (UTC)

Function overview: Fix some nowiki tags after internal links (cf. Wikipedia:CHECKWIKI/WPC 553 dump).

Automatic, Supervised, or Manual: Automatic

Programming language(s): Java (WPCleaner)

Source code available: On GitHub (especially algorithm 553)

Links to relevant discussions (where appropriate):

Edit period(s): Twice a month

Estimated number of pages affected: About 10k pages found during the dump analysis, not all can be fixed automatically, so a few thousand edits.

Namespace(s): Main

Exclusion compliant (Yes/No): Yes

Function details: Tools like VE or CX tend to create internal links with incorrect formatting (the hyperlink is not covering all the letters), because the user doesn't always select exactly on what the link should apply. Part of such errors could be fixed automatically (see for example what my bot did on frwiki for several thousand articles). Examples of situations where the bot can automatically fix the internal link:

  • ’Ori tahiti, [[Eugène Caillot|Eugène Caillo]]<nowiki/>t replaced by [[Eugène Caillot]]: displayed text is the same as the target of the link
  • Şabran (raion), [[forêt]]<nowiki/>s replaced by [[forêt]]s: "s" is configured on frwiki as a possible extension (plural). Configuration for enwiki will also include "s", I will see with what is left after a first pass if other extensions can be added.
  • Œdipe et le Sphinx, [[Jean-Auguste-Dominique Ingres|Ingres]]<nowiki/> replaced by [[Jean-Auguste-Dominique Ingres|Ingres]] : whitespace after the nowiki makes it useless.
  • İbrahim Tatlıses, [[Divorce|Divorcé]]<nowiki/>s replaced by [[Divorce|Divorcés]]: "s" is configured on frwiki as a possible extension (plural).

After the first run on frwiki, I'm adding some other automatic fixing abilities to the bot:

  • Albert Rhys Williams, [[Marietta (Ohio)|Mariett]]<nowiki/>a replaced by [[Marietta (Ohio)|Marietta]]: displayed text is the same as the target of the link minus the text after the opening parenthesis
  • Amarok (mythologie), [[Black metal|Black Meta]]<nowiki/>l replaced by [[Black metal|Black Metal]]: displayed text is the same as the target of the link, regardless of uppercase/lowercase

Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Primefac (talk) 23:55, 15 June 2020 (UTC)[reply]

Thanks Primefac. Trial complete. I've done the 50 edits, and bot behaved as expected. --NicoV (Talk on frwiki) 18:46, 16 June 2020 (UTC)[reply]
I looked through all 50 test edits, and they all looked fine to me. In diff 1, I would have changed the link to "Wake Forest's" (I think this is the expected format on en.WP, although I can't find the guideline at the moment; I don't think you'll get any complaints), but the bot's "Wake Forest's" is acceptable. — Preceding unsigned comment added by NicoV (talkcontribs) 06:00, 17 June 2020 (UTC)[reply]
 Approved. Primefac (talk) 17:12, 19 June 2020 (UTC)[reply]
The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.