Incubator:List of Wikimedia language codes

From Wikimedia Incubator

This page is a list of language codes as they are used on Wikimedia sites, especially those for which there are known issues or problems.

Language codes on Wikimedia should follow the ISO 639 standard as closely as possible, but historically exceptions have been made, and sometimes an exception is still made:

  • A language written in more than one script:
    • should have one wiki with a conversion script
    • if that is not possible, name the wikis "xx[x]-xxxx" (ISO 639 language code + ISO 15924 script code); there are no current examples ("pa"/"pnb"?)
  • A language written in more than one orthography or standard should either:
    • find a way to co-exist on one wiki
    • name wikis with "xx[x]-x[...]" (ISO 639 language code + IETF tag), e.g. "be-tarask" or "qu-kichwa"
    • old form: "nds"/"nds-nl".
  • For the rest, only one exception should be possible: "simple".
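
The naming rules above can be approximated with a few regular expressions. The following is only a minimal sketch: the function name `classify_wiki_code` and the category labels are hypothetical, and the patterns check the shape of a code only (two or three letters, optionally followed by a four-letter script code or a five-to-eight-character variant subtag). They do not check whether the code is actually registered in ISO 639, ISO 15924 or the IETF registry.

```python
import re

# Assumed shapes, derived from the naming rules above (not an official validator).
LANG = r"[a-z]{2,3}"                                        # ISO 639 code: "xx" or "xxx"
PLAIN_RE = re.compile(rf"^{LANG}$")                         # e.g. "gsw"
SCRIPT_RE = re.compile(rf"^{LANG}-[a-z]{{4}}$")             # "xx[x]-xxxx", ISO 15924 script
VARIANT_RE = re.compile(rf"^{LANG}-[a-z0-9]{{5,8}}$")       # "xx[x]-x[...]", IETF variant


def classify_wiki_code(code: str) -> str:
    """Return a rough category for a wiki code, based on its shape only."""
    code = code.lower()
    if code == "simple":
        return "exception"          # the single allowed non-language code
    if PLAIN_RE.match(code):
        return "language"
    if SCRIPT_RE.match(code):
        return "language+script"
    if VARIANT_RE.match(code):
        return "language+variant"
    return "nonconforming"          # old forms such as "nds-nl" land here


for example in ["gsw", "be-tarask", "qu-kichwa", "nds-nl", "simple"]:
    print(example, "->", classify_wiki_code(example))
```

Under these assumptions, "be-tarask" and "qu-kichwa" are classified as language plus variant, while the old form "nds-nl" is reported as nonconforming because "nl" is neither a four-letter script code nor a valid-length variant subtag.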

LIST TO BE COMPLETED

code || links || code validity (as given here) || intended language || script || issue(s)
|| SIL, Ethn. || special || Alemannic German || Latn || should be moved to "gsw" (bug 23215)
|| SIL, Ethn. || history || Aramaic || || to "syc" (bug 26725)
|| SIL, Ethn. || special || Samogitian || Latn || should be moved to "sgs" (bug 25522)
|| SIL, Ethn. || valid || Belarusian || Cyrl
|| SIL, Ethn. || invalid || Belarusian (Taraškievica) || Cyrl || should be "be-tarask" (bug 9823)
|| SIL, Ethn. || retired || Bhojpuri || Deva || should be moved to "bho"; "bh" is collective (info)
|| SIL, Ethn. || existing || German || Latn || –
|| SIL, Ethn. || existing || Standard English (see also Wikipedia:Manual of Style) || Latn || –
|| SIL, Ethn. || retired || Emiliano-Romagnolo || Latn || should be split into "egl" (Emilian) and "rgn" (Romagnol)
|| SIL, Ethn. || existing || French || Latn || –
|| SIL, Ethn. || existing || Italian || Latn || –
|| SIL, Ethn. || existing || Dutch language (see also Dutch Language Union) || Latn || –
|| SIL, Ethn. || special || Norman language || Latn || refers to the wrong language (bug 23216), should be renamed to "roa-x-nrm", "xno" (Anglo-Norman) or something (or closed)
|| SIL, Ethn. || valid || Western Punjabi || Arab || Or Punjabi in Arabic-based script in Majhi dialect? LangCom will investigate
|| SIL, Ethn. || macro || Quechua languages, especially Southern Quechua || Latn || – (Which varieties included?)
qug (wp) || SIL, Ethn. || valid || Kichwa language (Unified Orthography) || Latn || will probably be named "qu-kichwa" after receiving IETF tag (see here)
|| SIL, Ethn. || special || Classical/Literary Chinese || Hant || should be moved to "lzh" (bug 28443)
|| SIL, Ethn. || special || Yue Chinese / Cantonese || Hant || should be moved to "yue" (bug 28441)
Other pseudo-codes or other issues
ten || SIL, Ethn. || history || Used for tenth anniversary of Wikipedia; code reserved for extinct Tama language of Colombia
www || SIL, Ethn. || special || Code for Wawa language; conflicts with portal

See also
