User:POSSUM chowg/AutoSpellingConverter

From Wikimedia Incubator

What is this?

This is a crappy prototype for a tool that could be used on the Chinook Jargon (CJ) Wikipedia. In the future we might be able to use algorithms to convert between different CJ spelling styles: from a GR-based standard to Duployan, to Dave Robertson's northern learners' system (a variant of which is used on the Kaltash Wawa), and to the classic English-based Gibbs-style CJ spelling. Algorithms to transliterate "the Wawa" into other orthographies, like ISO 369's unique style or Nvolut's keyboard-friendly spelling system, could perhaps be implemented as well.

So "ɬuchmən" could be rewritten as "tloochman" (northern learners' system), "klootchman" (OG English-based), "lhuchmvn" (Nvolut-style), "łuučmən" (ISO style), or "𛰃𛰆𛱛𛰣‌𛰙𛱁𛰚" (Duployan chinuk-pipa stenography).

The Serbian and Old English Wikipedias have similar systems.

For now, this prototype only converts words into an English-based spelling modeled on old CJ documents. Consider it merely a test or proof-of-concept.

Credit goes to Zompist for the "SCA2" re-spelling tool. I might port this tool to lingojam.com later.

How powerful is this tool?

Abilities

  • This tool can handle most punctuation and capital letters.
  • This tool is somewhat flexible, and can handle a few orthographic variations (albeit somewhat rudimentarily).
  • I only had to hardwire one word into the tool, "Chinook", which would have appeared as "Chinnook" with two N's.

Limitations

  • This tool is unable to recognize writing in languages like English and French as distinct from Chinook, so any text written in a language other than Chinook gets garbled. Abbreviations and acronyms are not supported either.
  • You can't use this on the source of a Wikipedia page; you'd screw up the wikitext.
  • Any all-caps words longer than 25-or-so letters will not be fully capitalized.
  • Not all words will be spelled quite as you would expect (illahee is "illihie", mamook is "mammook", hiyu is "hyo", etc.)
  • The only capitalization supported is for the first letter of a sentence or for a whole word, so you can't have something like "POssum chOwg" (which becomes "POSSOME chowg").
  • A few rare Unicode characters break this tool. I hope that they will never have to be used in an article here.
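
Since the tool garbles wikitext, it may help to check a piece of text for markup before pasting it into the converter. The following is only a rough sketch of such a check, not part of the tool itself: the function name `looks_like_wikitext` and the pattern list are assumptions made for illustration, and the patterns cover just a few common kinds of wiki markup.

```python
import re

# Hypothetical, deliberately incomplete list of common wikitext markers.
WIKITEXT_PATTERNS = [
    r"\[\[",               # internal links, e.g. [[Page]]
    r"\{\{",               # templates, e.g. {{Infobox}}
    r"'''?",               # italic/bold quote markup ('' or ''')
    r"^=+.*=+\s*$",        # section headings, e.g. == Title ==
    r"</?[A-Za-z][^>]*>",  # HTML-like tags such as <ref>
]
_WIKITEXT_RE = re.compile("|".join(WIKITEXT_PATTERNS), re.MULTILINE)


def looks_like_wikitext(text):
    """Return True if the text contains any of the markers listed above."""
    return _WIKITEXT_RE.search(text) is not None
```

Text for which this returns True is probably page source rather than plain article text, so it would be safer to copy the rendered text instead.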

How do you use it?

Simply copy and paste all the text in the big gray box below into a blank Google Doc or Microsoft Word document, then save the document as a .txt file. Then go to https://zompist.com/sca2.html, click "Choose File", and upload your .txt file. Now type or paste whatever you want to transliterate into the box labeled "Input Lexicon". (I tested it by copying and pasting articles from Chinook Jargon Wikipedia.)

ꑕ=bcdfghklmnpqrstvxχчɬшwy
䬃=ABCDEFGHIJKLMNOPQRSTUVWXYZΧЧꞭШŁČŠʔƏᜬ̣
䩔=abcdefghijklmnopqrstuvwxyzχчɬшłčšʔǝᜬ̣
ᚼ=bcdfghklmnpqrstvxχчɬшwy
䕶=sч
ᬔ=aeiou


[V]/Ə/_
7/ʔ/䩔_
7/ʔ/_䩔
[və]/ǝ/_
[Hh]/ʰ/[ptkqPTKQ]_
[ʰʼ'̓]//_
ʷ/w/_
ɢ/g/_
䬃/ᚂ䬃/_[䬃䩔]/[䬃䩔]_
䩔/ᚂ䩔/_[䬃䩔]/[䬃䩔]_
䬃/䬃ᚂ/[䬃䩔]_/_[䬃䩔]
䩔/䩔ᚂ/[䬃䩔]_/_[䬃䩔]
䬃/··䩔/[ᚂ·]_䬃
䬃/·䩔/[ᚂ·]_
䬃/䩔/_
x̣/χ/_
ł/ɬ/_
ch/ч/_
č/ч/_
š/ш/_
sh/ш/_
tɬ/tl/_
ƛ/tl/_
ɬ/tl/_
t/k/[ᚂ·]_l
[χx]/h/[ᚂ·]_
[χx]/h/ꑕ_
i/y/ᬔ_
u/w/ᬔ_
ay/iː/_
o/oː/_
i/e/[qχ]_
i/e/_[qχ]
u/oː/[qχ]_
u/oː/_[qχ]
a/o/[q]_/_ᚂ
a/o/_[q]
q/k/_
ʔ//[ᚂ·]_
ʔ//ꑕ_
ʔ//_ꑕ
ʔ/h/_
χ/x/_
aw/ow/_ꑕ
aw/ow/_ᚂ
u/oː/_ꑕᬔ
u/oː/_ꑕᚂ
oː/u/_k
ː//_ː
u/uː/_
ǝ/a/_ᚂ
ǝ/u/_/l_ꑕ
ǝ/a/_
w//ꑕ_ꑕ
w//ꑕ_ᚂ
i/eː/_ᚂ
ꑕ/ꑕ²/ᬔ_ᬔ/ː_/
ꑕ//[ᚂ·]ꑕ(ꑕ)[aeiou](ː)ꑕ(ꑕ)(ꑕ)[aeiou]_²
ꑕ//[ᚂ·]l[aeiou](ː)_²
x/gh/_
u/a/[ᚂ·](ꑕ)(ꑕ)ᬔ(ᬔ)ꑕ(ꑕ)(ꑕ)_ᚼ(ꑕ)(ꑕ)ᬔ
[bdfgklmnprst]//[ᚂ·](ꑕ)(ꑕ)(ꑕ)[aeiou]ꑕ(ꑕ)(ꑕ)[aeiou]_²
k/c/_k
k/ck/ᬔ_/[ᚂ·]lᬔ_
ч/t/_ч
ꑕ/ꑕe/[aio]ː_ᚂ
uː/oo/_
oː/o/_
eː/ee/_
iː/y/_[aeiou]
iː/i/_
ч/ch/_
ш/sh/_
hh/h/_
ghgh/gh/_
shsh/sh/_
ww/w/_
yy/y/_
xw/wh/_
hw/wh/_
ts/ch/[ᚂ·]_
s/ss/_ᚂ/ᚼ_
ch/tch/_[ᚂᚼ]/ᚼ_
ee/ie/_[ᚂ]
i/e/[ᚂ]ᚼ(ᚼ)(ᚼ)_e
[ou]y/oi/_
o//_oi
oo/o/_ᚂ
i/e/_w
ck/k/_/_ᚂ
u/a/[ᚂ·](ꑕ)(ꑕ)ᬔ(ᬔ)ꑕ(ꑕ)(ꑕ)_ᚼ(ꑕ)(ꑕ)ᬔ
hinnoo/hinoo/_
䩔/䬃/·_
䩔/䬃/··䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䬃䩔䬃䩔_
䩔/䬃/䩔䬃_
䩔/䬃/䬃䬃䬃_
䩔/䬃/··䬃_
[ᚂ·]//_
ḥ/h/_
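
Most lines above have the shape target/replacement/environment, where "_" in the environment marks the spot the target has to occupy, and lines like "ᬔ=aeiou" define a category of letters under a one-character name. To show how such a line can be read, here is a minimal Python sketch. It is not Zompist's implementation: the names `compile_side`, `apply_rule` and `apply_rules` are made up for illustration, and the sketch assumes only three-field rules. Exceptions (a fourth field), optional elements in parentheses and the "²" marker are not handled, and its left-to-right matching may differ from SCA2 in edge cases.

```python
import re


def compile_side(part, cats):
    """Turn a target or one side of an environment into a regex fragment."""
    out = []
    in_set = False
    for ch in part:
        if ch == "[" and not in_set:
            in_set = True
            out.append("[")
        elif ch == "]" and in_set:
            in_set = False
            out.append("]")
        elif ch in cats:
            # A category name stands for any one of its member characters.
            members = re.escape(cats[ch])
            out.append(members if in_set else "[" + members + "]")
        else:
            out.append(re.escape(ch))
    return "".join(out)


def apply_rule(rule, text, cats):
    """Apply one three-field rule (target/replacement/environment)."""
    target, replacement, env = rule.split("/")
    before, after = env.split("_")
    pattern = ""
    if before:
        pattern += "(?<=" + compile_side(before, cats) + ")"
    pattern += "(" + compile_side(target, cats) + ")"
    if after:
        pattern += "(?=" + compile_side(after, cats) + ")"

    def substitute(match):
        # Category-to-category rules map members by their position.
        if target in cats and replacement in cats:
            return cats[replacement][cats[target].index(match.group(1))]
        return replacement

    return re.sub(pattern, substitute, text)


def apply_rules(rules, text, cats):
    """Apply the rules in order, each to the output of the previous one."""
    for rule in rules:
        text = apply_rule(rule, text, cats)
    return text


categories = {"ᬔ": "aeiou"}
rules = ["i/e/_[qχ]", "q/k/_", "i/y/ᬔ_"]
print(apply_rules(rules, "siq", categories))
```

With these three rules, "siq" first becomes "seq" (i turns into e before q) and then "sek" (q turns into k); the last rule does nothing here because no i follows a vowel. The order of the rules matters, which is why the real rule list is so long and so carefully sequenced.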

Example