Jump to content

Talk:Wp/sgh

Add topic
From Wikimedia Incubator
Latest comment: 1 month ago by DaveZ123 in topic Latin to Cyrillic conversion table

Unifying the Latin Alphabet for Shughni

[edit source]

Currently, two slightly different Latin-based alphabets are being used for Shughni. While both are similar, one version is more widely used among speakers, learners, and online resources. However, some contributors continue to use the less common version. To improve readability and standardization, I propose adopting the more popular alphabet for Wikipedia articles. This would ensure: Greater consistency across Shughni-language content. Easier learning for new readers and speakers. Compatibility with educational and linguistic resources already using this system. I understand that updating all existing pages to the widely accepted alphabet will take time, but I believe this transition is necessary for long-term consistency.

Here is the more commonly used alphabet.

1)Aa
2)Āā
3)Bb
4)Cc
5)Čč
6)Dd
7)Δδ
8)Ee
9)Êê
10)Ff
11)Gg
12)Ɣɣ
13)Ɣ̌ɣ̌
14)Ii
15)Īī
16)Jj
17)Kk
18)Ll
19)Mm
20)Nn
21)Oo
22)Pp
23)Qq
24)Rr
25)Ss
26)Šš
27)Tt
28)Θθ
29)Uu
30)Ūū
31)Ůů
32)Vv
33)Ww
34)Xx
35)X̌x̌
36)Yy
37)Zz
38)Žž
39)Ʒʒ

I encourage discussion on how we can gradually standardize Shughni Wikipedia content to reflect the widely accepted system. Pomereyum (talk) 23:26, 14 March 2025 (UTC)Reply

@Pomereyum, I have created a Shughni keyboard here.
Shughni latin to cyrillic conversion can be found here. --DaveZ123 (talk) 13:25, 3 June 2025 (UTC)Reply
@DaveZ123, Wow, thank you so much! It will help us a lot! Pomereyum (talk) 02:47, 20 June 2025 (UTC)Reply
@DaveZ123, But I also want to add that for Ɣ̆ɣ̆, X̆x̆, and Ȇȇ symbols you used the breve and the inverted breve diacritics, but the actual symbols should contain the caron and circumflex symbols Pomereyum (talk) 03:37, 20 June 2025 (UTC)Reply
@Pomereyum, Thank you for the feedback. It now displays as Ɣ̌ɣ̌, X̌x̌, and Êê on the transliteration tool and the Shughni latin keyboard.--DaveZ123 (talk) 06:05, 21 June 2025 (UTC)Reply
Thank you so much! Pomereyum (talk) 22:58, 15 August 2025 (UTC)Reply

Some recommendations

[edit source]

Hello everyone, and welcome to the Shughni Wikipedia Incubator! I’m excited to see this project grow, and I hope we can all work together to make it a reliable and valuable resource for Shughni speakers and learners.

Writing System

To ensure consistency and readability, I encourage contributors to use the Latin-based Shughni alphabet. This writing system has been widely adopted in linguistic research and publications and is better suited for digital platforms. I recommend transitioning existing articles written in Cyrillic to the Latin-based script for a unified experience.

Article Consolidation

Currently, there are several duplicate articles on the same topics under different titles. To improve quality and make editing more efficient, I think we should merge overlapping articles and work together on a single, well-developed page instead of maintaining multiple versions.

Please don’t be shy to leave comments or share your thoughts on the talk pages, every opinion helps us improve! Let’s work together to build a high-quality, unified resource for all Shughni speakers! Chixiijak (talk) 00:59, 8 September 2025 (UTC)Reply

@Chixiijak, There is no problem using the Latin alphabet for Shughni. In the future we can implement an automatic conversion so every article can be read in both latin and cyrillic. For example:
Crimean Tatar: Page in latin, same page in Cyrillic
Uzbek: Page in latin, same page in Cyrillic
Talysh: Page in latin, same page in cyrillic
Tajik: Page in latin, Same page in cyrillic
No automatic conversion system has yet been implemented on Wikimedia Incubabtor for Shughni. After it has been implemented, we can merge the cyrillic articles into Latin. The two writing systems can co-exist side by side for now. --DaveZ123 (talk) 06:49, 9 September 2025 (UTC)Reply
The main focus now should be the creation of new Shughni articles. Please create new articles that have more content and are longer than 1 sentence. --DaveZ123 (talk) 06:49, 9 September 2025 (UTC)Reply
Hi @DaveZ123, thanks for clarifying! I just wanted to ask, how exactly would the automatic conversion work? The examples you provided, like Crimean Tatar and Uzbek, have separate links for Latin and Cyrillic, but I’m not sure how the system handles switching between the two scripts on the same page. Does it automatically display the preferred script based on user settings, or are there two versions of the page maintained separately? Chixiijak (talk) 20:04, 9 September 2025 (UTC)Reply
@Chixiijak, in some wikipedia versions, there are two pages maintained separately.
However in all four examples listed above (Crimean Tatar, Uzbek, Talysh, and Tajik), there is only one page created which can then be viewed in the other writing system by clicking on the dropdown option. For example on the Uzbek page, you click on "lotin/кирилл", then select either cyrillic or latin to view the page in one of the two writing systems (see the screenshot attached below).
\
--DaveZ123 (talk) 05:43, 10 September 2025 (UTC)Reply
@DaveZ123 I see, thanks. Chixiijak (talk) 14:51, 10 September 2025 (UTC)Reply

Latin to Cyrillic conversion table

[edit source]
  • This section is to document the Latin to Cyrillic conversion table for Shughni language to enable the specialists at Tech Support of the Wikimedia Foundation to implement it.
  • Background: When Talysh Wikipedia was at Wikimedia Incubator, a language convertor was implemented. This enabled them to only create an article ONCE instead of duplicating it.
  • All articles on Interslavic Test Wikipedia (including the Main Page) are able to be viewed in both the Latin and Cyrillic alphabet without any article duplication necessary by clicking on the page tab (isv, then selecting either Latinica or Кирилица).
  • Implementing this language convertor on Shughni Wikipedia at Wikimedia Incubator will enable Shughni articles to be created in the latin alphabet instead of needing to duplicate it into cyrillic every time.
  • The converter will probably be the simplest in MediaWiki to date, because the correspondence between the letters is perfectly one to one, without any special rules.
  • Online Shughni transliteration tool can be found here.
  • This table has now been mentioned on Wikipedias_in_multiple_writing_systems#Shughni.
A formal request has been lodged as Phabricator task number 404496.
Upper Case letters Latin
AĀBCČDΔEÊFGƔƔ̌IĪKLMNOPQRSŠTΘUŪŮVWXYZŽƷ
Upper Case letters cyrillic
АА̄БЦЧДД̌ЕӖФГҒГ̌ИӢҶКЛМНОПҚРСШТТ̌УӮУ̊ВВ̌XХ̌ЙЗЖҘ
lower case letters latin
aābcčdδeêfgɣɣ̌iīǰklmnopqrsštθ
ϑ
uūůvwxyzžʒ
lower case letters cyrillic
аа̄бцчдд̌еӗфгғг̌иӣҷклмнопқрсштт̌
т̌
уӯу̊вв̌xх̌йзжҙ

Please note that in the Shughni latin alphabet, "θ" and "ϑ" are pronounced the same. "θ" is the preferred letter. In cyrillic they are both written as "т̌".

Therefore т̌ will be transliterated to θ by default.

--DaveZ123 (talk) 13:25, 13 September 2025 (UTC)Reply

I think for Ʒ use cyrillic З with caron -- Rumsor (talk) 14:26, 4 October 2025 (UTC)Reply
see article on Rumsor (talk) 14:27, 4 October 2025 (UTC)Reply
@User:Rumsor, that is not true in the shughni context. "Ʒ" is converted to "ҙ", here is sample text:
naɣ̌ʒimč-at idê Sūriyā okim Kwirīnus ca vud. Atā fukaθ-en toyd tar saršumori, ar čāy tar xu x̌ār. наг̌ҙимч-ат идӗ Сӯрийа̄ оким Кв̌ирӣнус ца вуд. Ата̄ фукат̌-ен тойд тар саршумори, ар ча̄й тар xу х̌а̄р.

Refer to this page: Dhe_(Cyrillic) --DaveZ123 (talk) 15:30, 4 October 2025 (UTC)Reply