I just integrated:
changeset: 13312:537259ad27f6tag: tipuser: Garrett D'Amoredate: Wed Mar 23 08:35:14 2011 -0700description:324 need serbian locale supportReviewed by: Rich LoweApproved by: Garrett D'Amore
This is a bit unusual relative to most of the locales, because Serbo-Croatian is a language fraught with some unique political considerations:
There is a common root language, that everyone speaks and understands. But speakers of it rarely agree on what to call it. In Serbia its Serbian. In Bosnia its Bosnian. And so on for Croatian and Montenegrin.
In illumos, we have followed the Unicode CLDR example, and we now have these locales:
hr_HR.UTF-8 - Croatian in Croatia
sr_BA.UTF-8 - Serbian in Bosnia and Herzegovina
sr_ME.UTF-8 - Serbian in Montenegro
sr_RS.UTF-8 - Serbian in Serbia
I want to apologize to anyone offended by this decision, but rather than make a contentious decision on our own, I decided it was best to simply follow the decisions of an international standards body. I believe that there is no fundamental difference in the languages, although some national variances appear to be present in the data files. If someone has more accurate names for these, or believes that some aliased locales will assist with compatibility with other operating systems, then I would be happy to hear suggestions. Ideally from someone familiar with accepted practice in these locations.
There is another wrinkle in all this too. This language -- thanks largely to occupation by Soviet forces as part of the SFR Yugoslavia, is commonly represented using two different alphabets -- Cyrillic and Latin. Generally most locations use Latin, but within Serbia, Cyrillic is mandated by law. So sr_RS uses Cyrillic, while the others use Latin.
Here are the two alphabets:
А Б В Г Д Ђ Е Ж З И Ј К Л Љ М Н Њ О П Р С Т Ћ У Ф Х Ц Ч Џ Ш
A B C Č Ć D Dž Đ E F G H I J K L Lj M N Nj O P R S Š T U V Z Ž
Anyway, if someone sees room for corrections or improvements here, especially if they are familiar with the language(s) and/or region(s), I would appreciate hearing back from you.

5 comments:
Good call on the names, and good work on making the OS more accessible.
"This language -- thanks largely to occupation by Soviet forces as part of the SFR Yugoslavia, is commonly represented using two different alphabets -- Cyrillic and Latin."
This sentence is patently WRONG!!!
1) Cyrilic was the FIRST written form for the Slavonic languages, ever. It was used in Serbia BEFORE it spread to the Russian region. See http://en.wikipedia.org/wiki/Cyrilic
2) Soviet troops NEVER occupied Yugoslavia. Not even for a day. Not even during WW2.
I am sure this text was just the inevitable consequence of US education system, so please correct it.
Keep up the good work!
Cheers from Slovakia, the place where Cyrilic originated 1100+ yrs ago.
Milan Niznansky
ihosama:
Thanks for the clarification!
As much as I'd like to blame someone else for my errors, I think I have to blame my own ignorance, and my own incorrect assumptions. The errors in the statement are wholly my own fault, and I'm grateful for the correction.
Thank you very much for the Serbian Language Family Support, with regard to politicnih and historical fact, in my opinion it is less important than the fact that the Serbian and other languages from the former Yugoslavia to be supported by the operating system (Illumos), which continues the tradition of OpenSolaris.
1) Cyrilic was the FIRST written form for the Slavonic languages, ever.
To be exact: Glagolitic alphabet was the FIRST written form for the Slavic languages, ever. It's successor, Cyrillic, became more popular later.
:D
Cheers from Slovakia, the place where Cyrilic originated 1100+ yrs ago.
Cyrillic alphabet was developed in Bulgaria. On the other hand, Glagolitic alphabet was created or at least formalized and expanded with new letters for non-Greek sounds by Saint Cyril during his visit to Great Moravia in 862. So this must be the Slovakian connection.
There is a common root language, that everyone speaks and understands.
OK, I'll try to explain this.
No. There is a common "language standard" that everyone speaks and understands in Croatia, Bosnia And Herzegovina, Serbia and Montenegro. Root languages (Chakavian, Kajkavian and Shtokavian dialects) in some cases are so different and so versatile that the people within the same ethnic group have difficulties to understand each other. Croats speak all three of them, and other nations speak different versions of Shtokavian. But even that Serbo-Croatian standard (Shtokavian) was in the days of Yugoslavia always pluricentric, which means that it was one language with several standard versions, both in spoken and in written forms. So differences exist, and if there is nothing unusual about en_US and en_UK locales or fr_CA and fr_FR locales, there shouldn't be anything unusual about this matter.
Keep the good work, Garret...
Cheers from Croatia.
Post a Comment