Mailinglist Archive: opensuse-factory (710 mails)

< Previous Next >
Re: [opensuse-factory] collation bug in locales using UTF-8 (cur= a<A<b<B<z<Z; should be A<B<C<Z<a<b<c<z)
  • From: Anders Johansson <ajh@xxxxxxxx>
  • Date: Sun, 27 May 2012 23:30:53 +0200
  • Message-id: <2148324.ksAiJshia9@carolin>
On Sunday 27 May 2012 13:07:13 Linda Walsh wrote:
It seems that Open SuSE suffers from this bug as well:

Somewhere along the line, due to people paying attention to POSIX, they
though they could change collation orders for any locality outside of the
'C' locality to anything they wanted.

What they didn't realize is that Unicode also specifies a correlation order
that is roughly (maybe exactly in the C range), equivalent to the C range.

No it doesn't. In fact, the link in the bug explicitly says that there must be
a way in the implementation of parametrizing the sort order, to account for
national standards.

While localized character sets iso-8859-xx... and others might have
different collating orders, for those who have UTF-8 as the default
encoding, or more so, set encodings to lang_CO.UTF-8, the character set
collation order should take precedence -- in __AT LEAST__, character ranges
as used in Regex's (as those are 'pure characters and no words are involved
-- then only the ordering of the character set should be used).

What are you saying here? At first you sound as though you are arguing for a
single, standard sorting order across all countries, languages, alphabets in
teh world, and here you seem to say it should be character set specific.

In any case, there is no way you are ever going to get a standard sorting
order for the whole world.

To unsubscribe, e-mail: opensuse-factory+unsubscribe@xxxxxxxxxxxx
To contact the owner, e-mail: opensuse-factory+owner@xxxxxxxxxxxx

< Previous Next >
Follow Ups