Re: [opensuse-factory] collation bug in locales using UTF-8 (cur= a<A<b<B<z<Z; should be A<B<C<Z<a<b<c<z)

27 May 2012

On Sunday 27 May 2012 13:07:13 Linda Walsh wrote:
...
> It seems that Open SuSE suffers from this bug as well:
> https://bugs.launchpad.net/ubuntu/+source/bash/+bug/120687
> Somewhere along the line, due to people paying attention to POSIX, they
> thought they could change collation orders for any locality outside of the
> 'C' locality to anything they wanted.
> What they didn't realize is that Unicode also specifies a correlation order
> that is roughly (maybe exactly in the C range), equivalent to the C range.

No it doesn't. In fact, the link in the bug explicitly says that there must be
a way for the implementation to parametrize the sort order, to account for
national standards.

...
> While localized character sets iso-8859-xx... and others might have
> different collating orders,  for those who have UTF-8 as the default
> encoding, or more so, set encodings to lang_CO.UTF-8, the character set
> collation order should take precedence -- in __AT LEAST__, character ranges
> as used in Regex's (as those are 'pure characters and no words are involved
> -- then only the ordering of the character set should be used).

What are you saying here? At first you sound as though you are arguing for a
single, standard sorting order across all countries, languages, alphabets in
the world, and here you seem to say it should be character set specific.

In any case, there is no way you are ever going to get a standard sorting 
order for the whole world.
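
To make the two orderings from the subject line concrete, here is a minimal
sketch in Python. It only simulates them with plain sort keys and does not call
the C library's locale code. The helper names (codepoint_key, interleaved_key,
in_range) are made up for illustration, and the interleaved key is merely an
assumed stand-in for a case-interleaved locale order, not what any particular
locale actually implements.

```python
# Illustration only: simulates the two orderings from the subject line
# with plain sort keys; it does not use the system's locale definitions.

def codepoint_key(ch):
    # 'C'-style order: compare by code point, giving A<B<Z<a<b<z.
    return ord(ch)

def interleaved_key(ch):
    # Hypothetical stand-in for a case-interleaved order: a<A<b<B<z<Z.
    return (ch.lower(), ch.isupper())

def in_range(ch, lo, hi, key):
    # True if ch lies within the range lo..hi under the given ordering,
    # which is how a range such as [a-z] is interpreted by collation.
    return key(lo) <= key(ch) <= key(hi)

letters = ["a", "A", "b", "B", "z", "Z"]
by_codepoint = sorted(letters, key=codepoint_key)
by_interleaved = sorted(letters, key=interleaved_key)

print(by_codepoint)    # ['A', 'B', 'Z', 'a', 'b', 'z']
print(by_interleaved)  # ['a', 'A', 'b', 'B', 'z', 'Z']

# Under code point order, [a-z] holds lowercase letters only.
print(in_range("B", "a", "z", codepoint_key))    # False
# Under the interleaved order, 'B' sorts between 'a' and 'z' ...
print(in_range("B", "a", "z", interleaved_key))  # True
# ... while 'Z' sorts after 'z' and falls outside the range.
print(in_range("Z", "a", "z", interleaved_key))  # False
```

The point of the sketch is only that the meaning of a range like [a-z] depends
on which ordering is in effect. Which ordering a given locale should use is
exactly what is being argued about in this thread.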

Anders

Anders Johansson