Re: [suse-amd64] wchar_t with 4 bytes?

19 Feb 2005


Detlef Grittner wrote:
...
I am using Unicode in a C++ program and found out that wchar_t uses 4 
bytes instead of 2 bytes, which would be sufficient.
Probably wchar_t is defined as int for 64 bit platforms, I haven't 
looked it up in the header file yet.
Is there any chance to downsize wchar_t to unsigned short?
Except that Unicode needs more than 16 bits per character when stored 
raw. Code points run up to U+10FFFF, which takes 21 bits, so three 
bytes are needed and the 4-byte UTF-32 format makes a lot of sense. 
With 2-byte UTF-16 you would have to read every character to find the 
character count, or a character position, in a Unicode string, since 
some characters would still take 4 bytes (a surrogate pair).
We are having the same discussion on a number of other lists over Unicode :)
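
To make the counting point concrete, here is a rough sketch rather than 
anything definitive. The function names are made up for illustration, 
it assumes well-formed UTF-16 input, and it needs a compiler with 
char16_t/char32_t support (C++11 or later):

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper: count code points in a UTF-16 string.
// Assumes the input is well-formed (no unpaired surrogates).
std::size_t utf16_code_points(const std::u16string& s) {
    std::size_t count = 0;
    for (char16_t unit : s) {
        // 0xDC00..0xDFFF are trailing surrogates, i.e. the second half
        // of a 4-byte character, so they must not be counted again.
        if (unit < 0xDC00 || unit > 0xDFFF) {
            ++count;
        }
    }
    return count;
}

// Hypothetical helper: in UTF-32 every unit is one code point,
// so the length is known without scanning.
std::size_t utf32_code_points(const std::u32string& s) {
    return s.size();
}
```

For a string such as u"a\U0001D11E" the UTF-16 version has to walk all 
three 16-bit units to report two characters, while the UTF-32 version 
of the same text just returns its size.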

-- 
Lester Caine
-----------------------------
L.S.Caine Electronic Services