https://bugzilla.novell.com/show_bug.cgi?id=743617

https://bugzilla.novell.com/show_bug.cgi?id=743617#c0

           Summary: GB18030 converter is broken
    Classification: openSUSE
           Product: openSUSE 12.1
           Version: Final
          Platform: PC
        OS/Version: Other
            Status: NEW
          Severity: Critical
          Priority: P5 - None
         Component: Basesystem
        AssignedTo: bnc-team-screening@forge.provo.novell.com
        ReportedBy: bruno@clisp.org
         QAContact: qa@suse.de
          Found By: ---
           Blocker: ---

User-Agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/534.34 (KHTML, like Gecko) konqueror/4.7.4 Safari/534.34

The GB18030 converter (from the glibc package) is broken: It cannot encode
valid Unicode characters such as U+C50B HANGUL SYLLABLE SSEUH to GB18030.

This is critical, because
1) GB18030 support is essential for shipping in China.
2) Users may convert their documents using this converter and realize only
   afterwards that they have lost data through incorrect conversion.

Reproducible: Always

Steps to Reproduce:
In a shell:
printf '\x00\x00\xc5\x0B' | LC_ALL=C /usr/bin/iconv -f UCS-4BE -t GB18030 | od -t x1 | head -n 1

Actual Results:
$ printf '\x00\x00\xc5\x0B' | LC_ALL=C /usr/bin/iconv -f UCS-4BE -t GB18030 | od -t x1 | head -n 1
/usr/bin/iconv: illegal input sequence at position 0
0000000

Expected Results:
$ printf '\x00\x00\xc5\x0B' | LC_ALL=C /usr/bin/iconv -f UCS-4BE -t GB18030 | od -t x1 | head -n 1
0000000 83 32 da 36

The situation is apparently caused by the
glibc-2.3.4-gb18030-big5hkscs.diff.bz2.

glibc's current GB18030 converter (upstream) cannot be used either, see
http://sourceware.org/bugzilla/show_bug.cgi?id=11837
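
The expected bytes 83 32 da 36 can be cross-checked without iconv. The following is only a minimal sketch, not glibc's implementation: gb18030_four_byte is a hypothetical helper name, and it assumes that the code points U+9FA6..U+D7FF form one linear four-byte range in GB18030 that starts at index 19043 (bytes 82 35 8f 33). It does no range checking.

```shell
#!/bin/sh
# Hypothetical helper (not part of glibc or iconv): derive the GB18030
# four-byte sequence for a code point in U+9FA6..U+D7FF by arithmetic.
gb18030_four_byte() {
    cp=$(( $1 ))                      # code point, e.g. 0xC50B
    # Assumption: U+9FA6 sits at linear index 19043 of the four-byte space.
    idx=$(( cp - 0x9FA6 + 19043 ))
    # Four-byte layout: bytes 1 and 3 run 0x81..0xFE, bytes 2 and 4 run 0x30..0x39.
    b1=$(( idx / 12600 + 0x81 ))
    b2=$(( idx % 12600 / 1260 + 0x30 ))
    b3=$(( idx % 1260 / 10 + 0x81 ))
    b4=$(( idx % 10 + 0x30 ))
    printf '%02x %02x %02x %02x\n' "$b1" "$b2" "$b3" "$b4"
}

gb18030_four_byte 0xC50B
```

Under these assumptions, the call for U+C50B prints the same four bytes as the Expected Results above.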