Mailinglist Archive: opensuse-bugs (4856 mails)
| < Previous | Next > |
[Bug 650932] New: perl XML::Parser (and in effect XML::Parser::Expat) does not return UTF-8 strings as expected.
- From: bugzilla_noreply@xxxxxxxxxx
- Date: Wed, 3 Nov 2010 11:54:00 +0000
- Message-id: <bug-650932-21960@xxxxxxxxxxxxxxxxxxxxxxxx/>
https://bugzilla.novell.com/show_bug.cgi?id=650932
https://bugzilla.novell.com/show_bug.cgi?id=650932#c0
Summary: perl XML::Parser (and in effect XML::Parser::Expat)
does not return UTF-8 strings as expected.
Classification: openSUSE
Product: openSUSE 11.3
Version: Final
Platform: x86-64
OS/Version: openSUSE 11.3
Status: NEW
Severity: Critical
Priority: P5 - None
Component: Development
AssignedTo: pth@xxxxxxxxxx
ReportedBy: DOlsson@xxxxxx
QAContact: qa@xxxxxxx
Found By: ---
Blocker: ---
Created an attachment (id=398363)
--> (http://bugzilla.novell.com/attachment.cgi?id=398363)
Small Perl program using XML::Parser to illustrate the problem reported.
User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.10)
Gecko/20100914 SUSE/3.6.10-30.2 Firefox/3.6.10
When using the Perl module "XML::Parser" to parse a valid UTF-8 XML text
containing e.g. German umlauts, these are *not* returned as UTF-8 characters,
but as Latin-1 (ISO-8859-1) characters!
Reproducible: Always
Steps to Reproduce:
Write a Perl program using the XML::Parser module -- see attachment for small
test program to illustrate problem.
Actual Results:
The UTF-8 characters "öäüÖÄÜß" are returned as Latin-1 characters "�������".
Expected Results:
Expected to get the actual characters from read input, i.e. "öäüÖÄÜß".
Download attachment, make it executable and run it "./xmlParserBug".
Requires the "perl-XML-Parser" package to be installed.
--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
| < Previous | Next > |