Mailinglist Archive: opensuse-bugs (4856 mails)

[Bug 650932] New: perl XML::Parser (and in effect XML::Parser::Expat) does not return UTF-8 strings as expected.
  • From: bugzilla_noreply@xxxxxxxxxx
  • Date: Wed, 3 Nov 2010 11:54:00 +0000
  • Message-id: <bug-650932-21960@xxxxxxxxxxxxxxxxxxxxxxxx/>

https://bugzilla.novell.com/show_bug.cgi?id=650932


Summary: perl XML::Parser (and in effect XML::Parser::Expat)
does not return UTF-8 strings as expected.
Classification: openSUSE
Product: openSUSE 11.3
Version: Final
Platform: x86-64
OS/Version: openSUSE 11.3
Status: NEW
Severity: Critical
Priority: P5 - None
Component: Development
AssignedTo: pth@xxxxxxxxxx
ReportedBy: DOlsson@xxxxxx
QAContact: qa@xxxxxxx
Found By: ---
Blocker: ---


Created an attachment (id=398363)
--> (http://bugzilla.novell.com/attachment.cgi?id=398363)
Small Perl program using XML::Parser to illustrate the problem reported.

User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.10)
Gecko/20100914 SUSE/3.6.10-30.2 Firefox/3.6.10

When the Perl module "XML::Parser" parses a valid UTF-8 XML text containing,
for example, German umlauts, these characters are *not* returned as UTF-8,
but as Latin-1 (ISO-8859-1)!

Reproducible: Always

Steps to Reproduce:
Write a Perl program that uses the XML::Parser module -- see the attachment
for a small test program that illustrates the problem.

Actual Results:
The UTF-8 characters "öäüÖÄÜß" are returned as Latin-1 bytes, which show up as
invalid characters ("�������") when displayed as UTF-8.

Expected Results:
Expected to get the actual characters from read input, i.e. "öäüÖÄÜß".

Download the attachment, make it executable, and run it as "./xmlParserBug".

Requires the "perl-XML-Parser" package to be installed.
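
For readers without access to the attachment, the steps above can be sketched
roughly as follows. This is a hypothetical reproduction, not the attached
"xmlParserBug" program: the XML snippet, the <text> element name, and the
variable names are assumptions made for illustration. It collects the
character data via a Char handler and prints it twice, once with no output
layer on STDOUT and once with a UTF-8 layer, so the two results can be
compared on a UTF-8 terminal.

```perl
#!/usr/bin/perl
# Hypothetical reproduction sketch; NOT the attachment from this report.
use strict;
use warnings;
use XML::Parser;

# UTF-8 encoded XML document held as raw bytes. There is no "use utf8" here,
# so the \x escapes below are the individual UTF-8 bytes of the seven
# characters quoted in the report. The <text> element is an assumed name.
my $xml = qq{<?xml version="1.0" encoding="UTF-8"?>\n}
        . "<text>\xC3\xB6\xC3\xA4\xC3\xBC\xC3\x96\xC3\x84\xC3\x9C\xC3\x9F</text>";

my $text = '';
my $parser = XML::Parser->new(
    Handlers => {
        # The Char handler receives the Expat object and a chunk of
        # character data; chunks are concatenated into $text.
        Char => sub { my ($expat, $chunk) = @_; $text .= $chunk; },
    },
);
$parser->parse($xml);

# Report how Perl sees the returned string: its length in characters and
# whether it is flagged internally as a character (UTF-8) string.
printf "characters: %d, utf8 flag: %s\n",
    length($text), (utf8::is_utf8($text) ? 'on' : 'off');

# With no output layer, Perl writes characters below U+0100 as single
# Latin-1 bytes, which resembles the symptom described in the report.
print "no layer:   $text\n";

# With a UTF-8 output layer, the same string is encoded as UTF-8 on output.
binmode(STDOUT, ':encoding(UTF-8)');
print "utf8 layer: $text\n";
```

If the second line of output looks garbled and the third looks correct, the
difference lies in how the string is written out, not in what the parser
returned; whether that applies to the attached program is not confirmed here.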
