[Bug 249431] New: gettext should convert untranslated non-ASCII msgids into locale encoding - openSUSE Bugs - openSUSE Mailing Lists

[Bug 249431] New: gettext should convert untranslated non-ASCII msgids into locale encoding

bugzilla_noreply＠novell.com

27 Feb 2007

17:47

https://bugzilla.novell.com/show_bug.cgi?id=249431

           Summary: gettext should convert untranslated non-ASCII msgids into locale encoding
           Product: openSUSE 10.3
           Version: Alpha 1
          Platform: Other
        OS/Version: Linux
            Status: NEW
          Severity: Normal
          Priority: P5 - None
         Component: Basesystem
        AssignedTo: bnc-team-screening@forge.provo.novell.com
        ReportedBy: mfabian@novell.com
         QAContact: qa@suse.de

gettext should convert untranslated non-ASCII msgids into locale encoding.

--
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.


bugzilla_noreply＠novell.com

27 Feb 2007

17:50

New subject: [Bug 249431] gettext should convert untranslated non-ASCII msgids into locale encoding

https://bugzilla.novell.com/show_bug.cgi?id=249431

------- Comment #1 from mfabian@novell.com  2007-02-27 10:50 MST -------
See bug #248859, especially comment #12, for the reason why I think this would be useful.


bugzilla_noreply＠novell.com

18:00

https://bugzilla.novell.com/show_bug.cgi?id=249431

------- Comment #2 from mfabian@novell.com  2007-02-27 11:00 MST -------
I've added the maintainer of the standalone gettext package, Philipp Thomas, and the maintainer of the glibc package, Petr Baudis, to CC:.


bugzilla_noreply＠novell.com

18:06

https://bugzilla.novell.com/show_bug.cgi?id=249431

------- Comment #3 from mfabian@novell.com  2007-02-27 11:06 MST -------
Added Bruno Haible, one of the upstream authors of gettext, to CC:.

Bruno, what do you think about the idea of converting the encoding of the msgid to the encoding of the current locale, the same way as is done for the msgstr? Non-ASCII msgids would then be easy to use, provided UTF-8 is used consistently in the source code as well as in the .po files.
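
As a rough illustration of the conversion asked about in comment #3, here is a minimal sketch in C that converts a UTF-8 string to another encoding with POSIX iconv(). The helper name utf8_to_codeset is hypothetical; it is not part of gettext or glibc, and this is not how gettext itself is implemented. The sketch assumes a NUL-terminated, byte-oriented target encoding (so not UTF-16 or UTF-32).

```c
#include <assert.h>
#include <iconv.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper, not part of gettext: convert the UTF-8 string SRC
   to the encoding TO_CODE.  Returns a malloc()ed string that the caller
   must free, or NULL if the conversion cannot be done.  */
char *
utf8_to_codeset (const char *src, const char *to_code)
{
  assert (src != NULL && to_code != NULL);

  iconv_t cd = iconv_open (to_code, "UTF-8");
  if (cd == (iconv_t) -1)
    return NULL;                /* unknown or unsupported encoding */

  size_t in_left = strlen (src);
  /* Assumption: 4 output bytes per input byte plus some slack is enough
     for the byte-oriented encodings this sketch targets.  */
  size_t out_size = 4 * in_left + 16;
  size_t out_left = out_size;
  char *out = malloc (out_size + 1);
  if (out == NULL)
    {
      iconv_close (cd);
      return NULL;
    }

  char *in_ptr = (char *) src;  /* iconv() does not modify the input bytes */
  char *out_ptr = out;
  size_t rc = iconv (cd, &in_ptr, &in_left, &out_ptr, &out_left);
  if (rc != (size_t) -1)
    /* Flush any pending shift sequence of a stateful encoding.  */
    rc = iconv (cd, NULL, NULL, &out_ptr, &out_left);
  iconv_close (cd);

  if (rc == (size_t) -1)
    {
      free (out);
      return NULL;              /* e.g. a character is not representable */
    }
  *out_ptr = '\0';
  return out;
}
```

In a real program, to_code would typically be the locale's encoding, for example the result of nl_langinfo (CODESET) after setlocale (LC_ALL, "").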


bugzilla_noreply＠novell.com

18:07

https://bugzilla.novell.com/show_bug.cgi?id=249431

mfabian@novell.com changed:

           What    |Removed                                   |Added
----------------------------------------------------------------------------
         AssignedTo|bnc-team-screening@forge.provo.novell.com |pbaudis@novell.com


bugzilla_noreply＠novell.com

28 Feb 2007

00:21

https://bugzilla.novell.com/show_bug.cgi?id=249431

bruno@clisp.org changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |WONTFIX

------- Comment #4 from bruno@clisp.org  2007-02-27 17:21 MST -------
The function gettext() is now standardized by the LSB, see
http://www.linux-foundation.org/spec/booksets/LSB-Core-generic/LSB-Core-gene...
therefore there is not much room for changing its behaviour.

But anyway, what does it mean to support non-ASCII msgids?

1) You must provide a way for handling the case that the user's locale encoding cannot represent the non-ASCII characters. Let's take the example from the BR you mentioned: https://bugzilla.novell.com/show_bug.cgi?id=248859 (funny bug number, when it's about encodings, by the way :-))

     gettext ("St Dévote Day")

   The user certainly wishes to see "St Devote Day" then, i.e. the transliteration of glibc and libiconv could do it. But in more general cases, this does not work any more. So IMO the programmer must jump in and provide the ASCII equivalent too:

     gettext_na ("St Devote Day", "St Dévote Day")

2) The argument to gettext is used as a key into the hash table in the .mo file. It's a hash table so that it's fast. If you use "St Dévote Day" as gettext argument, you have
     - to convert this string to the .mo file's text encoding first, using iconv(), [you cannot reliably enforce that all .mo files are in UTF-8 encoding],
     - to specify globally the source encoding, for example, through a new hypothetical function call

         bind_textdomain_source_codeset ("ISO-8859-1");

   If you use "St Devote Day" as gettext argument, you have none of these two problems.

3) If gettext returns the msgid, i.e. if the message was not translated, you have to convert it to the locale encoding yourself. Again, you have to specify globally the source encoding, for example, through a hypothetical function call

     bind_textdomain_source_codeset ("ISO-8859-1");

   Additionally, you don't want a memory leak here, when the same call is made repeatedly. So you need to cache the result for later reuse.

You can get rid of the need for bind_textdomain_source_codeset if you assume that the source code is in UTF-8. So what you end up with is a user-defined function

  /* Return the localization of a string whose original writing is not ASCII.
     MSGID_UTF8 is the real string, written in UTF-8 with octal or hexadecimal
     escape sequences.  MSGID_ASCII is a fallback written only with ASCII
     characters.  */
  const char *
  gettext_utf8 (const char *msgid_ascii, const char *msgid_utf8)
  {
    /* See whether there is a translation.  */
    const char *translation = gettext (msgid_ascii);

    if (translation == msgid_ascii)
      {
        /* Access a cache here.  A little homework.  */

        /* locale_charset, c_strcasecmp, xstr_iconv are defined in gnulib.  */
        const char *locale_code = locale_charset ();

        if (c_strcasecmp (locale_code, "UTF-8") == 0)
          translation = msgid_utf8;
        else
          {
  #if HAVE_ICONV
            const char *converted =
              xstr_iconv (msgid_utf8, "UTF-8", locale_code);

            if (converted == NULL)
              {
  # if (__GLIBC__ == 2 && __GLIBC_MINOR__ >= 2) || __GLIBC__ > 2 \
       || _LIBICONV_VERSION >= 0x0105
                size_t len = strlen (locale_code);
                char *locale_code_translit = XNMALLOC (len + 10 + 1, char);
                memcpy (locale_code_translit, locale_code, len);
                memcpy (locale_code_translit + len, "//TRANSLIT", 10 + 1);
                converted =
                  xstr_iconv (msgid_utf8, "UTF-8", locale_code_translit);
                free (locale_code_translit);
  # endif
                if (converted == NULL)
                  {
                    /* Use msgid_ascii as a fallback.  */
                    converted = msgid_ascii;
                  }
              }
            translation = converted;
  #endif /* HAVE_ICONV; without iconv, translation stays msgid_ascii.  */
          }

        /* Store the translation in the cache.  Homework part 2.  */
      }
    return translation;
  }

Finally, you make this function available to xgettext by putting this into po/Makevars:

  XGETTEXT_OPTIONS = \
    --keyword=gettext_utf8:1 --flag=gettext_utf8:1:pass-c-format
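
The cache left as "homework" in comment #4 could look roughly like the following. This is only a sketch under stated assumptions, not code from gettext or gnulib: the names cache_lookup and cache_store are hypothetical, keys are assumed to be string literals that outlive the cache, and the table is neither thread-safe nor invalidated when the locale changes.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

#define CACHE_BUCKETS 64

/* One cached conversion.  Assumption: MSGID points to a string literal,
   so the cache does not copy or free it.  */
struct cache_entry
{
  const char *msgid;            /* key, not owned */
  char *converted;              /* value, owned copy */
  struct cache_entry *next;     /* next entry in the same bucket */
};

static struct cache_entry *cache[CACHE_BUCKETS];

/* Simple multiplicative string hash, reduced to a bucket index.  */
static size_t
cache_hash (const char *s)
{
  size_t h = 5381;
  while (*s != '\0')
    h = h * 33 + (unsigned char) *s++;
  return h % CACHE_BUCKETS;
}

/* Return the cached conversion for MSGID, or NULL if there is none.  */
const char *
cache_lookup (const char *msgid)
{
  assert (msgid != NULL);
  for (struct cache_entry *e = cache[cache_hash (msgid)]; e != NULL;
       e = e->next)
    if (strcmp (e->msgid, msgid) == 0)
      return e->converted;
  return NULL;
}

/* Remember CONVERTED as the conversion of MSGID.  Returns the cached copy
   (an already existing entry wins), or NULL if memory is exhausted.  */
const char *
cache_store (const char *msgid, const char *converted)
{
  assert (msgid != NULL && converted != NULL);

  const char *existing = cache_lookup (msgid);
  if (existing != NULL)
    return existing;

  struct cache_entry *e = malloc (sizeof *e);
  char *copy = malloc (strlen (converted) + 1);
  if (e == NULL || copy == NULL)
    {
      free (e);
      free (copy);
      return NULL;
    }
  strcpy (copy, converted);

  size_t bucket = cache_hash (msgid);
  e->msgid = msgid;
  e->converted = copy;
  e->next = cache[bucket];
  cache[bucket] = e;
  return copy;
}
```

In gettext_utf8, cache_lookup (msgid_ascii) would go where the first "homework" comment is, and cache_store (msgid_ascii, converted) where the second one is, so that repeated calls for the same message return the same pointer instead of allocating a new string each time.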
