Hello List, i try to develop a programm with qt. For some messages in my programm i want to use html pages. Can someone explain me how to convert english or german html pages to kanna or kanji ? Regards Rolf
i try to develop a programm with qt. For some messages in my programm i want to use html pages. Can someone explain me how to convert english or german html
pages to kanna or kanji?
The English or German has to be translated into Japanese. Pretty obviously, you'll need one or more translators to do this. There are several ways of encoding Japanese. I don't think that any one of them has compelling advantages over any of the others. What's important is that you can't mix more than one on the same page. (However, it is easy to convert between them.) Decide which *one* you're going to use -- euc-jp, for example. It may not be necessary to specify the charset, but if you do specify the so-called "charset" (encoding system), make sure that the specification is correct. I suggest that you don't attempt to specify the font, other perhaps than in CSS by "serif" or "sans-serif" (which, by convention, have different but somewhat analogous meanings in Japanese). If you now specify lang="en", xml:lang="de" or whatever, in the <html> tag, change this to "ja". I think that's about it.
Peter Evans <peter@despammed.com> さんは書きました:
i try to develop a programm with qt. For some messages in my programm i want to use html pages. Can someone explain me how to convert english or german html
pages to kanna or kanji?
The English or German has to be translated into Japanese. Pretty obviously, you'll need one or more translators to do this.
There are several ways of encoding Japanese. I don't think that any one of them has compelling advantages over any of the others.
UTF-8 would give you the advantage to use not only Japanese but also German Umlauts in the same page. This might be useful.
What's important is that you can't mix more than one on the same page. (However, it is easy to convert between them.) Decide which *one* you're going to use -- euc-jp, for example. It may not be necessary to specify the charset,
One should always specify the charset. Either one can specify it directly in the html-page. See for example http://directory.google.com/Top/World/Japanese/?oe=UTF-8 http://directory.google.com/Top/World/Japanese/?oe=EUC-JP http://directory.google.com/Top/World/Japanese/?oe=Shift_JIS http://directory.google.com/Top/World/Japanese/?oe=ISO-2022-JP Or one can setup the web server to send the charset in the html-header. One of these possibilities is enough, but you should use at least one of them unless your pages contain only ASCII. Actually Google does both, you can see that Google sends the charset in the html-header as well like this: mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=UTF-8 HTTP/1.0 200 OK Date: Thu, 03 Jul 2003 16:37:00 GMT Server: Apache/1.3.26 (Unix) Content-Length: 5759 Content-Type: text/html; charset=UTF-8 X-Cache: MISS from proxy.suse.de Proxy-Connection: close mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=EUC-JP HTTP/1.0 200 OK Date: Thu, 03 Jul 2003 16:37:09 GMT Server: Apache/1.3.26 (Unix) Content-Length: 5572 Content-Type: text/html; charset=EUC-JP X-Cache: MISS from proxy.suse.de Proxy-Connection: close mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=Shift_JIS HTTP/1.0 200 OK Date: Thu, 03 Jul 2003 16:37:14 GMT Server: Apache/1.3.26 (Unix) Content-Length: 5575 Content-Type: text/html; charset=Shift_JIS X-Cache: MISS from proxy.suse.de Proxy-Connection: close mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=ISO-2022-JP HTTP/1.0 200 OK Date: Thu, 03 Jul 2003 16:38:07 GMT Server: Apache/1.3.26 (Unix) Content-Length: 5828 Content-Type: text/html; charset=ISO-2022-JP X-Cache: MISS from proxy.suse.de Proxy-Connection: close mfabian@magellan:~$ -- Mike Fabian <mfabian@suse.de> http://www.suse.de/~mfabian 睡眠不足はいい仕事の敵だ。
participants (3)
-
Mike FABIAN
-
Peter Evans
-
Rolf Lindenbring