Hello List, i try to develop a programm with qt. For some messages in my programm i want to use html pages. Can someone explain me how to convert english or german html pages to kanna or kanji ? Regards Rolf
i try to develop a programm with qt. For some messages in my programm i want to use html pages. Can someone explain me how to convert english or german html
pages to kanna or kanji?
The English or German has to be translated into Japanese. Pretty obviously, you'll need one or more translators to do this. There are several ways of encoding Japanese. I don't think that any one of them has compelling advantages over any of the others. What's important is that you can't mix more than one on the same page. (However, it is easy to convert between them.) Decide which *one* you're going to use -- euc-jp, for example. It may not be necessary to specify the charset, but if you do specify the so-called "charset" (encoding system), make sure that the specification is correct. I suggest that you don't attempt to specify the font, other perhaps than in CSS by "serif" or "sans-serif" (which, by convention, have different but somewhat analogous meanings in Japanese). If you now specify lang="en", xml:lang="de" or whatever, in the <html> tag, change this to "ja". I think that's about it.
Peter Evans
i try to develop a programm with qt. For some messages in my programm i want to use html pages. Can someone explain me how to convert english or german html
pages to kanna or kanji?
The English or German has to be translated into Japanese. Pretty obviously, you'll need one or more translators to do this.
There are several ways of encoding Japanese. I don't think that any one of them has compelling advantages over any of the others.
UTF-8 would give you the advantage to use not only Japanese but also German Umlauts in the same page. This might be useful.
What's important is that you can't mix more than one on the same page. (However, it is easy to convert between them.) Decide which *one* you're going to use -- euc-jp, for example. It may not be necessary to specify the charset,
One should always specify the charset. Either one can specify it
directly in the html-page. See for example
http://directory.google.com/Top/World/Japanese/?oe=UTF-8
http://directory.google.com/Top/World/Japanese/?oe=EUC-JP
http://directory.google.com/Top/World/Japanese/?oe=Shift_JIS
http://directory.google.com/Top/World/Japanese/?oe=ISO-2022-JP
Or one can setup the web server to send the charset in the
html-header. One of these possibilities is enough, but you should
use at least one of them unless your pages contain only ASCII.
Actually Google does both, you can see that Google sends
the charset in the html-header as well like this:
mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=UTF-8
HTTP/1.0 200 OK
Date: Thu, 03 Jul 2003 16:37:00 GMT
Server: Apache/1.3.26 (Unix)
Content-Length: 5759
Content-Type: text/html; charset=UTF-8
X-Cache: MISS from proxy.suse.de
Proxy-Connection: close
mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=EUC-JP
HTTP/1.0 200 OK
Date: Thu, 03 Jul 2003 16:37:09 GMT
Server: Apache/1.3.26 (Unix)
Content-Length: 5572
Content-Type: text/html; charset=EUC-JP
X-Cache: MISS from proxy.suse.de
Proxy-Connection: close
mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=Shift_JIS
HTTP/1.0 200 OK
Date: Thu, 03 Jul 2003 16:37:14 GMT
Server: Apache/1.3.26 (Unix)
Content-Length: 5575
Content-Type: text/html; charset=Shift_JIS
X-Cache: MISS from proxy.suse.de
Proxy-Connection: close
mfabian@magellan:~$ w3m -dump_head http://directory.google.com/Top/World/Japanese/?oe=ISO-2022-JP
HTTP/1.0 200 OK
Date: Thu, 03 Jul 2003 16:38:07 GMT
Server: Apache/1.3.26 (Unix)
Content-Length: 5828
Content-Type: text/html; charset=ISO-2022-JP
X-Cache: MISS from proxy.suse.de
Proxy-Connection: close
mfabian@magellan:~$
--
Mike Fabian
participants (3)
-
Mike FABIAN
-
Peter Evans
-
Rolf Lindenbring