New subject: Encoding confusion (was: Japanese, CJK and LaTeX)

12 Feb 2003

      Ludger Sicking wrote:
...
I mention it's "only" an encoding problem.
I want to use the "Wadalab-test.tex"-file as a base.
...
So I opened it in my emacs. My emacs couldn't fontify the kanji
well. Instead of kanji it displayed:
[... mojibake ...]
That's EUC-JP encoded Japanese, although your e-mail header doesn't
say so.

I wonder why your Emacs doesn't display it correctly. Both GNU Emacs
and XEmacs display Wadalab-test.tex correctly for me by default.
You use GNU Emacs, don't you?

Does the Japanese in the hello page look correct to you in GNU Emacs?
Try

   M-x view-hello-file

If that doesn't look right, you probably lack the most basic
Japanese fonts. Is xfntjp.rpm installed?

You can switch fontsets in Emacs with "Shift+Left-mouse-button", then
select "Fontset" and pick one of the offered fontsets.  The
"standard: 16-dot medium" fontset should always work, even if your
only Japanese fonts are those already included in the xf86.rpm.

What locale are you using?

You can force Emacs to load a file in a specific encoding, e.g. EUC-JP,
with the following key combination:

    C-x RET c euc-jp RET C-x C-f Wadalab-test.tex RET
...
I input my kanji with SKK. (I didn't find an SKK package on SuSE 8.1, so I
installed it from the SKK web page.... Is there such a package provided by
SuSE???)
SKK is included in the XEmacs packages for SuSE Linux. I haven't yet
made an SKK package to use with GNU Emacs on SuSE Linux (you are the
first one to ask ...)

Personally I use XEmacs with the native Canna interface to input
Japanese. For GNU Emacs there is the tamago.rpm package, which offers a
nice, direct interface to Canna. I think using Canna is easier than
SKK, but the choice of input method really is a matter of personal
preference.
...
And I saved the file in ISO-2022-JP-2.
Don't do that. Save it as euc-jp. If you want to use the Wadalab
PostScript fonts, you *must* save it as EUC-JP. In the
Wadalab-test.tex file you have for example:

    \begin{CJK*}[dnp]{JIS}{min}

And the documentation

    /usr/share/doc/packages/cjk-latex/doc/CJK.doc

clearly says:

CJK.doc>     \begin{CJK*}[<fontencoding>]{<encoding>}{<family>}
CJK.doc>     ...
CJK.doc>     \end{CJK*}
CJK.doc> 
CJK.doc>     are defined. The parameters have the following meaning:
CJK.doc> 
CJK.doc>     <encoding>      These character sets resp. encodings are currently
CJK.doc>                     implemented in CJK.enc:
CJK.doc>  
CJK.doc>     [...]
CJK.doc>                        JIS  (For Japanese.
CJK.doc>                               Character set: JIS X 0208:1997.
CJK.doc>                               Encoding: EUC.)

You see, you *must* use EUC-JP encoding if you use the {JIS}
parameter in the \begin{CJK*} command.
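
If you want to check a file before running latex on it, here is a
rough sketch in Python. It is not part of the CJK package, and the
function name looks_like_euc_jp is made up for illustration. It
assumes that any file containing ESC bytes is ISO-2022 encoded, and
otherwise just tries Python's built-in euc_jp codec:

```python
def looks_like_euc_jp(data: bytes) -> bool:
    """Heuristic check that `data` could be EUC-JP text.

    ISO-2022-JP is a 7-bit encoding that switches character sets with
    ESC sequences, so it would decode "successfully" as plain ASCII.
    The ESC byte is therefore rejected explicitly.
    """
    if b"\x1b" in data:
        return False
    try:
        data.decode("euc_jp")
    except UnicodeDecodeError:
        return False
    return True
```

For a real file you would read it in binary mode, e.g.
looks_like_euc_jp(open("Wadalab-test.tex", "rb").read()). A true
result only means the bytes are valid EUC-JP, not that they were
intended as EUC-JP.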
...
I processed it with latex (and sjislatex)
Forget sjislatex unless you use SJIS.
...
but there was the following message:
...
the latex command:
! Text line contains an invalid character.
l.20 light: {\fontseries{l}\selectfont ^^[
                                          $BC]Fb^^[(B}\ normal:
^^[$BC]Fb^^[...
[...]
...
So there is a problem with the encoding....
Yes. You get funny error messages like that when you don't use the
correct encoding. Use EUC-JP together with \begin{CJK*}[dnp]{JIS}{min}
and all is well.
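
The ^^[ in that error message is TeX's notation for the ESC character
(0x1B), which ISO-2022-JP uses to switch character sets. If re-saving
the file from Emacs is inconvenient, it can also be converted outside
the editor. A minimal sketch in Python, assuming the input really is
ISO-2022-JP-2 and using the standard iso2022_jp_2 and euc_jp codecs
(the function name is made up for illustration):

```python
def iso2022jp_to_eucjp(data: bytes) -> bytes:
    """Re-encode ISO-2022-JP(-2) bytes as EUC-JP.

    Raises UnicodeError if the input is not valid ISO-2022-JP-2 or
    contains characters that EUC-JP cannot represent.
    """
    text = data.decode("iso2022_jp_2")
    return text.encode("euc_jp")


# The fragment from the error message: ESC $ B switches to JIS X 0208,
# ESC ( B switches back to ASCII.
sample = b"\x1b$BC]Fb\x1b(B"
converted = iso2022jp_to_eucjp(sample)
print(converted.hex())  # no ESC bytes remain; every kanji byte has its high bit set
```

To convert a whole file, read it in binary mode, pass the bytes
through the function, and write the result back in binary mode.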
...
BTW: the latex command on the example files worked well. The output was fine
and I can see the kanjis. But I don't want to present a "hello world" as my
titlepage for my diploma thesis... ;-)
I didn't find an encoding like UTF8 for my emacs... (ok, it's my fault. it's
of course not the emacs given by SuSE 8.1... it has that encoding
possibility....)
If you use the GNU Emacs from SuSE 8.1, it already has the coding system
utf-8, but only for a very limited subset of Unicode, which does not
cover Japanese. To be able to use Japanese in UTF-8 as well, you need
to install the Mule-UCS.rpm package, an extension for GNU Emacs that
gives better Unicode coverage.

The XEmacs package on SuSE Linux 8.1 already includes Mule-UCS, but
you may need to activate it in your XEmacs profile (which is
~/.xemacs/init.el). Add the following line:

   (if (locate-library "un-define") (require 'un-define))

If you use the default ~/.xemacs/init.el file as distributed with SuSE
Linux 8.1, you already have that.

-- 
Mike Fabian      http://www.suse.de/~mfabian
Lack of sleep is the enemy of good work.
