Converting Japanese filenames to UTF-8
I did the right tthing and backed up my SuSE 8.2 Pro system to an external 160GB USB2.0 drive formatted with reiserfs. My SuSE 8.2 Pro system was installed in Japanese. I took my copy of SuSE 9.1 Pro and installed it. I plugged in the USB2.0 drive, it mounted read-only, and proceeded to try to recover my home directory only to find all of the Japanese filenames were 100% unreadable-scramble (文字化け!!!). I realize that SuSE 9.1 uses UTF-8 -- and I am really happy about that -- but I need to recover the files from this backup. I now have filenames that used to be "第8課のノート.txt" that now look like "□□機■□.txt" or "?????????.txt" or any other unreadable malarky. I did look into the utility *convmv* and tried it using a variety of source encodings -- jis, jis7, sjis, shift_jis, iso-2022-jp. None worked. :( Any ideas? I am sure UTF-8 will be much better, but I really need to be able to restore from this archive first. Perhaps part of my problem is that I have no idea what the source encoding would have been on the SuSE 8.2 Pro system. I did do the entire installation in Japanese, so it kept the default for Japanese, I imagine. I really want this new SuSE 9.1 environment to work. -- David Nettles web: http://www.miteyo.org email: tetsuoni3000@yahoo.co.jp __________________________________________________ GANBARE! NIPPON! Yahoo! JAPAN JOC OFFICIAL INTERNET PORTAL SITE http://mail.ganbare-nippon.yahoo.co.jp/
David Nettles <tetsuoni3000@yahoo.co.jp> さんは書きました:
I did look into the utility *convmv* and tried it using a variety of source encodings -- jis, jis7, sjis, shift_jis, iso-2022-jp. None worked. :(
convmv -f EUC-JP -t UTF-8 *
Any ideas? I am sure UTF-8 will be much better, but I really need to be able to restore from this archive first.
Perhaps part of my problem is that I have no idea what the source encoding would have been on the SuSE 8.2 Pro system. I did do the entire installation in Japanese, so it kept the default for Japanese, I imagine.
Then the source encoding to use with convmv should be EUC-JP.
I really want this new SuSE 9.1 environment to work.
-- Mike FABIAN <mfabian@suse.de> http://www.suse.de/~mfabian 睡眠不足はいい仕事の敵だ。
Mike FABIAN wrote:
David Nettles <tetsuoni3000@yahoo.co.jp> さんは書きました:
I did look into the utility *convmv* and tried it using a variety of source encodings -- jis, jis7, sjis, shift_jis, iso-2022-jp. None worked. :(
convmv -f EUC-JP -t UTF-8 *
EXCELLENT!!!!!!! Now the 2nd questions: I have a bunch of *.TXT files full of text in Japanese using the old encoding (now, I imagine that this is also EUC-JP)... what tool could I use to convert these? I believe that once this is finished I'll be 100% done and ready to go. :-D -- David Nettles web: http://www.miteyo.org email: tetsuoni3000@yahoo.co.jp __________________________________________________ GANBARE! NIPPON! Yahoo! JAPAN JOC OFFICIAL INTERNET PORTAL SITE http://mail.ganbare-nippon.yahoo.co.jp/
David Nettles <tetsuoni3000@yahoo.co.jp> さんは書きました:
Now the 2nd questions: I have a bunch of *.TXT files full of text in Japanese using the old encoding (now, I imagine that this is also EUC-JP)
Most likely.
... what tool could I use to convert these?
http://www.suse.de/~mfabian/suse-cjk/encodings-file-contents.html For example: iconv -f EUC-JP -t UTF-8 < old-file > new-file or recode EUC-JP..UTF-8 file -- Mike FABIAN <mfabian@suse.de> http://www.suse.de/~mfabian 睡眠不足はいい仕事の敵だ。
Mike FABIAN wrote:
David Nettles <tetsuoni3000@yahoo.co.jp> さんは書きました:
Now the 2nd questions: I have a bunch of *.TXT files full of text in Japanese using the old encoding (now, I imagine that this is also EUC-JP)
Most likely.
... what tool could I use to convert these?
http://www.suse.de/~mfabian/suse-cjk/encodings-file-contents.html
For example:
iconv -f EUC-JP -t UTF-8 < old-file > new-file
or
recode EUC-JP..UTF-8 file
Thanks... I'll give them a try. :) -- David Nettles web: http://www.miteyo.org email: tetsuoni3000@yahoo.co.jp
participants (2)
-
David Nettles
-
Mike FABIAN