New subject: [m17n] What is wadokujt for? and: Some Scripts for Chinese and Japanese learning and Noxon Audio

19 May 2005

      In SuSE 9.3, there's a package wadokujt with a Japanese 
German dictionary. But it comes with no Program, only data 
is contained in the package.

What programs in the SuSE distribution are thought to be 
used with wadokujt?

KWordQuiz, KVTML:

I use it together with KWordQuiz, but for this, I need to 
convert the data to kvtml. That's why I wrote 
wadokujt2kvtml.pl.

There is another very similar dictionary file, named CEDICT 
for Chinese English translation. Unfortunately that does 
not come with SuSE. That file can be converted to kvtml 
using cedict2kvtml.pl.

Noxon and Twonkyvision:

All my CDs are stored in MP3 files, served via UPnP using 
Twonkyvision and played by a Noxon audio device. This works 
fine, but UTF-8 is not handled correctly. All German 
Umlauts and all Chinese music files are unreadable.

Fortunately, on the unicode.org page, there's a file 
Unihan.txt, which defines the Chinese Unicode mapping, 
including the Mandarin PinYin translation. My script 
create-mapping.sh downloads this file and extracts the 
PinYin mapping I am interested in. The output from there is 
then used by utf8-to-ascii.pl, which converts all Chinese 
characters, German umlauts, French, Spanish, Italian 
accents to plain 7bit ASCII.

I use this script in create-mp3-ascii-dir.sh (which is only 
an example and works only on my system), to create links to 
all my MP3 filenames, which are then 7bit ASCII and display 
well on Noxon.

Another possible application: If you also consider the 
PinYin tones in utf8-to-ascii.pl (as they are considered in 
cedict2kvtml.pl), then you could easily build a Chinese 
text to speech synthesizer! - Or semi-automated 
translation: Translate the individual charaters to English 
using Unihan.txt.

Is this also useful to others? Shall I make a webpage 
containing this information and scripts?

Regards
Marc

What is wadokujt for? and: Some Scripts for Chinese and Japanese learning and Noxon Audio

Marc Waeckerlin

Mike FABIAN

Marc Waeckerlin

Mike FABIAN

Mike FABIAN

tags

participants (2)