Felix resources

Tanaka Corpus available in Felix TM and TMX formats

I converted the Tanaka Corpus of aligned Japanese and English sentences into Felix translation memory (TM) and TMX formats.

The Tanaka Corpus is a collection of around 150,000 Japanese-English sentence translation pairs, compiled over several years by university students, with later cleanup and correction by Jim Breen and his colleagues.

Download the Felix/TMX versions of the Tanaka Corpus here.


New Felix resource added: TM and glossary of legal terms (J-E)

I’ve converted the “Standard Bilingual Dictionary” into a Felix translation memory (TM) and glossary, and posted them to the Felix website:

Felix TM and glossary of Japanese-English legal terms

These should be of use to anyone who has to translate Japanese laws into English.

About the Standard Bilingual Dictionary

The Standard Bilingual Dictionary is a glossary of official translations of terms from Japanese law. It’s part of a major effort by the Japanese government to translate its laws into English.


EDICT dictionary files available as Felix glossaries

I’ve converted the EDICT and ENAMDICT dictionary files created by Jim Breen into Felix format. The converted glossary files are available from the Felix Website.

The EDICT file is multilingual (Japanese/English/French/German/Russian), and I’ve converted it into 20 Felix glossaries representing each language combination. Of course, since Japanese is the central language, language pairs that don’t have Japanese as the source or translation language may be less useful.

The ENAMDICT file is a dictionary of proper names. All together the file was humungous, so I broke it into several smaller glossaries by category (personal names, place names, organizations, and so on). Of course, you’re free to load them all up into your Felix glossary window, since the number of glossaries you can have open is only limited by how much memory your computer has.