Archive for February 6th, 2010

Tanaka Corpus available in Felix TM and TMX formats

Feb. 6th 2010

I converted the Tanaka Corpus of aligned Japanese and English sentences into Felix translation memory (TM) and TMX formats.

The Tanaka Corpus is a collection of around 150,000 Japanese-English sentence translation pairs, compiled over several years by university students, with later cleanup and correction by Jim Breen and his colleagues.
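For anyone curious what a conversion like this involves, here is a minimal sketch of writing sentence pairs out as TMX 1.4 using Python's standard library. It is not the script used to produce the downloads. The function name `pairs_to_tmx`, the `creationtool` value, and the sample sentence pair are all made up for illustration, and parsing of the corpus source file is left out entirely.

```python
import xml.etree.ElementTree as ET

# ElementTree serializes this qualified name as the xml:lang attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def pairs_to_tmx(pairs, src_lang="ja", tgt_lang="en"):
    """Build a TMX 1.4 document from (source, target) sentence pairs.

    Hypothetical helper for illustration only.
    """
    tmx = ET.Element("tmx", version="1.4")
    # Header attributes required by TMX 1.4; the tool name and version
    # are placeholders, not those of any real converter.
    ET.SubElement(tmx, "header", {
        "creationtool": "tanaka-sketch",
        "creationtoolversion": "0.1",
        "segtype": "sentence",
        "o-tmf": "plaintext",
        "adminlang": "en",
        "srclang": src_lang,
        "datatype": "plaintext",
    })
    body = ET.SubElement(tmx, "body")
    for source, target in pairs:
        # One translation unit per pair, with one variant per language.
        tu = ET.SubElement(body, "tu")
        for lang, text in ((src_lang, source), (tgt_lang, target)):
            tuv = ET.SubElement(tu, "tuv", {XML_LANG: lang})
            ET.SubElement(tuv, "seg").text = text
    return ET.tostring(tmx, encoding="unicode")


if __name__ == "__main__":
    # Made-up example pair, not taken from the corpus.
    print(pairs_to_tmx([("これは本です。", "This is a book.")]))
```

ElementTree escapes special characters in the segment text, so sentences containing `&` or `<` need no extra handling. A real conversion would also write an XML declaration and stream the output rather than building all 150,000 units in memory.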

Download the Felix/TMX versions of the Tanaka Corpus here.

Posted by Ryan Ginstrom in Felix, resources