79,000 Chinese-English, French, German, Italian, Japanese, and Spanish sentences

Hi lamington,

you’re welcome! That‘s exactly it, sorry about that; I will upload a corrected file in about two hours. Now you could either delete the category or select Undo last import in Import Flashcards to correct the error.

Cheers,

Shun
 
Hi leguan,

many thanks! :) One could also use a good dictionary and check how many of the words in the sentence of one language match up with a word in the corresponding sentence of the other language, then divide that number by the sentence length. Maybe we could try this with the CC-CEDICT or HanDeDict once. This should allow us to spot mismatched sentences. We could try removing the last 1-3 letters in this comparison to account for word inflections. But to use TensorFlow would, of course, be more on the cutting edge.

In any case, it's good to see that Tatoeba ranks highest in sentence quality of all his sources.

Cheers,

Shun
 
Top