Bilingual Terminology Extraction
Samstag, Mai 26, 2007, 08:18 PM - Terminology Extraction
The bilingual terminology extraction has been extended with additional functions, esp. searching terms in the extracted term list. In addition terms can now be sorted in different orders (e.g. by score, by source term, target term, number). Extraction also has been heavily improved with reagrd to speed. A progress dialog is shown now too.
The tests done with the Europarl corpus have been continued for terms consiosting of several words und the minimum frequency has been lowered.
Europarl Corpus as test bed
Donnerstag, Mai 3, 2007, 05:37 PM - Araya
, Terminology Extraction
The Europarl Corpus is currently used to evaluate the bilingual term extractor. Europarl is a corpus of texts translated into various European languages based on meetings of the European Parliament.
Several of the extracted term files are available for download.
For more information see either: http://www.heartsome.de/de/europarl.php