This is a collection of parallel corpora collected by Hercules Dalianis and his research group for bilingual dictionary construction.
More information in: Hercules Dalianis, Hao-chun Xing, Xin Zhang: Creating a Reusable English-Chinese Parallel Corpus for Bilingual Dictionary Construction, In Proceedings of LREC2010 (source: http://people.dsv.su.se/~hercules/SEC/) and Konstantinos Charitakis (2007): Using Parallel Corpora to Create a Greek-English Dictionary with UPLUG, In Proceedings of NODALIDA 2007. Afrikaans-English: Aldin Draghoender and Mattias Kanhov: Creating a reusable English – Afrikaans parallel corpora for bilingual dictionary construction
Bottom-left triangle: download files
| Upper-right triangle: sample files
|
| language | files | tokens | sentences | af | el | en | zh |
|---|---|---|---|---|---|---|---|
| af | 1 | 0.4M | 63.1k | 52.1k | |||
| el | 1 | 0.2M | 8.5k | 7.0k | |||
| en | 3 | 0.7M | 73.9k | 57.4k | 8.2k | 2.2k | |
| zh | 1 | 61.5k | 2.2k | 2.2k |