Bilingual (EN, AL) corpus v.1.02 based on WikiMatrix 
Bilinugal dataset (EN, AL) based on the WikiMatrix coprus which is constructed as described in "WikiMatrix: Mining 135M Parallel Sentences in 1620 Language Pairs from Wikipedia". It was filtered with the purpose of removing TUs with limited or no use. It includes 401501 Translation Units.
People who looked at this resource also viewed the following:
- Bilingual (EN, AL) corpus v.1.05 based on WikiMatrix
- Compilation of German-Portuguese parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
- COVID-19 EUR-LEX dataset. Βilingual (EN-ET)
- Compilation of French-Latvian parallel corpora resources used for training of NTEU Machine Translation engines. Tier 3.
People who downloaded this resource also downloaded the following: