PRINCIPLE Anonymized English-Irish DCHG parallel translation memory dataset 
- Government of Ireland for original material
- Dublin City University/ADAPT Centre for cleaning
Aligned parallel corpus based on translation memory data from the Department of Culture, Heritage, and the Gaeltacht in Ireland. The data originally came in aligned format, and was subsequently anonymized and cleaned.
Languages: English-Irish
Size: 64694 translation units
People who looked at this resource also viewed the following:
- English-Finnish parallel corpus from National Audit Office of Finland
- CEF Data Marketplace second multilingual benchmark for the evaluation of cleaning tools
- Bilingual corpus made out of PDF documents from the European Medicines Agency, (EMEA), https://www.ema.europa.eu, (February 2020) (EN-HR).
- Synthetic multilingual corpus of public services of Croatia
People who downloaded this resource also downloaded the following: