English-Albanian corpus from websites of national Agencies v.1.0 
Bilingual dataset (EN-SQ) based on the content of websites of national agencies. It includes 84747 Translation Units. It was generated by crawling the websites in January 2021, detecting pairs of parallel documents, identifying parallel sentence pairs and filtering the results.
People who looked at this resource also viewed the following:
- SciPar: A collection of parallel corpora from scientific abstracts (v. 2021) in TMX format.
- COVID-19 Government of Canada dataset v1. Bilingual (EN-PL)
- Bilingual corpus made out of PDF documents from the European Medicines Agency, (EMEA), https://www.ema.europa.eu, (February 2020) (EN-CS).
- Bilingual corpus made out of PDF documents from the European Medicines Agency, (EMEA), https://www.ema.europa.eu, (February 2020) (EN-PL).