OPUS Wikimedia Parallel Corpus

Description

Translations published by the Wikimedia Foundation through its article translation system. The parallel data sets are available at https://dumps.wikimedia.org/other/contenttranslation/

Year of publication

2024

Type of data

Authors

Jörg Tiedemann - Curator, Creator, Publisher

Project

Other information

Fields of science

Computer and information sciences; Languages

Language

Multiple languages

Open access

Open

License

Creative Commons Attribution 4.0 International (CC BY 4.0)

Keywords

natural language processing, machine translation, parallel corpus

Subject headings

Temporal coverage

Related to this research data