Christmas Gospel text-to-speech in four Uralic languages, source

Christmas Gospel text-to-speech in four Uralic languages, source

Description

This resource is available for download in Kielipankki – the Language Bank of Finland. This resource consists of .txt and .wav files in four languages pertaining to the Finnish Christmas Gospel verses Luke 2. 1–20 The four languages include Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) and Olonets-Karelian (olo, aka Livvi), whose texts are all included in the Parallel Biblical Verses for Uralic Studies (PaBiVUS) corpus (LUK.2.1–20). The .wav files have been produced by Aleksei Ivanov, Niko Partanen and Jack Rueter, who have made use of the Facebook MMS (https://ai.meta.com/blog/multilingual-model-speech-recognition/) with language support for hundreds of languages (https://huggingface.co/facebook/mms-tts#supported-languages). While the Komi-Zyrian (kpv), Erzya (myv), Karelian (krl) are reported as having coverage for ASR, TTS and LID, Olonets-Karelian (olo, aka Livvi) is not mentioned at all. The Olonets-Karelian .wav has been produced using the krl TTS, as both language forms use writing systems with basically the same character to sound correlation.
Show more

Year of publication

2023

Authors

Raamatunkäännösinstituutti ry - Creator

Other information

Fields of science

Languages

Language

Karelian, Komi-Yodzyak language, Erzya language, Livvi-Karelian language, Kompane

Open access

Open

License

Creative Commons Attribution NonCommercial 4.0 International (CC BY NC 4.0)
Christmas Gospel text-to-speech in four Uralic languages, source - Research.fi