The Corpus of Beserman Udmurt, Kielipankki Version
Description
The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi) at http://urn.fi/urn:nbn:fi:lb-2016092601
The Corpus of Beserman Udmurt comprises 65 000 tokens. The Beserman dialect of Udmurt is used in daily communication approximately by 2 000 speakers (according to the 2010 census). The Beserman live in the basin of the Cheptsa river in the Republic of Udmurtia and in the Kirov Oblast of the Russian Federation. In the scientific literature Beserman is considered to be a dialect of the Udmurt language which is characterized by an unusual combination of specifically Beserman phenomena (concentrated in vocabulary and phonetics) with certain traits of Northern and Southern Udmurt dialects, mostly morphological and phonological. The dialect remains the main means of everyday communication in Beserman villages, at least for the older generation.
The texts contained in the corpus have been collected in the villages of Shamardan (109 texts of 117), Vortsa (4 of 117), Malaya Yunda (1 of 117) and Zhuvam (3 of 117) in the Republic of Udmurtia in the years 2003-2015. There are 33 informants in total. The texts have been recorded, transcribed and grammatically annotated in the SIL FieldWorks software. The corpus contains narratives, life stories, dialogues, recipes, and recordings of psycholinguistic experiments. Each sentence is provided with interlinear glossing (according to the Leipzig Glossing Rules) and translation. Both the full text version with audio files and the corpus version are available at http://beserman.ru/corpus/search/?interface_language=en
Show moreYear of publication
2018
Type of data
Authors
Lomonosov Moscow State University
Maria Usacheva - Creator
National Research University Higher School of Economics
Timofey Arkhangelskiy - Creator
Russian State University for the Humanities
Natalia Serdobolskaya - Curator, Publisher, Creator
University of Helsinki - Curator
Project
Other information
Fields of science
Languages
Language
Udmurt language
Open access
Open