The Corpus of Beserman Udmurt, Kielipankki Version

Description

The corpus is available in Kielipankki - the Language Bank of Finland (korp.csc.fi) at http://urn.fi/urn:nbn:fi:lb-2016092601 The Corpus of Beserman Udmurt comprises 65 000 tokens. The Beserman dialect of Udmurt is used in daily communication approximately by 2 000 speakers (according to the 2010 census). The Beserman live in the basin of the Cheptsa river in the Republic of Udmurtia and in the Kirov Oblast of the Russian Federation. In the scientific literature Beserman is considered to be a dialect of the Udmurt language which is characterized by an unusual combination of specifically Beserman phenomena (concentrated in vocabulary and phonetics) with certain traits of Northern and Southern Udmurt dialects, mostly morphological and phonological. The dialect remains the main means of everyday communication in Beserman villages, at least for the older generation. The texts contained in the corpus have been collected in the villages of Shamardan (109 texts of 117), Vortsa (4 of 117), Malaya Yunda (1 of 117) and Zhuvam (3 of 117) in the Republic of Udmurtia in the years 2003-2015. There are 33 informants in total. The texts have been recorded, transcribed and grammatically annotated in the SIL FieldWorks software. The corpus contains narratives, life stories, dialogues, recipes, and recordings of psycholinguistic experiments. Each sentence is provided with interlinear glossing (according to the Leipzig Glossing Rules) and translation. Both the full text version with audio files and the corpus version are available at http://beserman.ru/corpus/search/?interface_language=en
Show more

Year of publication

2018

Type of data

Authors

Lomonosov Moscow State University

Maria Usacheva - Creator

National Research University Higher School of Economics

Timofey Arkhangelskiy - Creator

Russian State University for the Humanities

Natalia Serdobolskaya - Curator, Publisher, Creator

University of Helsinki - Curator

Project

Other information

Fields of science

Languages

Language

Udmurt language

Open access

Open

License

Creative Commons Attribution 4.0 International (CC BY 4.0)

Keywords

Subject headings

Temporal coverage

undefined

Related to this research data