Corpus of Spoken Russian (UHLCS)
View resource name in all available languages
Puhutun venäjän kielen korpus (UHLCS)
russian-spoken-uhlcs
Persistent Identifier of this resource:
http://urn.fi/urn:nbn:fi:lb-2024052401
The corpus is available in Kielipankki - the Language Bank of Finland (puhti.csc.fi, access rights instructions: https://www.kielipankki.fi/access/).
Location: /appl/data/kielipankki/mrc-uhlcs/general-linguistics/indo-european-lgs/slavonic-lgs/russian/spoken/
The Corpus of Spoken Russian is originally prepared at the Language Institute of the Russian Academy of Science, Moscow. The versions in txt format have been constructed from the Original and .SGML files. The resulting valid UTF-8 files have Russian in Cyrillics. The corpus consists of approximately 100 texts and over 26309 tokens.
The corpus is a part of the Multilingual Resource Collection of the UHLCS.
UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).
The purpose of the resource use must be outlined in a research plan.
People who looked at this resource also viewed the following: