Corpus of Spoken Russian (UHLCS)

View resource name in all available languages

Puhutun venäjän kielen korpus (UHLCS)

russian-spoken-uhlcs

Persistent Identifier of this resource:

http://urn.fi/urn:nbn:fi:lb-2024052401

The corpus is available in Kielipankki - the Language Bank of Finland (puhti.csc.fi, access rights instructions: https://www.kielipankki.fi/access/).

Location: /appl/data/kielipankki/mrc-uhlcs/general-linguistics/indo-european-lgs/slavonic-lgs/russian/spoken/

The Corpus of Spoken Russian is originally prepared at the Language Institute of the Russian Academy of Science, Moscow. The versions in txt format have been constructed from the Original and .SGML files. The resulting valid UTF-8 files have Russian in Cyrillics. The corpus consists of approximately 100 texts and over 26309 tokens.

The corpus is a part of the Multilingual Resource Collection of the UHLCS.

UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).


The purpose of the resource use must be outlined in a research plan.

You don’t have the permission to edit this resource.