The Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp
View resource name in all available languages
Kansalliskirjaston sanoma- ja aikakauslehtikokoelman suomenkielinen osakorpus versio 2, Korp
klk-fi-v2-korp
Persistent Identifier of this resource:
http://urn.fi/urn:nbn:fi:lb-202009152
Access location:
This resource is available via Korp in Kielipankki – the Language Bank of Finland.
The corpus consists of Finnish newspapers and magazines starting from 1771 up to 2021, compiled by the National Library of Finland.
For this new version, the data of the previous version (Finnish and Swedish) was checked with the HeLI-OTS language identifier. Parts of texts, which do not contain Finnish, were removed from this corpus. On the other hand, texts from the Swedish part of KLK, which contain Finnish, where added to this corpus.
The new version consists of text elements, where at least one sentence element was identified as being in Finnish, from these three sources:
- KLK-fi, version 1 (http://urn.fi/urn:nbn:fi:lb-2016050302)
- KLK-sv, version 1 (http://urn.fi/urn:nbn:fi:lb-2016050301)
- new data from the National Library (not previously available in the Language Bank, may cover any time period, just more recently OCR'd)
The text elements are enriched with a 'version_added' attribute, which identifies the source.
For a listing of the newspapers and magazines contained in this resource, please see the Documentation.
The corpus consists of Finnish newspapers and magazines starting from 1771 up to 2021, compiled by the National Library of Finland.
For this new version, the data of the previous version (Finnish and Swedish) was checked with the HeLI-OTS language identifier. Parts of texts, which do not contain Finnish, were removed from this corpus. On the other hand, texts from the Swedish part of KLK, which contain Finnish, where added to this corpus.
The new version consists of text elements, where at least one sentence element was identified as being in Finnish, from these three sources:
- KLK-fi, version 1 (http://urn.fi/urn:nbn:fi:lb-2016050302)
- KLK-sv, version 1 (http://urn.fi/urn:nbn:fi:lb-2016050301)
- new data from the National Library (not previously available in the Language Bank, may cover any time period, just more recently OCR'd)
The text elements are enriched with a 'version_added' attribute, which identifies the source.
For a listing of the newspapers and magazines contained in this resource, please see the Documentation.
- TDT alpha
People who looked at this resource also viewed the following:
- The Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland, Kielipankki Version
- The Swedish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland, Kielipankki Version
- The Newspaper and Periodical Corpus of the National Library of Finland, Swedish sub-corpus, 1771–1879, VRT
- The Newspaper and Periodical OCR Corpus of the National Library of Finland (1771-1874)