dc.contributor.author |
Utka, Andrius |
dc.date.accessioned |
2016-11-19T21:05:01Z |
dc.date.available |
2016-11-19T21:05:01Z |
dc.date.issued |
2018-01-18 |
dc.identifier.uri |
http://hdl.handle.net/20.500.11821/12 |
dc.description |
The lemmatised wordlist of 1 m. word Lithuanian corpus. The structure of the tab delimited text file (dazninis.txt):
Headword<TAB>Part of Speech<TAB>Wordform<TAB>Frequency of Occurrence.
The data is the basis for "Frequency Dictionary of Written Lithuanian - based on 1m word morphologically annotated corpus" (A_Utka-Dazninis_zodynas.pdf).
Reference:
Utka. A. 2009. Dažninis rašytinis lietuvių kalbos žodynas: 1 milijono žodžių morfologiškai anotuoto
tekstyno pagrindu. Kaunas: VDU leidykla, ISBN 978-9955-12-546-4 |
dc.language.iso |
lit |
dc.publisher |
Vytautas Magnus University |
dc.rights |
PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT |
dc.rights.uri |
https://clarin.vdu.lt/licenses/eula/PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm |
dc.rights.label |
PUB |
dc.subject |
wordlist |
dc.subject |
Lithuanian |
dc.title |
Lemmatised Wordlist of 1 m. Corpus of Contemporary Lithuanian |
dc.type |
lexicalConceptualResource |
metashare.ResourceInfo#ContentInfo.detailedType |
wordList |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
hidden |
false |
hasMetadata |
false |
has.files |
yes |
branding |
CLARIN-LT |
contact.person |
Andrius Utka andrius.utka@vdu.gmail Vytautas Magnus University |
size.info |
128,058 entries |
files.size |
5106257 |
files.count |
1 |