dc.contributor.author |
Rimkutė, Erika |
dc.date.accessioned |
2016-11-17T13:11:51Z |
dc.date.available |
2016-11-17T13:11:51Z |
dc.date.issued |
2016-11-17 |
dc.identifier.other |
http://hdl.handle.net/99999/9 |
dc.identifier.uri |
http://hdl.handle.net/20.500.11821/9 |
dc.description |
MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked)
Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%)
Wordform count: 1,641,263
Files: 92
Encoding: UTF-8
Tagset:
Human-readable (Lithuanian tags)
e.g. <word="liepos" lemma="liepa" type="dktv mot.gim vnsk K">
Date:
2014.08.06
Please use the following text to cite this item:
Rimkutė E., Daudaravičius V., Utka A. 2007: Morphological Annotation of the Lithuanian Corpus. Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics; Workshop Balto-Slavonic Natural Language Processing 2007, Prague, 94–99.
Licence:
CLARIN-LT ACA |
dc.description.sponsorship |
European Regional Development Fund Nr. VP2-3.1-IVPK-12-K Syntactic and Semantic Analysis System of the Lithuanian Language for Corpus, Internet, and Public Sector EUfunds |
dc.language.iso |
lt |
dc.publisher |
Vytautas Magnus University |
dc.rights |
ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT |
dc.rights.uri |
https://clarin.vdu.lt/licenses/eula/ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm |
dc.rights.label |
ACA |
dc.subject |
morphologically annotated |
dc.subject |
POS tagged |
dc.subject |
corpus |
dc.title |
Lithuanian morphologically annotated corpus - MATAS |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
hidden |
false |
hasMetadata |
false |
has.files |
yes |
branding |
CLARIN-LT |
contact.person |
Andrius Utka andrius.utka@vdu.lt Vytautas Magnus University |
size.info |
1641263 words |
size.info |
92 files |
files.size |
28128376 |
files.count |
1 |