
 
dc.contributor.author Rimkutė, Erika
dc.contributor.author Bielinskienė, Agnė
dc.contributor.author Kovalevskaitė, Jolanta
dc.contributor.author Boizou, Loïc
dc.contributor.author Aleksandravičiūtė, Gabrielė
dc.contributor.author Brokaitė, Kristina
dc.contributor.author Utka, Andrius
dc.date.accessioned 2019-10-24T06:40:31Z
dc.date.available 2019-10-24T06:40:31Z
dc.date.issued 2019-10-24
dc.identifier.other http://hdl.handle.net/99999/10
dc.identifier.uri http://hdl.handle.net/20.500.11821/21
dc.description ALKSNIS v3.0. ALKSNIS v3.0 consists of 3,643 syntactically annotated sentences in the PML (Prague Markup Language) format. The format allows researchers to visualise and edit syntactic trees with the TrEd editor (https://ufal.mff.cuni.cz/tred/). Each node of a tree corresponds to a word, a punctuation mark or another text element (symbol, digit, etc.) within a sentence. The following information is given for each node: 1) the word form; 2) the lemma; 3) the morphological tag; and 4) the syntactic function (subject, object, etc.). Dependencies are shown by links between words.
Syntactically annotated sentences are corrected according to guidelines created by researchers at VMU CCL, following the rules of the Prague Dependency Treebank. All sentences are manually checked and corrected by a group of linguists. The TrEd editor and a style file (“antisDplus_schema”) are needed to view the files with the .pml extension.
ALKSNIS v3.0 was developed from v2 during the Vytautas Magnus University project “Semantika2” (Nr. 02.3.1-CPVA-V-527-01-0002).
Modifications from v2 to v3.0 (2019-07-08):
- The older version underwent a full review of its syntactic information, based on improved guidelines, to enhance annotation quality.
- New layer added: non-compositional multiword expressions (light verbs and idioms).
- New data added: scientific abstracts and reviews, additional administrative texts.
- Schema version changed to 3.0.
- The human-friendly Jablonskis tagset is used instead of the MULTEXT-East tagset.
- Some syntactic relations were corrected or modified (details to be published in the improved guidelines).
- CoNLL-U files are provided together with the PML files (the CoNLL-U files do not keep the mwe field).
Reference: Bielinskienė A., Boizou L., Kovalevskaitė J., Rimkutė E. 2016: Lithuanian Dependency Treebank ALKSNIS. Proceedings of the Seventh International Conference Baltic HLT 2016. Amsterdam: IOS Press, 107–114.
http://ebooks.iospress.nl/volumearticle/45523
dc.language.iso lit
dc.publisher Vytautas Magnus University
dc.rights PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT
dc.rights.uri https://clarin.vdu.lt/licenses/eula/PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm
dc.rights.label PUB
dc.source.uri https://clarin-lt.lt
dc.subject Lithuanian
dc.subject treebank
dc.subject syntactic analysis
dc.subject corpus
dc.title Lithuanian Treebank ALKSNIS (2019-10-24)
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding CLARIN-LT
contact.person Andrius Utka andrius.utka@vdu.gmail Vytautas Magnus University
sponsor EU Structural Funds 02.3.1-CPVA-V-527-01-0002 Information System of Syntactic-Semantic Analysis of Lithuanian Language: Development of Public Services (SEMANTIKA-2) euFunds
size.info 3643 sentences
files.size 2713010
files.count 1
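
The description above says that each node carries a form, a lemma, a morphological tag and a syntactic function, and that CoNLL-U files ship alongside the PML files. As a minimal sketch of reading such data, the following Python snippet parses text in the standard ten-column CoNLL-U layout. The sample sentence, its tags and its relation labels are illustrative placeholders, not taken from ALKSNIS, and the record does not state which tag column holds the Jablonskis tag, so both tag columns are kept.

```python
from typing import List, NamedTuple, Optional


class Token(NamedTuple):
    index: int            # column 1: position in the sentence
    form: str             # column 2: word form
    lemma: str            # column 3: lemma
    upos: str             # column 4: coarse tag
    xpos: str             # column 5: language-specific tag
    head: Optional[int]   # column 7: index of the governing token, 0 = root
    deprel: str           # column 8: syntactic function


def parse_conllu(text: str) -> List[List[Token]]:
    """Split CoNLL-U text into sentences, each a list of Token records."""
    sentences: List[List[Token]] = []
    current: List[Token] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip malformed lines, multiword ranges, empty nodes
        head = int(cols[6]) if cols[6].isdigit() else None
        current.append(
            Token(int(cols[0]), cols[1], cols[2], cols[3], cols[4], head, cols[7])
        )
    if current:
        sentences.append(current)
    return sentences


# Hypothetical sample, for illustration only (not from the corpus).
SAMPLE = (
    "# text = Jis skaito .\n"
    "1\tJis\tjis\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tskaito\tskaityti\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)

if __name__ == "__main__":
    for sentence in parse_conllu(SAMPLE):
        for tok in sentence:
            print(tok.index, tok.form, tok.lemma, tok.head, tok.deprel)
```

To use this on the corpus, the text of a CoNLL-U file extracted from the archive would be passed to `parse_conllu`; the file names inside the archive are not listed in this record.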


 Files in this item

This item is Publicly Available and licensed under: PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT
Name Alksnis-3.0.zip
Size 2.59 MB
Format application/zip
Description Unknown
