dc.contributor.author Vaičenonienė, Jurgita
dc.contributor.author Kovalevskaitė, Jolanta
dc.contributor.author Boizou, Loïc
dc.date.accessioned 2020-09-30T17:17:09Z
dc.date.available 2020-09-30T17:17:09Z
dc.date.issued 2020-09-30
dc.identifier.uri http://hdl.handle.net/20.500.11821/40
dc.description ORVELIT v3 (Lith. Originalios ir Vertimų Lietuvių Kalbos Tekstynas) is a comparable monolingual corpus of original and translated Lithuanian. It consists of four sub-corpora of original and translated fiction and popular science literature (approx. 1m words each). Detailed information on the composition and on the lexical and morphological features of the raw (ORVELIT v1) and morphologically annotated (ORVELIT v2) versions of the corpus can be found in: Vaičenonienė, Jurgita, Kovalevskaitė, Jolanta, and Ringailienė, Teresė. 2017. Tekstynais paremti vertimų kalbos tyrimai ir šaltiniai. Kalbų studijos / Studies about Languages, Nr. 30, pp. 42-55. https://www.vdu.lt/cris/handle/20.500.12259/56648?mode=simple ; Vaičenonienė, Jurgita, and Kovalevskaitė, Jolanta. 2019. Leksinės ir morfologinės vertimų kalbos ypatybės. Darnioji daugiakalbystė / Sustainable Multilingualism, Nr. 14, pp. 208-235. https://www.vdu.lt/cris/handle/20.500.12259/98861 . ORVELIT v3 was produced by deleting the title, content, bibliographical lists, indexes and author(s) of the texts and by mixing the individual texts at paragraph level. Places where other information was deleted are marked with <del>. The corpus encoding is UTF-8. ORVELIT v3 includes raw (ORVELIT v3_raw) and morphologically annotated (ORVELIT v3_annotated) versions of the corpus. The morphological annotation was produced automatically with the Semantika.lt analyser.
dc.language.iso lit
dc.publisher Vytautas Magnus University
dc.rights ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT
dc.rights.uri https://clarin.vdu.lt/licenses/eula/ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm
dc.rights.label ACA
dc.subject corpus
dc.subject comparable corpus
dc.subject Lithuanian
dc.title ORVELIT v3
dc.type corpus
metashare.ResourceInfo#ContentInfo.mediaType text
hidden false
hasMetadata false
has.files yes
branding CLARIN-LT
contact.person Jurgita Vaičenonienė jurgita.vaicenoniene@vdu.lt Vytautas Magnus University
sponsor EU Structural Funds 02.3.1-CPVA-V-527-01-0002 Information System of Syntactic-Semantic Analysis of Lithuanian Language: Development of Public Services euFunds
sponsor Ministry of Education and Science MTI-02/2015 Lithuania’s Membership in the International Research Infrastructure - CLARIN ERIC nationalFunds
size.info 3998484 tokens
files.size 47411041
files.count 2


Files in this item (45.21 MB in total)

This item is Academic Use and licensed under: ACA_CLARIN-LT_End-User-Licence-Agreement_EN-LT
Licence conditions: Attribution Required, Noncommercial
Name                      Size      Format           Description
ORVELIT_v3_ANNOTATED.zip  36.62 MB  application/zip  zip archive
ORVELIT_v3_RAW.7z         8.6 MB    Unknown          7-zip archive

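The description states that the corpus is UTF-8 encoded and that deleted information is marked with <del>. A minimal Python sketch of how a user might read an extracted raw text file and count tokens while setting those markers aside is given below. It assumes that <del> appears as a literal standalone placeholder and that the extracted files are plain text; the internal layout of the archives is not described in this record. The function names and the file path are hypothetical. Whitespace splitting is only a rough approximation and is not expected to reproduce the 3998484 tokens reported in size.info.

```python
from typing import Tuple

# Literal marker named in the record's description; treating it as a
# standalone placeholder is an assumption.
DEL_MARKER = "<del>"


def read_corpus_file(path: str) -> str:
    """Read one extracted corpus file as UTF-8 (encoding stated in the record)."""
    with open(path, encoding="utf-8") as fh:
        return fh.read()


def count_tokens(text: str) -> Tuple[int, int]:
    """Return (whitespace-delimited tokens excluding markers, number of markers)."""
    n_del = text.count(DEL_MARKER)
    # Replace each marker with a space so neighbouring words stay separate.
    cleaned = text.replace(DEL_MARKER, " ")
    return len(cleaned.split()), n_del


if __name__ == "__main__":
    sample = "Vienas du <del> trys"
    tokens, markers = count_tokens(sample)
    print(tokens, markers)
```

For a real file, the text would come from read_corpus_file("some_extracted_file.txt"), where the path is a placeholder for whatever the archive actually contains.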