What's New

corpus
Description:
MATAS corpus (version 1.0). Description: manually checked, morphologically annotated corpus MATAS. Formats: (1) CoNLL-U (CONLLU, conllu); (2) SketchEngine tab-delimited word per line (TAB-WPL, txt). Size: Wordform ...
 This item contains 3 files (32.95 MB).
 
Publicly Available
toolService
Description:
Colloc, a tool for automatic identification of multiword expressions (MWE), is freely available for online use at http://resursai.mwe.lt/atpazintuvas. As material for training DELFI.lt corpus (http://tekstynas.mwe.lt/) ...
 This item contains no files.
lexicalConceptualResource
Description:
The database of Lithuanian multiword expressions (MWE) contains bigram and trigram MWEs that occurred in the DELFI.lt corpus (http://tekstynas.mwe.lt/) at least 10 times. In the database diverse information about MWE there is ...
 This item contains no files.

Most Viewed Items

Top Last Week
corpus
Description:
The 23.9-million-word Lithuanian Parliament corpus is specially designed for the authorship attribution task. The corpus consists of 111 thousand samples of speech transcripts by 147 parliamentarians in the Lithuanian Seimas. It covers the ...
 This item contains 4 files (1.72 GB).
 
Publicly Available
corpus
Description:
ALKSNIS v3.0. ALKSNIS v3.0 consists of 3,643 syntactically annotated sentences in the PML (Prague Mark-up Language) format. The format allows researchers to visualise and edit syntactic trees with the editor TrED ...
 This item contains 1 file (2.59 MB).
 
Publicly Available
corpus
Description:
The Corpus of the Contemporary Lithuanian Language, which comprises 208 million words, is a collection of texts designed to represent current Lithuanian. The corpus has been compiled since 1990. The corpus is designed to ...
 This item contains no files.