What's New

corpus
corpus
Author(s):
Description:
Lithuanian Coreference Corpus The corpus is made out of 100 articles from news portals focusing on political news, as such texts are rich in quotations and named entity mentions. The corpus has been annotated with ASAS ...
 Šis įrašas turi 1 bylą (176.44 KB).
 
Publicly Available
corpus
corpus
Description:
274 460 word corpus comprised of selected primary and secondary law acts of the EU of the period 2015-2017. The corpus was compiled of documents containing words with the root "teis-" (en. law). All of the included ...
 Šis įrašas turi 1 bylą (424.19 KB).
 
Publicly Available
corpus
corpus
Description:
23.9 m word Lithuanian Parliament corpus is specially designed for authorship attribution task. The corpus consists of 111 thousand samples of speech transcripts by 147 parliamentarians in Lithuanian Seimas. It covers the ...
 Šis įrašas turi 4 bylas (1.72 GB).
 
Publicly Available

Most Viewed Items

Top Last Week
corpus
corpus
Description:
ALKSNIS v2 ALKSNIS v2 consists of 2,355 syntactically annotated sentences in the PML (Prague Mark-up Language) format. The format allows researchers to visualise and edit syntactic trees by the editor TrED (https://ufal ...
 Šis įrašas turi 1 bylą (1.79 MB).
 
Publicly Available
lexicalConceptualResource
lexicalConceptualResource
Author(s):
Description:
Wordlist of the Contemporary Corpus of Lithuanian language Corpus Structure Subcorpus Words,m Proportion Fiction 17,08 12,3% Non-fiction 22,09 15,9% Documents 13,54 9,7% Periodicals 85,80 61,7% Speech ...
 Šis įrašas turi 1 bylą (17.18 MB).
 
Publicly Available
lexicalConceptualResource
lexicalConceptualResource
Author(s):
Description:
The lemmatised wordlist of 1 m. word Lithuanian corpus. The structure of the tab delimited text file (dazninis.txt): Headword<TAB>Part of Speech<TAB>Wordform<TAB>Frequency of Occurrence. The data is the basis for "Frequency ...
 Šis įrašas turi 1 bylą (4.87 MB).
 
Publicly Available