What's New

corpus
Description:
Lithuanian Coreference Corpus. The corpus consists of 100 articles from news portals, focusing on political news, as such texts are rich in quotations and named entity mentions. The corpus has been annotated with ASAS ...
 This item contains 1 file (176.44 KB).
 
Publicly Available
corpus
Description:
A 274,460-word corpus comprising selected primary and secondary EU legal acts from the period 2015-2017. The corpus was compiled from documents containing words with the root "teis-" (English: "law"). All of the included ...
 This item contains 1 file (424.19 KB).
 
Publicly Available
corpus
Description:
The 23.9-million-word Lithuanian Parliament corpus is specially designed for the authorship attribution task. The corpus consists of 111 thousand samples of speech transcripts by 147 parliamentarians in the Lithuanian Seimas. It covers the ...
 This item contains 4 files (1.72 GB).
 
Publicly Available

Most Viewed Items

Top Last Week
corpus
Description:
The Corpus of the Contemporary Lithuanian Language, which comprises 208 million words, is a collection of texts designed to represent current Lithuanian. The corpus has been compiled since 1990. The corpus is designed to ...
 This item contains no files.
corpus
Description:
MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked). Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%). Wordform count: 1,641,263. Files: 92. Encoding ...
 This item contains 1 file (26.83 MB).
 
Academic Use, Attribution Required, Noncommercial
corpus
Description:
ALKSNIS v2 consists of 2,355 syntactically annotated sentences in the PML (Prague Markup Language) format. The format allows researchers to visualise and edit syntactic trees with the editor TrEd (https://ufal ...
 This item contains 1 file (1.79 MB).
 
Publicly Available