What's New

corpus
corpus
Description:
The corpus is comprised of 154 EU legislative documents (English documents and their translations into French and Lithuanian) related to various financial issues and enacted in the period 2013-2018. The documents were ...
 This item contains 1 file (4.27 MB).
 
Publicly Available
corpus
corpus
Description:
The corpus contains parallelly aligned scripts of TED Talks in English, Lithuanian, and Hebrew. It contains spoken language data.
 This item contains 1 file (4.3 MB).
 
Publicly Available
corpus
corpus
Description:
MATAS corpus (version 1.0) DESCRIPTION Manually checked, morphologically annotated corpus MATAS FORMATS 1. CoNLL-U (CONLLU, conllu) 2. SketchEngine - tab delimited word per line (TAB-WPL, txt) SIZE Wordform ...
 This item contains 3 files (32.95 MB).
 
Publicly Available

Most Viewed Items

Top Last Week
corpus
corpus
Author(s):
Description:
MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked) Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%) Wordform count: 1,641,263 Files: 92 Encoding ...
 This item contains 1 file (26.83 MB).
 
Academic Use Attribution Required Noncommercial
corpus
corpus
Description:
MATAS corpus (version 1.0) DESCRIPTION Manually checked, morphologically annotated corpus MATAS FORMATS 1. CoNLL-U (CONLLU, conllu) 2. SketchEngine - tab delimited word per line (TAB-WPL, txt) SIZE Wordform ...
 This item contains 3 files (32.95 MB).
 
Publicly Available
corpus
corpus
Author(s):
Description:
Lithuanian Coreference Corpus The corpus is made out of 100 articles from news portals focusing on political news, as such texts are rich in quotations and named entity mentions. The corpus has been annotated with ASAS ...
 This item contains 1 file (174.53 KB).
 
Publicly Available