What's New
corpus
Description:
The NEC corpus samples used in the study comprise 433 examination responses (essays) written in L2 English on two topics, namely, The importance of volunteering for young people (from the English examination of 2012, coded ...
This item contains 2 files (544.27 KB).
Academic Use
corpus
Description:
The Lithuanian-English Parallel Cybersecurity Corpus consists of official cybersecurity documents of the Republic of Lithuania and their English translations, dating from 2014 to 2024. The documents were obtained from the legal ...
This item contains 6 files (4.26 MB).
Publicly Available
lexicalConceptualResource
Description:
The resource contains 6 frequency lists for the Corpus of Contemporary Lithuanian language (CCLL) (https://sitti.vdu.lt/en/services/):
1-LT_token_freq_list.txt - a full frequency list of all tokens in CCLL
2-LT_token_f ...
This item contains 2 files (102.41 MB).
Publicly Available
Most Viewed Items
Top Last Week
lexicalConceptualResource
Description:
Lithuanian Hunspell dictionary consists of two files, namely an affix file (.aff) and a dictionary file (.dic). The data can be used for spell checking, morphological analysis, or synthesis of a Lithuanian word (e.g., ...
This item contains 4 files (1.63 MB).
Publicly Available
lexicalConceptualResource
Description:
Dabartinės lietuvių kalbos tekstyno žodžių formų dažniniai sąrašai
Wordlists of Wordforms of the Contemporary Corpus of Lithuanian language
Tekstyno struktūra/Corpus Structure
Patekstynis/Subcorpus Words,m Proporti ...
This item contains 2 files (33.16 MB).
Publicly Available
corpus
Description:
MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked)
Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%)
Wordform count: 1,641,263
Files: 92
Encoding ...
This item contains 1 file (26.83 MB).
Academic Use