What's New

toolService
toolService
Description:
DIGIRES COVID-19 ML dataset v.1 is a tab-separated (.tsv) file prepared for training machine learning algorithms. The training dataset was compiled from various internet public Lithuanian media sources. It contains 351 ...
 This item contains 2 files (532.45 KB).
 
Publicly Available
corpus
corpus
Description:
DIGIRES COVID-19 Corpus v.1 consists of 351 Lithuanian media articles about COVID-19 pandemics. The corpus was compiled from various internet public Lithuanian media sources. Corpus contains 351 files in plain text format ...
 This item contains 2 files (743.54 KB).
 
Publicly Available
corpus
corpus
Description:
Two news portals were selected for comparable corpora building: the Lithuanian portal DELFI and the English portal The Guardian. The compiled corpora comprise 135 Lithuanian articles from DELFI portal and 135 English ...
 This item contains 2 files (553.57 KB).
 
Publicly Available

Most Viewed Items

Top Last Week
toolService
toolService
Description:
Lithuanian spelling checker for macOS 2020-04-10 version 1.0.45
 This item contains 1 file (810.02 KB).
 
Publicly Available
toolService
toolService
Description:
Speech to text automatic transcriber for Lithuanian is a containerized application implemented into 17 containers. It covers four areas: administrative, legal, medical and general spoken language. For the installation of ...
 This item contains no files.
corpus
corpus
Author(s):
Description:
MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked) Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%) Wordform count: 1,641,263 Files: 92 Encoding ...
 This item contains 1 file (26.83 MB).
 
Academic Use Attribution Required Noncommercial