What's New
toolService

Description:
DIGIRES COVID-19 ML dataset v.1 is a tab-separated (.tsv) file prepared for training machine learning algorithms. The training dataset was compiled from various internet public Lithuanian media sources. It contains 351 ...
This item contains 2 files (532.45
KB).
Publicly Available
corpus

Description:
DIGIRES COVID-19 Corpus v.1 consists of 351 Lithuanian media articles about COVID-19 pandemics. The corpus was compiled from various internet public Lithuanian media sources. Corpus contains 351 files in plain text format ...
This item contains 2 files (743.54
KB).
Publicly Available
corpus

Description:
Two news portals were selected for comparable corpora building: the Lithuanian portal DELFI and the English portal The Guardian. The compiled corpora comprise 135 Lithuanian articles from DELFI portal and 135 English ...
This item contains 2 files (553.57
KB).
Publicly Available
Most Viewed Items
Top Last Week
toolService

Description:
Lithuanian spelling checker for macOS
2020-04-10
version 1.0.45
This item contains 1 file (810.02
KB).
Publicly Available
toolService

Description:
Speech to text automatic transcriber for Lithuanian is a containerized application implemented into 17 containers. It covers four areas: administrative, legal, medical and general spoken language. For the installation of ...
This item contains no files.
corpus

Description:
MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked)
Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%)
Wordform count: 1,641,263
Files: 92
Encoding ...
This item contains 1 file (26.83
MB).
Academic Use

