What's New

corpus
corpus
Description:
LITUND contains two comparable corpora: 1. Unreliable news texts. 147 full-text articles (100,678 words) identified as misleading by professional fact-checkers. The corpus includes metadata file with the following ...
 Šis įrašas turi 3 bylas (3.34 MB).
 
Academic Use Attribution Required Noncommercial
corpus
corpus
Description:
This corpus consists of (1) examples of hate speech based on ethnicity, nationality, or race, and (2) a collection of neutral comments, including both general comments and comments mentioning nationality in a positive or ...
 Šis įrašas turi 4 bylas (803.65 KB).
 
Publicly Available
lexicalConceptualResource
lexicalConceptualResource
Author(s):
Description:
The dataset was extracted from publicly available online sources, primarily Lithuanian news portal publications from the period 2014–2020 (~500M words). It includes patterns using the following Perl-style regular expression: ...
 Šis įrašas turi 1 bylą (749.56 KB).
 
Publicly Available

Most Viewed Items

Top Last Week
toolService
toolService
Description:
This keyboard driver allows easy access of the Lithuanian letters via conventional keyboard layout a.k.a. „Lithuanian letters instead of numbers“. Essential new feature of this layout is the extensive use of "dead key" ...
 Šis įrašas turi 1 bylą (10.5 MB).
 
Publicly Available
toolService
toolService
Description:
Original TrueType font designed and hinted in Lithuania. The font complies with the ISO/IEC 10646 (Unicode) standard and have the full set of casual and accented Lithuanian characters (e.g., į̃, ū̃, r̃, ė́, etc.). All the ...
 Šis įrašas turi 1 bylą (782.13 KB).
 
Publicly Available
corpus
corpus
Author(s):
Description:
MATAS v0.2 - Morphologically Annotated Lithuanian Corpus (manually checked) Contains 4 parts: Documents (21%), Fiction (19%), Periodicals (36%), Scientific texts (24%) Wordform count: 1,641,263 Files: 92 Encoding ...
 Šis įrašas turi 1 bylą (26.83 MB).
 
Academic Use Attribution Required Noncommercial