dc.contributor.author |
Amilevičius, Darius |
dc.contributor.author |
Utka, Andrius |
dc.contributor.author |
Meidutė, Aistė |
dc.contributor.author |
Ruzaitė, Jūratė |
dc.date.accessioned |
2023-02-20T09:38:04Z |
dc.date.available |
2023-02-20T09:38:04Z |
dc.date.issued |
2023-02-20 |
dc.identifier.uri |
http://hdl.handle.net/20.500.11821/53 |
dc.description |
DIGIRES COVID-19 Corpus v.1 consists of 351 Lithuanian media articles about COVID-19 pandemics. The corpus was compiled from various internet public Lithuanian media sources. Corpus contains 351 files in plain text format (TXT) with UTF-8 encoding. Each article consists of a title (in the 1st line) and an article body. Files are classified into two subcorpora: 1) "unrealiable" that contains articles, which were identified by professional fact checkers as fake news; 2) "reliable" that contains trustworthy articles.
Subcorpus Files Word tokens
Reliable: 175 67902
Unreliable: 176 118747
Total 351 186649 |
dc.language.iso |
lit |
dc.publisher |
Vytautas Magnus University |
dc.rights |
PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT |
dc.rights.uri |
https://clarin.vdu.lt/licenses/eula/PUB_CLARIN-LT_End-User-Licence-Agreement_EN-LT.htm |
dc.rights.label |
PUB |
dc.source.uri |
https://digires.lt/ |
dc.subject |
desinformation corpus |
dc.title |
DIGIRES COVID-19 Corpus v.1 |
dc.type |
corpus |
metashare.ResourceInfo#ContentInfo.mediaType |
text |
hidden |
false |
hasMetadata |
false |
has.files |
yes |
branding |
CLARIN-LT |
contact.person |
Andrius Utka andrius.utka@vdu.lt Vytautas Magnus University |
contact.person |
Darius Amilevičius darius.amilevicius@vdu.lt Vytautas Magnus University |
sponsor |
European Commission LC-01682259 DIGIRES - Supporting Collaborative Partnerships for Digital Resilience and Capacity Building in the Times of Disinfodemic/COVID-19 euFunds |
size.info |
186649 tokens |
files.size |
761384 |
files.count |
2 |