dc.contributor.author Rimkutė, Erika
dc.contributor.author Kamandulytė-Merfeldienė, Laura
dc.contributor.author Aleksandravičiūtė, Gabrielė
dc.contributor.author Anglickienė, Laimutė
dc.contributor.author Barkauskaitė, Giedrė
dc.contributor.author Bielinskienė, Agnė
dc.contributor.author Boizou, Loïc
dc.contributor.author Grigonytė, Gintarė
dc.contributor.author Kovalevskaitė, Jolanta
dc.contributor.author Virbickienė, Gabrielė
dc.date.accessioned 2022-08-30T13:22:27Z
dc.date.available 2022-08-30T13:22:27Z
dc.date.issued 2022-08-29
dc.identifier.uri http://hdl.handle.net/20.500.11821/50
dc.description The Pedagogic Corpus of Lithuanian is a monolingual specialized corpus prepared for learning and teaching Lithuanian in a foreign language classroom. The corpus includes authentic Lithuanian texts, selected by criteria such as learner-relevant communicative function and genre. Both spoken and written language are represented. The corpus contains 669,000 tokens: 111,000 tokens from texts and spoken language for levels A1-A2 and 558,000 tokens from texts and spoken language for levels B1-B2 (according to the Common European Framework of Reference for Languages). The spoken component constitutes approximately 7.5% of the corpus. The written component (620,000 tokens) includes levelled texts from coursebooks and unlevelled texts from other sources. These texts can be classified into 29 text types (dialogues, narratives, information, etc.) and 4 groups according to communicative aim: informational texts, educational texts, advertising, and fiction. The corpus offers two types of search: simple and advanced (see "Search Tips"). Simple Search finds instances of a search item (word form, lemma, or two words) in the whole corpus or in a particular part of it (spoken or written texts). After selecting the written subcorpus, you can further select the text type (coursebook or non-coursebook texts) and/or the genre of the written texts. Advanced Search offers all the features of Simple Search plus additional options. Because the Pedagogic Corpus is morphologically annotated, Advanced Search also lets you search by grammatical features (e.g. part of speech, case, number, verb form).
At https://kalbu.vdu.lt/mokymosi-priemones/mokomasis-tekstynas/ you can find truncated wordlists: lists of lemmas and word forms (for the whole corpus, for the spoken and written components, and for each level), and lists for particular parts of speech in the whole corpus. The lists can be downloaded as .xlsx files. REFERENCE: Kovalevskaitė, Jolanta and Rimkutė, Erika. "Pedagogic Corpus of Lithuanian: A New Resource for Learning and Teaching Lithuanian as a Foreign Language." Sustainable Multilingualism, vol. 17, no. 1, 2020, pp. 197-230. https://doi.org/10.2478/sm-2020-0019
dc.language.iso lit
dc.publisher Vytautas Magnus University
dc.source.uri https://kalbu.vdu.lt/mokymosi-priemones/mokomasis-tekstynas/
dc.subject Pedagogic corpus
dc.subject Lithuanian language
dc.title Pedagogic Corpus of Lithuanian
dc.type toolService
metashare.ResourceInfo#ContentInfo.detailedType service
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent true
hidden false
hasMetadata false
has.files no
branding CLARIN-LT
contact.person Erika Rimkutė erika.rimkute@vdu.lt Vytautas Magnus University
sponsor European Social Fund 09.3.1-ESFA-V-709-01-0002 Lithuanian Academic Scheme for International Cooperation in Baltic Studies euFunds
files.size 0
files.count 0