Two Lithuanian language children’s corpora, collected during the EMVAKA project, consist of the Lithuanian language production by children aged 7–13:
(1) spoken (73 files, c. 31,000 tokens) and written (77 files, c. 7,600 tokens) production by returning emigrant children ("KG");
(2) spoken (36 files, c. 18,800 tokens) and written (49 files, c. 4,300 tokens) production by children permanently residing in Lithuania ("KL").
Reference to the corpus: Bikelienė, L., R. Juknevičienė, N. Poderienė, J. Pribušauskaitė, A. Tamulionienė. 2022. Grįžusių emigrantų vaikų kalba: kelios įžvalgos. Verbum 13, 4. Prieiga: https://dx.doi.org/10.15388/Verb.30.