In: Jazykovedný časopis, vol. 74, no. 1
Dmitri Sitchinava
Details:
Year, pages: 2023, 266 - 274
Language: eng
Keywords:
lacunae, epigraphy, fragmented text, historical corpus, birchbark letters annotation, lemmatization, Old East Slavic
Article type: CORPUS BUILDING
About article:
The paper presents the issue of fragmented and/or ambiguously interpreted texts within the corpora of Old East Slavic vernacular writing. One of these corpora, the corpus of the Old East Slavic birchbark letters, is already available, the other, comprising the texts of Old East Slavic inscriptions, is under preparation. Due to the fragmentary state of many birchbark and epigraphy texts, their lemmatization and grammatical tagging may be uncertain and multiple interpretations may coexist. Some lemmas survive only in fragments which are nevertheless relevant for the study of lexicon. The grammatical status of many fragments may be firmly established despite lacking lexical information. However the relevant data on these fragments is not available in the word indices and corpora that take into consideration only best-preserved word forms. In the paper, the representation and annotation of such word forms within the Old East Slavic vernacular corpora is presented, and relative frequencies of such phenomena within the birchbark letter corpus are shown, with some case studies showing the relevance of the annotation of fragmented forms. The existing approaches, namely for the classical epigraphy within the EpiDoc standard and in the Hittite syntactic treebanks, are also briefly presented and compared to the solution found within the Old East Slavic vernacular corpora.
How to cite:
ISO 690:
Sitchinava, D. 2023. Multiple Interpretation and Fragmented Texts within a Historical Corpus: The Case of Old East Slavic Vernacular Writing. In Jazykovedný časopis, vol. 74, no.1, pp. 266-274. ISSN 0021-5597. DOI: https://doi.org/10.2478/jazcas-2023-0044
APA:
Sitchinava, D. (2023). Multiple Interpretation and Fragmented Texts within a Historical Corpus: The Case of Old East Slavic Vernacular Writing. Jazykovedný časopis, 74(1), 266-274. ISSN 0021-5597. DOI: https://doi.org/10.2478/jazcas-2023-0044
About edition:
Rights:
This work is licensed under CC BY-NC-ND 4.0