Fifteen writers and their digital imprints in numbers, images and words

Abstract

In this paper we present the corpus 15authors, which contains 49 works by fifteen authors who wrote in the Serbian language at the end of the 19th and the beginning of the 20th century. The corpus was derived from the SrpELTeC corpus, built within the framework of the COST Action "Distant Reading for European Literary History." We used the existing annotations (sentences, phrases in foreign languages, part-of-speech (POS) tags, lemmas and named entities) and conducted additional analyses with the open-source software Unitex and TXM in order to reveal the digital imprints that the selected authors left in their works.
Keywords: literary corpus, textometry, corpus linguistics, distant reading, Serbian language, Unitex, TXM.

Published
2026-04-27
How to Cite
KRSTEV, Cvetana. Fifteen writers and their digital imprints in numbers, images and words. Infotheca - Journal for Digital Humanities, [S.l.], v. 26, n. 1, p. 9-42, apr. 2026. ISSN 2217-9461. Available at: <https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2026.26.1.1_en>. Date accessed: 21 may 2026. doi: https://doi.org/10.18485/infotheca.2026.26.1.1.