Old or new, we repair, adjust and alter (texts)

  • Cvetana Krstev University of Belgrade, Faculty of Philology
  • Ranka Stanković University of Belgrade, Faculty of Mining and Geology

Abstract

In this paper we present how e-dictionaries and cascades of finite-state transducers as implemented in Unitex can be used to solve three text transformation problems: correction of texts after OCR, restoration of diacritics and switching between different language variants.

Published
2020-03-16
How to Cite
KRSTEV, Cvetana; STANKOVIĆ, Ranka. Old or new, we repair, adjust and alter (texts). Infotheca - Journal for Digital Humanities, [S.l.], v. 19, n. 2, p. 61-80, mar. 2020. ISSN 2217-9461. Available at: <https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2019.19.2.3_en>. Date accessed: 15 aug. 2020. doi: https://doi.org/10.18485/infotheca.2019.19.2.3.