Transducers for Annotating Weather Information in Meteorological Texts in Serbia

  • Vesna Pajić University of Belgrade, Faculty of Agriculture, Departmant for Agricultural Engineering
  • Staša Vujičić Stanković University of Belgrade, Faculty of Mathematics
  • Miloš Pajić University of Belgrade, Faculty of Agriculture, Departmant for Agricultural Engineering

Abstract

We present a process of extracting information on meteorological phenomena from texts in Serbian. We used finite state automata and transducers for both text processing and information extraction, through software specialized for linguistic text processing. Information extraction was done by annotating text segments. The extraction rules were described with transducers (finite state transducers and recursive transition networks). Some details of used transducers are presented in this paper, aiming to demonstrate the application of different electronic resources for Serbian, especially the electronic morphological dictionary. Transducers are very efficient tools for language processing. In the case of processing Serbian, it is very important to create different resources and corpora which could allow linguistic research. Therefore, we plan to form a collection of transducers and make it publicly available for different kinds of research in the computational linguistics domain.

Published
2024-03-04
How to Cite
PAJIĆ, Vesna; VUJIČIĆ STANKOVIĆ, Staša; PAJIĆ, Miloš. Transducers for Annotating Weather Information in Meteorological Texts in Serbia. Infotheca - Journal for Digital Humanities, [S.l.], v. 13, n. 2, p. 33-47, mar. 2024. ISSN 2217-9461. Available at: <https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/393>. Date accessed: 19 nov. 2024.