Creation and Analysis of the Yugoslav Rock Song Lyrics Corpus from 1967 to 2003
The paper analyses the process of creation and processing of the Yugoslav rock song lyrics corpus from 1967 to 2003, from the theoretical and practical perspective. The data have been obtained and XML-annotated using the Python programming language and the libraries lyricsmaster/yattag. The corpus has been preprocessed and basic statistical data have been generated by the XSL transformation. The diacritic restoration has been carried out in the Slovo Majstor and LeXimir tools (the latter application has also been used for generating the frequency analysis). The extraction of socio-cultural topics has been performed using the Unitex software, whereas the prevailing topics have been visualised with the TreeCloud software.