Medical domain document classification via extraction of taxonomy concepts from MeSH ontology

Mihailo Škorić University of Belgrade
Mauro Dragoni Fondazione Bruno Kessler

DOI: https://doi.org/10.18485/infotheca.2019.19.1.3

Abstract

This paper is a result of a task that was presented to attendants of Keyword Search in Big Linked Data summer school, that was organized by Vienna University of Technology, under the Keystone COST action in the summer of 2017. It presents a specific approach to the classification via creation of minimal document surrogates based on the US National medical library’s MeSH ontology, which is derived from the Medical Subject Headings thesaurus. In a series of previously classified medically related text, which are the bases for the task, all of the significant terms are located and replaced with taxonomical references from the MeSH ontology. Extracted references are used for the classification within the ontology using a rather simple algorithm and the results are evaluated in compresence to previous manual classification of the same documents.

3_en_Skoric

Published

2019-10-25

How to Cite

ŠKORIĆ, Mihailo; DRAGONI, Mauro. Medical domain document classification via extraction of taxonomy concepts from MeSH ontology. Infotheca - Journal for Digital Humanities, [S.l.], v. 19, n. 1, p. 55-69, oct. 2019. ISSN 2217-9461. Available at: <https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2019.19.1.3_en>. Date accessed: 13 feb. 2026. doi: https://doi.org/10.18485/infotheca.2019.19.1.3.

Citation Formats

Issue

Vol 19 No 1 (2019): Infotheca - Journal for Digital Humanities

Section

Articles

		Faculty of Philology, University of Belgrade
		University Library „Svetozar Marković“
		Association of Libraries of the Universities of Serbia

Medical domain document classification via extraction of taxonomy concepts from MeSH ontology

Abstract

Publisher