Extraction and annotation of 'location names'

  • Tita Kyriacopoulou, University of Paris-Est, Laboratoire d’Informatique Gaspard-Monge, France
  • Claude Martineau, University of Paris-Est, Laboratoire d’Informatique Gaspard-Monge, France
  • Markarit Vartampetian, Paris Nanterre University, France

Abstract

Introduced as part of the Message Understanding Conferences dedicated to information extraction, Named Entity extraction is a well-studied task in Natural Language Processing. The recognition and categorisation of person names, location names, organisation names, etc. is regarded as a fundamental process for a wide variety of natural language processing applications dealing with content analysis, and much research has been devoted to it, with very good results.
One of our objectives is the identification and automatic (or semi-automatic) annotation of location names, so that the most appropriate information extraction methods can be applied. The main objective is then the combination and interoperability of symbolic and statistical NLP (Natural Language Processing) methods (symbolic rules, machine learning, and data mining).
Our work consisted of recognising named entities, in particular locations, with Unitex, annotating them with Brat, and correcting the annotations manually. The recall and accuracy rates are very encouraging, but the question remains: what is a location name?
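
As a rough illustration of how automatically produced location annotations could be compared with their manually corrected version, the sketch below reads text-bound annotations in Brat's standoff (.ann) format and computes precision and recall by exact span match. It is not the authors' evaluation procedure: the label name "Location", the exact-match criterion, the use of precision alongside recall, and the helper names parse_ann and precision_recall are all assumptions made for this example.

```python
from typing import NamedTuple


class Span(NamedTuple):
    label: str
    start: int
    end: int


def parse_ann(ann_text: str, label: str = "Location") -> set[Span]:
    """Collect text-bound annotations (lines starting with 'T') of one label.

    Brat standoff lines look like: T1<TAB>Location 0 5<TAB>Paris
    The label name "Location" is an assumption for this sketch.
    """
    spans = set()
    for line in ann_text.splitlines():
        if not line.startswith("T"):
            continue  # skip relations, attributes, notes, etc.
        parts = line.split("\t")
        if len(parts) < 2:
            continue
        fields = parts[1].split()
        # Keep only simple continuous spans: "<label> <start> <end>".
        if len(fields) != 3 or fields[0] != label:
            continue
        spans.add(Span(fields[0], int(fields[1]), int(fields[2])))
    return spans


def precision_recall(predicted: set[Span], gold: set[Span]) -> tuple[float, float]:
    """Exact-match precision and recall of predicted spans against gold spans."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical data: automatic output versus manually corrected annotations.
automatic = "T1\tLocation 0 5\tParis\nT2\tLocation 20 26\tGeorge\n"
corrected = "T1\tLocation 0 5\tParis\nT2\tLocation 30 36\tFrance\n"

p, r = precision_recall(parse_ann(automatic), parse_ann(corrected))
print(f"precision={p:.2f} recall={r:.2f}")
```

Exact span matching is the strictest choice; a partial-overlap criterion would credit annotations whose boundaries differ slightly, which matters when the boundaries of a location name are themselves open to debate.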

Published
2020-03-16
How to Cite
KYRIACOPOULOU, Tita; MARTINEAU, Claude; VARTAMPETIAN, Markarit. Extraction and annotation of 'location names'. Infotheca - Journal for Digital Humanities, [S.l.], v. 19, n. 2, p. 7-25, mar. 2020. ISSN 2217-9461. Available at: <https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2019.19.2.1_en>. Date accessed: 15 aug. 2020. doi: https://doi.org/10.18485/infotheca.2019.19.2.1.