Central and South-European language resources in META-SHARE
Abstract
The paper intends to give a brief summary of one the most recent efforts on building the pan-European language technology infrastructure: META-NET – a network of Excellence consisting of 54 research centres from 33 countries – and specifically, its Central and South-European participating project: CESAR. One of the major activities of the project is selection of the resources and tools to be collected, validated, standardized, upgraded/extended/cross-lingually aligned and stored in the META-SHARE open resource exchange facility.
The contribution focuses on presenting the repository maintaining the metadata of the selected resources, the methodology and criteria for their selection and a detailed view to the resources and tools delivered by the project in 2011. After highlighting the concepts of META-SHARE metadata model and synchronized network of metadata servers, the article presents the methodology and criteria for the resource selection by calculating point values basing on solid evaluation indicators such as resource availability, quality, and quantity of similar resources available, coverage, maturity, sustainability and adaptability. The META-NET Language White Papers – the series of reports on the state of each European language with respect to language technology is also presented as well as the licensing guidelines put forward by the META-SHARE community, promoting open and free of charge use of data and tools by using standardized and well-defined legal attributions.
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.