
A Large Scale Dataset for the Evaluation of Ontology Matching Systems

Giunchiglia, Fausto and Yatskevich, Mikalai and Avesani, Paolo and Shvaiko, Pavel (2008) A Large Scale Dataset for the Evaluation of Ontology Matching Systems. Technical Report DISI-08-001, Ingegneria e Scienza dell'Informazione, University of Trento.

Full text available as: PDF

Abstract

Recently, the number of ontology matching techniques and systems has increased significantly, which makes their evaluation and comparison a more pressing issue. One of the challenges of ontology matching evaluation is building large-scale evaluation datasets. The number of possible correspondences between two ontologies grows quadratically with the number of entities in these ontologies, which often makes the manual construction of evaluation datasets demanding to the point of being infeasible for large-scale matching tasks. In this paper we present TaxME2, an ontology matching evaluation dataset composed of thousands of matching tasks. It was built semi-automatically from the Google, Yahoo and Looksmart web directories. We evaluated TaxME2 using the results of almost two dozen state-of-the-art ontology matching systems. The experiments indicate that the dataset possesses the desired key properties: it is error-free, incremental, discriminative, monotonic, and hard for state-of-the-art ontology matching systems.
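
As a rough illustration of the quadratic growth mentioned in the abstract, the sketch below counts candidate entity pairs for a single matching task. It assumes one candidate correspondence per (source entity, target entity) pair and ignores the type of relation; the function name and the ontology sizes are hypothetical and are not taken from the paper.

```python
def candidate_correspondences(n_source: int, n_target: int) -> int:
    """Count the entity pairs to inspect for one matching task.

    Assumption: every source entity may correspond to every target
    entity, so the count is the product of the two ontology sizes.
    """
    return n_source * n_target


# Illustrative sizes only: a 10x increase in ontology size gives a
# 100x increase in the number of pairs a human annotator must check.
for size in (10, 100, 1000):
    print(size, candidate_correspondences(size, size))
```

Under this assumption, two ontologies of 1,000 entities each already yield a million candidate pairs, which is the scale at which fully manual annotation stops being practical.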

The paper has been accepted for publication in "The Knowledge Engineering Review", Cambridge University Press (ISSN: 0269-8889, EISSN: 1469-8005).

Subjects: Q Science: QA Mathematics: QA076 Computer software
ID Code: 1345
Deposited By: DIT, Administrator
Deposited On: 28 April 2008
