Lightweight Parsing of Classifications

Autayeu, Aliaksandr and Giunchiglia, Fausto and Andrews, Pierre (2010) Lightweight Parsing of Classifications. UNSPECIFIED. (Submitted)

    Abstract

    Understanding metadata written in natural language is a crucial requirement for the successful automated integration of large-scale, language-rich classifications such as the ones used in digital libraries. In this article, we analyze the natural language labels used in such classifications by exploring their syntactic structure, and we then show how this structure can be used to detect language patterns that can be processed by a lightweight parser with an average accuracy of 96.82%. This allows for a deep understanding of natural language metadata semantics. In particular, we show how we improve the accuracy of the automatic translation of classifications into lightweight ontologies by almost 18% with respect to the previously used approach. This automatic translation is required by applications such as semantic matching, search, and classification algorithms.

    Item Type: Departmental Technical Report
    Department or Research center: Information Engineering and Computer Science
    Subjects: Q Science > QA Mathematics > QA076 Computer software
    Additional Information: Submitted to the "IJoDL Special Issue on ECDL 2010" - International Journal on Digital Libraries (IJoDL - http://www.springerlink.com/content/100475/)
    Report Number: DISI-10-068
    Repository staff approval on: 31 Dec 2010
