Automatic Generation of a Large Scale Semantic Search Evaluation Data-Set

Kharkevich, Uladzimir (2009) Automatic Generation of a Large Scale Semantic Search Evaluation Data-Set. UNSPECIFIED.

Download (289Kb) | Preview


    To compare the performance of information retrieval techniques in various settings, the data-sets which model these settings need to be generated. Although there are already available collections, such as those used in TREC conference series, which are used for evaluation of various retrieval tasks, there is a lack of collections which are specially developed for evaluation of the effectiveness of semantically enhanced text retrieval techniques. In this paper, we propose an approach for the automatic generation of such data-sets, by using search engines query logs and data from human-edited web directories. The evaluation is performed by comparing the performance of Lucene, a popular syntactic search engine, and Concept Search, a search engine which extends Lucene's syntactic search with semantics.

    Item Type: Departmental Technical Report
    Department or Research center: Information Engineering and Computer Science
    Subjects: Q Science > QA Mathematics > QA076 Computer software
    Additional Information: Published also in the proceedings of the 2nd International Conference on the Semantic Web and Digital Libraries (ICSD), 2009
    Report Number: DISI-09-031
    Repository staff approval on: 27 Oct 2009

    Actions (login required)

    View Item