Guidelines for annotating the LUNA corpus with frame information

Tonelli, Sara and Riccardi, Giuseppe (2010) Guidelines for annotating the LUNA corpus with frame information. UNSPECIFIED.

[img]
Preview
PDF
Download (597Kb) | Preview

    Abstract

    This document defines the annotation workflow aimed at adding frame information to the LUNA corpus of conversational speech. In particular, it details both the corpus pre-processing steps and the proper annotation process, giving hints about how to choose the frame and the frame element labels. Besides, the description of 20 new domain-specific and language-specific frames is reported. To our knowledge, this is the first attempt to adapt the frame paradigm to dialogs and at the same time to define new frames and frame elements for the specific domain of software/hardware assistance. The technical report is structured as follows: in Section 2 an overview of the FrameNet project is given, while Section 3 introduces the LUNA project and the annotation framework involving the Italian dialogs. Section 4 details the annotation workflow, including the format preparation of the dialog files and the annotation strategy. In Section 5 we discuss the main issues of the annotation of frame information in dialogs and we describe how the standard annotation procedure was changed in order to face such issues. Then, the 20 newly introduced frames are reported in Section 6.

    Item Type: Departmental Technical Report
    Department or Research center: Information Engineering and Computer Science
    Subjects: Q Science > QA Mathematics > QA075 Electronic computers. Computer science
    P Language and Literature > P Philology. Linguistics (General) > P0121 Linguistics
    Uncontrolled Keywords: lexical semantics, conversational speech, corpora annotation, FrameNet
    Report Number: DISI-10-017
    Repository staff approval on: 05 Mar 2010

    Actions (login required)

    View Item