TY - JOUR A1 - Dipper, Stefanie A1 - Götze, Michael A1 - Skopeteas, Stavros T1 - Introduction [ISIS (2007) 07: Information structure in Cross-Linguistic Corpora] JF - Interdisciplinary studies on information structure : ISIS N2 - The annotation guidelines introduced in this chapter present an attempt to create a unique infrastructure for the encoding of data from very different languages. The ultimate target of these annotations is to allow for data retrieval for the study of information structure, and since information structure interacts with all levels of grammar, the present guidelines cover all levels of grammar too. After introducing the guidelines, the current chapter also presents an evaluation by means of measurements of the inter-annotator agreement. Y1 - 2007 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-22225 SN - 1614-4708 SN - 1866-4725 IS - 7 SP - 1 EP - 27 PB - Universitätsverlag Potsdam CY - Potsdam ER - TY - JOUR A1 - Dipper, Stefanie A1 - Götze, Michael A1 - Stede, Manfred A1 - Wegst, Tillmann T1 - ANNIS BT - a linguistic database for exploring information structure JF - Interdisciplinary studies on information structure : ISIS ; working papers of the SFB 632 N2 - In this paper, we discuss the design and implementation of our first version of the database "ANNIS" ("ANNotation of Information Structure"). For research based on empirical data, ANNIS provides a uniform environment for storing this data together with its linguistic annotations. A central database promotes standardized annotation, which facilitates interpretation and comparison of the data. ANNIS is used through a standard web browser and offers tier-based visualization of data and annotations, as well as search facilities that allow for cross-level and cross-sentential queries. The paper motivates the design of the system, characterizes its user interface, and provides an initial technical evaluation of ANNIS with respect to data size and query processing. Y1 - 2004 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-8432 SN - 1866-4725 SN - 1614-4708 IS - 1 SP - 245 EP - 279 ER - TY - JOUR A1 - Chiarcos, Christian A1 - Dipper, Stefanie A1 - Götze, Michael A1 - Leser, Ulf A1 - Lüdeling, Anke A1 - Ritz, Julia A1 - Stede, Manfred T1 - A flexible framework for integrating annotations from different tools and tag sets N2 - We present a general framework for integrating annotations from different tools and tag sets. When annotating corpora at multiple linguistic levels, annotators may use different expert tools for different phenomena or types of annotation. These tools employ different data models and accompanying approaches to visualization, and they produce different output formats. For the purposes of uniformly processing these outputs, we developed a pivot format called PAULA, along with converters to and from tool formats. Different annotations are not only integrated at the level of data format, but are also joined on the level of conceptual representation. For this purpose, we introduce OLiA, an ontology of linguistic annotations that mediates between alternative tag sets that cover the same class of linguistic phenomena. All components are integrated in the linguistic information system ANNIS : Annotation tool output is converted to the pivot format PAULA and read into a database where the data can be visualized, queried, and evaluated across multiple layers. For cross-tag set querying and statistical evaluation, ANNIS uses the ontology of linguistic annotations. Finally, ANNIS is also tied to a machine learning component for semiautomatic annotation. Y1 - 2008 UR - http://www.atala.org/A-Flexible-Framework-for SN - 1248-9433 ER -