publish.UP Search

A flexible framework for integrating annotations from different tools and tag sets (2008)

Chiarcos, Christian ; Dipper, Stefanie ; Götze, Michael ; Leser, Ulf ; Lüdeling, Anke ; Ritz, Julia ; Stede, Manfred

We present a general framework for integrating annotations from different tools and tag sets. When annotating corpora at multiple linguistic levels, annotators may use different expert tools for different phenomena or types of annotation. These tools employ different data models and accompanying approaches to visualization, and they produce different output formats. To process these outputs uniformly, we developed a pivot format called PAULA, along with converters to and from tool formats. Different annotations are integrated not only at the level of data format but also at the level of conceptual representation. For this purpose, we introduce OLiA, an ontology of linguistic annotations that mediates between alternative tag sets that cover the same class of linguistic phenomena. All components are integrated in the linguistic information system ANNIS: annotation tool output is converted to the pivot format PAULA and read into a database where the data can be visualized, queried, and evaluated across multiple layers. For cross-tag-set querying and statistical evaluation, ANNIS uses the ontology of linguistic annotations. Finally, ANNIS is also tied to a machine learning component for semiautomatic annotation.
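The mediating role that the abstract assigns to the ontology can be illustrated with a minimal sketch. It assumes plain Python dictionaries in place of a real ontology; the concept names, the token layout, and the `find_by_concept` helper are hypothetical and are not the OLiA model or the ANNIS query interface. The tags themselves come from the STTS and Penn Treebank part-of-speech tag sets.

```python
# Hypothetical, simplified stand-in for an ontology that mediates between tag sets.
# Concept names and data layout are illustrative assumptions only; they are not
# taken from OLiA, PAULA, or ANNIS.
TAG_TO_CONCEPT = {
    "stts": {"NN": "CommonNoun", "NE": "ProperNoun"},
    "penn": {
        "NN": "CommonNoun",
        "NNS": "CommonNoun",
        "NNP": "ProperNoun",
        "NNPS": "ProperNoun",
    },
}


def find_by_concept(tokens, concept):
    """Return the tokens whose tag maps to `concept`, whatever tag set they use."""
    return [
        tok
        for tok in tokens
        if TAG_TO_CONCEPT.get(tok["tagset"], {}).get(tok["tag"]) == concept
    ]


# Tokens annotated by two different (hypothetical) tools with different tag sets.
tokens = [
    {"form": "Haus", "tagset": "stts", "tag": "NN"},
    {"form": "Potsdam", "tagset": "stts", "tag": "NE"},
    {"form": "houses", "tagset": "penn", "tag": "NNS"},
    {"form": "Berlin", "tagset": "penn", "tag": "NNP"},
]

# One query over a shared concept retrieves matches from both tag sets.
common_nouns = [tok["form"] for tok in find_by_concept(tokens, "CommonNoun")]
proper_nouns = [tok["form"] for tok in find_by_concept(tokens, "ProperNoun")]
print(common_nouns)
print(proper_nouns)
```

The point of the sketch is only that a query phrased against a shared concept does not need to know which tag set produced a given annotation.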

Animacy and child language : An OT account (2008)

Féry, Caroline ; Drenhaus, Heiner

In this paper we report the results of an elicited imitation task on dative case marking in non-canonical double object constructions with 22 German children (aged 3;9-6;8). The aim was to test the proficiency of the children's grammar and to see which strategies they use to produce ditransitive sentences in which the direct object precedes the indirect object. The analysis of the children's utterances/imitations shows that the animacy of the direct object affects the overt dative case marking of the indirect object. Children made more errors repeating dative case marking when the direct object was inanimate, i.e., they produced the accusative case on the indirect object (non-adult-like). When both objects were animate, children correctly produced the dative case on the indirect object. We describe and account for these performance strategies of the children in the framework of Optimality Theory. Assuming that the same universal constraints are at work as in the adult grammar, the difference between adults and children lies in the constraint ranking. We focus on a prominent pattern found in children's performance, which is absent (or rather suppressed) in the corresponding adult performance, and show that one and the same grammar accounts for both (in the sense of "strong continuity"). © 2007 Elsevier B.V. All rights reserved.

Impaired word stress pattern discrimination in very-low-birthweight infants during the first 6 months of life (2008)

Herold, Birgit ; Höhle, Barbara ; Walch, Elisabeth ; Weber, Tanja ; Obladen, Michael

Syntactic categorization of new words : distributional and morphological cues to form class (2008)

Höhle, Barbara ; Wang, Hao

The role of duration as a phonetic correlate of focus (2008)

Kügler, Frank

Teaching a new word : properties of CDS to 12-month-old German-learning children (2008)

Müller, Anja ; Höhle, Barbara ; Weissenborn, Jürgen

Individual differences in moral judgment competence influence neural correlates of socio-normative judgments (2008)

Prehn, Kristin ; Wartenburger, Isabell ; Mériau, Katja ; Scheibe, Christina ; Goodenough, Oliver R. ; Villringer, Arno ; van der Meer, Elke ; Heekeren, Hauke R.

Disambiguating rhetorical structure (2008)

Stede, Manfred

Empirical studies of text coherence often use tree-like structures in the spirit of Rhetorical Structure Theory (RST) as a representational device. This paper identifies several sources of ambiguity in RST-inspired trees and argues that such structures are therefore not as explanatory as a text representation should be. As an alternative, an approach toward multi-level annotation (MLA) of texts is proposed, which separates the information into distinct levels of representation, in particular: referential structure, thematic structure, conjunctive relations, and intentional structure. Levels are conceptually built upon each other, and human annotators can produce them using a dedicated software environment. We argue that the resulting multi-level corpora are descriptively more adequate and, as a resource, more useful than RST-style treebanks.

RST revisited : disentangling nuclearity (2008)

Stede, Manfred

Connective-based local coherence analysis : a lexicon for recognizing causal relationships (2008)

Stede, Manfred

Metrical and statistical cues for word segmentation : the use of vowel harmony and word stress as a cue to word boundaries by 6- and 9-month-old Turkish learners (2008)

van Kampen, Anja ; Parmaksiz, Güliz ; van de Vijver, Ruben ; Höhle, Barbara

11 search hits