Refine
Has Fulltext
- no (11) (remove)
Year of publication
- 2008 (11) (remove)
Document Type
- Article (11)
Language
- English (11) (remove)
Is part of the Bibliography
- yes (11)
Institute
- Department Linguistik (11) (remove)
We present a general framework for integrating annotations from different tools and tag sets. When annotating corpora at multiple linguistic levels, annotators may use different expert tools for different phenomena or types of annotation. These tools employ different data models and accompanying approaches to visualization, and they produce different output formats. For the purposes of uniformly processing these outputs, we developed a pivot format called PAULA, along with converters to and from tool formats. Different annotations are not only integrated at the level of data format, but are also joined on the level of conceptual representation. For this purpose, we introduce OLiA, an ontology of linguistic annotations that mediates between alternative tag sets that cover the same class of linguistic phenomena. All components are integrated in the linguistic information system ANNIS : Annotation tool output is converted to the pivot format PAULA and read into a database where the data can be visualized, queried, and evaluated across multiple layers. For cross-tag set querying and statistical evaluation, ANNIS uses the ontology of linguistic annotations. Finally, ANNIS is also tied to a machine learning component for semiautomatic annotation.
In this paper we report the results of an elicited imitation task on dative case marking in non-canonical double object constructions with 22 German children (3;9-6;8). The aim was to test the proficiency of the children's grammar and to see which strategies they use to produce ditransitive sentences in which the direct object precedes the indirect object. The analysis of the children's utterances/imitations shows that the animacy of the direct object affects the overt dative case marking of the indirect object. Children made more errors repeating dative case marking when the direct object was inanimate, i.e., they produced the accusative case on the indirect object (non-adult-like). When both objects were animate, children correctly produced the dative case on the indirect object. We describe and account for these performance strategies of the children in the framework of Optimality Theory. Assuming that the same universal constraints are at work as in the adult grammar, the difference between adults and children lies in the constraint ranking. We focus on a prominent pattern found in children's performance, which is absent (or rather oppressed) in the corresponding adult performance, and show that one and the same grammar accounts for both (in the sense of "strong continuity"). (c) 2007 Elsevier B.V. All rights reserved.
Empirical studies of text coherence often use tree-like structures in the spirit of Rhetorical Structure Theory (RST) as representational device. This paper identifies several sources of ambiguity in RST-inspired trees and argues that such structures are therefore not as explanatory as a text representation should be. As an alternative, an approach toward multi-level annotation (MLA) of texts is proposed, which separates the information into distinct levels of representation, in particular: referential structure, thematic structure, conjunctive relations, and intentional structure. Levels are conceptually built upon each other, and human annotators can produce them using a dedicated software environment. We argue that the resulting multi-level corpora are descriptively more adequate, and as a resource are more useful than RST-style treebanks.