Extern
Das Mädchen aus dem Urwald [The Girl from the Jungle]
(2005)
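Diskurspragmatische Faktoren für Topikalität und Verbstellung in der ahd. Tatianübersetzung (9. Jh.) [Discourse-pragmatic factors for topicality and verb placement in the Old High German Tatian translation (9th century)]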
(2005)
The paper presents work in progress on the interaction between information structure and word order in Old High German based on data from the Tatian translation (9th century). The examination of the position of the finite verb in correspondence with the pragmatic status of discourse referents reveals an overall tendency for verb-initial order in thetic/all-focus sentences, whereas in categorical/topic-comment sentences verb-second placement with an initial topic constituent is preferred. This conclusion provides support for the hypothesis stated in Donhauser & Hinterhölzl (2003) that the finite verb form in Early Germanic serves to distinguish the information-structural domains of Topic and Focus. Finally, the investigation sheds light on the process of language change that led to the overall spread of verb-second in main clauses of modern German.
This paper presents some concepts and principles used in the development of a database of multilingual spoken discourse at the University of Hamburg. The first part focuses on general considerations for handling heterogeneous data sets. After showing that diversity in transcription data is partly conceptually and partly technologically motivated, it argues that the processing of transcription corpora should be approached via a three-level architecture that separates form (application) from content (data) on the one hand, and logical from physical data structures on the other. Such an architecture not only paves the way for modern text-technological approaches to linguistic data processing; it can also help to decide where and how standardization in the work with heterogeneous data is possible and desirable, and where it would run counter to the needs of the research community. It is further argued that, to ensure user acceptance, new solutions developed in this approach must take care not to abandon established concepts too quickly. The second part focuses on practical experience with users and technologies gained over four years of project work. Concerning practical development work, the value of open standards like XML and Unicode is emphasized, and some limitations of the “platform-independent” Java technology are indicated. Users of the EXMARaLDA system show a predominantly conservative attitude towards technological innovations in transcription corpus work: individual users tend to stick to known functionalities and are reluctant to adapt to the new possibilities. Furthermore, an active commitment to cooperative corpus work still seems to be the exception rather than the rule.
It is concluded that technological innovations can contribute to progress in the work with heterogeneous linguistic data, but that in the long run they will have to be supplemented with adequate methodological reflection and the creation of an appropriate infrastructure.
We present a system for the linguistic exploration and analysis of lexical cohesion in English texts. Using an electronic thesaurus-like resource, Princeton WordNet, and the Brown Corpus of English, we have implemented a process of annotating text with lexical chains and a graphical user interface for inspection of the annotated text. We describe the system and report on some sample linguistic analyses carried out using the combined thesaurus-corpus resource.
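The idea of annotating a text with lexical chains can be illustrated with a minimal sketch. It does not reproduce the system described in the abstract: instead of Princeton WordNet and the Brown Corpus it assumes a tiny hand-made relation table (`RELATED`, hypothetical), and it uses a simple greedy strategy that is only one of several possible chaining algorithms.

```python
# Hypothetical toy thesaurus standing in for WordNet relations.
# Each key lists the words treated as lexically related to it.
RELATED = {
    "car": {"wheel", "truck"},
    "fruit": {"apple"},
}

# Every word that appears anywhere in the toy thesaurus.
VOCAB = set(RELATED) | set().union(*RELATED.values())


def related(a, b):
    """True if the two words are identical or linked in either direction."""
    return a == b or b in RELATED.get(a, set()) or a in RELATED.get(b, set())


def lexical_chains(tokens):
    """Greedily group related words into chains of (position, word) pairs."""
    chains = []
    for position, word in enumerate(tokens):
        if word not in VOCAB:
            continue  # words unknown to the thesaurus are not chained
        for chain in chains:
            if any(related(word, member) for _, member in chain):
                chain.append((position, word))
                break
        else:
            # No existing chain accepts the word, so it starts a new one.
            chains.append([(position, word)])
    return chains


tokens = "the car lost a wheel while the truck carried fruit and an apple".split()
chains = lexical_chains(tokens)
for chain in chains:
    print(" -> ".join(f"{word}@{pos}" for pos, word in chain))
```

Each chain records token positions, which is what a graphical interface would need in order to highlight the chain members in the annotated text.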
Contents:
1 Opening credits: On ARD and ZDF, Münster sits in the front row
2 Alibi for the time of the crime: Geographers in front of the television
3 Roll film: Münster as a crime scene
4 Doubles under suspicion: Behind the scenes of Wilsberg
5 Witnesses and accomplices: The viewers
6 Supporting roles and side effects: Regional economic impacts
7 Closing credits
This paper describes the standardization problems that come up in a diachronic corpus: it has to cope with differing standards with regard to diplomaticity, annotation, and header information. Such highly heterogeneous texts must be standardized to allow for comparative research without (too much) loss of information.
Multiple hierarchies
(2005)
In this paper, we present the Multiple Annotation approach, which solves two problems: the problem of annotating overlapping structures, and the problem that occurs when documents should be annotated according to different, possibly heterogeneous tag sets. This approach has many advantages: it is based on XML, the modeling of alternative annotations is possible, each level can be viewed separately, and new levels can be added at any time. The files can be regarded as an interrelated unit, with the text serving as the implicit link. Two representations of the information contained in the multiple files (one in Prolog and one in XML) are described. These representations serve as a base for several applications.
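The role of the text as the implicit link between separately annotated files can be sketched as follows. This is an illustration under stated assumptions, not the representation described in the paper: the two layers, their tag names (`s`, `page`), and the helper functions `spans` and `crossing` are hypothetical. Each layer is well-formed XML on its own, yet together they describe structures that overlap and so could not be nested in a single XML tree.

```python
import xml.etree.ElementTree as ET

# Two hypothetical annotation layers over the same primary text.
SENTENCE_LAYER = "<text><s>Hello world.</s> <s>Good bye.</s></text>"
PAGE_LAYER = "<text><page>Hello world. Good</page><page> bye.</page></text>"


def spans(xml_string):
    """Return (plain_text, [(tag, start, end), ...]) using character offsets."""
    root = ET.fromstring(xml_string)
    pieces = []  # text fragments in document order
    result = []  # one (tag, start, end) triple per element

    def walk(elem, pos):
        start = pos
        if elem.text:
            pieces.append(elem.text)
            pos += len(elem.text)
        for child in elem:
            pos = walk(child, pos)
            if child.tail:  # text between this child and the next one
                pieces.append(child.tail)
                pos += len(child.tail)
        result.append((elem.tag, start, pos))
        return pos

    walk(root, 0)
    return "".join(pieces), result


def crossing(a, b):
    """True if two spans overlap without one containing the other."""
    (_, s1, e1), (_, s2, e2) = a, b
    return s1 < s2 < e1 < e2 or s2 < s1 < e2 < e1


text_a, sentence_spans = spans(SENTENCE_LAYER)
text_b, page_spans = spans(PAGE_LAYER)

# The layers can only be related if they annotate the identical text.
assert text_a == text_b

overlaps = [(a, b) for a in sentence_spans for b in page_spans if crossing(a, b)]
for a, b in overlaps:
    print(f"<{a[0]}> {a[1]}-{a[2]} crosses <{b[0]}> {b[1]}-{b[2]}")
```

Because every layer is reduced to character offsets into the shared text, a further layer can be added later without touching the existing files.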
Contents:
- A milestone in the documentation of the discipline's history
- A suggestion and a note on possible effects of the collection
- On the selection and representativeness of the texts
- Where are the three Ws, where the two Bs?
- The image of geography as an island …
- … and as an academic discipline with poorly interconnected, segregated cultures of thought ...
- … in a metatheoretical framing that is in part strongly normative in tone
- Toward reviving a thoroughly well-founded intradisciplinary culture of debate