publish.UP Extern

EXMARaLDA und Datenbank "Mehrsprachigkeit" (2005)

This paper presents some concepts and principles used in the development of a database of multilingual spoken discourse at the University of Hamburg. The emphasis of the first part is on general considerations for the handling of heterogeneous data sets: After showing that diversity in transcription data is partly conceptually and partly technologically motivated, it is argued that the processing of transcription corpora should be approached via a three-level architecture which separates form (application) and content (data) on the one hand, and logical and physical data structures on the other hand. Such an architecture does not only pave the way for modern text-technological approaches to linguistic data processing, it can also help to decide where and how a standardization in the work with heterogeneous data is possible and desirable and where it would run counter to the needs of the research community. It is further argued that, in order to ensure user acceptance, new solutions developed in this approach must take care not to abandon established concepts too quickly. The focus of the second part is on some practical experiences with users and technologies gained in the four years’ project work. Concerning the practical development work, the value of open standards like XML and Unicode is emphasized and some limitations of the “platform-independent” JAVA technology are indicated. With respect to users of the EXMARaLDA system, a predominantly conservative attitude towards technological innovations in transcription corpus work can be stated: individual users tend to stick to known functionalities and are reluctant to adopt themselves to the new possibilities. Furthermore, an active commitment to cooperative corpus work still seems to be the exception rather than the rule. It is concluded that technological innovations can contribute their share to a progress in the work with heterogeneous linguistic data, but that they will have to be supplemented, in the long run, with an adequate methodological reflection and the creation of an appropriate infrastructure.

Unity in diversity (2005)

Wagner, Andreas

This paper describes the creation and preparation of TUSNELDA, a collection of corpus data built for linguistic research. This collection contains a number of linguistically annotated corpora which differ in various aspects such as language, text sorts / data types, encoded annotation levels, and linguistic theories underlying the annotation. The paper focuses on this variation on the one hand and the way how these heterogeneous data are integrated into one resource on the other hand.

Zur Theorie der Einstellungen zur Staatstätigkeit : Möglichkeiten und Grenzen der Erfassung (1999)

Leßmann, Grit

Inhalt: Psychologischer Hintergrund Grundlagen der Einstellungsmessung Das Einstellungsobjekt "Staatstätigkeit" Werte und Einstellungen Beispielhafte Einstellungen zur Staatstätigkeit -Steuermoral -Schattenwirtschaft Anspruchsinflation und Fiskalillusion

Food deficits, food security and food aid : concepts and measurement (1998)

Gabbert, Silke ; Weikard, Hans-Peter

The concepts of food deficit, hunger, undernourishment and food security are discussed. Axioms and indices for the assessment of nutrition of individuals and groups are suggested. Furthermore a measure for food aid donor performance is developed and applied to a sample of bilateral and multilateral donors providing food aid for African countries.

Effekte der Verrechnungsmöglichkeit negativer Einkünfte im deutschen Einkommensteuerrecht (1997)

Bork, Christhart ; Müller, Klaus

Die zunehmende Erosion der veranlagten Einkommensteuer wirft die Frage auf, inwieweit die Möglichkeit der Verechnung positiver mit negativen Einkünften dafür verantwortlich ist. Auf der Basis eines Mikrosimulationsmodells analysiert der Beitrag die Wirkungen dieser im deutschen Einkommensteuerrecht möglichen Verrechenbarkeit. Zum einen werden die aus der Abschaffung der Verrechnungsmöglichkeiten resultierenden Wanderungen von Steuerpflichtigen in höhere Einkommensklassen und zum anderen die Auswirkungen auf das Steueraufkommen untersucht. Insgesamt vermindern ca. 5 % der Steuerpflichtigen ihre positiven Einkünfte durch negative Einkünfte einer anderen Einkunftsart. Im Hinblick auf das Steueraufkommen zeichnen sich beträchtliche Steuerausfälle in Höhe von ca. 33 Mrd. DM durch diese Verrechnungsmöglichkeit ab.

Aufkommens- und Verteilungswirkungen einer Reform der Rentenbesteuerung (1997)

Bork, Christhart ; Müller, Klaus

Inhalt: Theoretische Begründung einer Besteuerung von Alterseinkünften Kritische Überprüfung einiger Argumente Die Reformvorschläge und das Simulationsmodell Die Simulationsergebnisse Kritische Bewertung der Ergebnisse Mit einem Kommentar von Hans-Peter Weikard: Rentenbesteuerung und Korrespondenzprinzip: 1. Wie selbstverständlich ist das Korrespondenzprinzip? 2. Die zeitliche Dimension des Korrespondenzprinzips 3. Eine unzulässige Interpretation 4. Fazit

ANNIS (2004)

Dipper, Stefanie ; Götze, Michael ; Stede, Manfred ; Wegst, Tillmann

In this paper, we discuss the design and implementation of our first version of the database "ANNIS" ("ANNotation of Information Structure"). For research based on empirical data, ANNIS provides a uniform environment for storing this data together with its linguistic annotations. A central database promotes standardized annotation, which facilitates interpretation and comparison of the data. ANNIS is used through a standard web browser and offers tier-based visualization of data and annotations, as well as search facilities that allow for cross-level and cross-sentential queries. The paper motivates the design of the system, characterizes its user interface, and provides an initial technical evaluation of ANNIS with respect to data size and query processing.

Focus strategies in chadic (2004)

Hartmann, Katharina ; Zimmermann, Malte

We argue that the standard focus theories reach their limits when confronted with the focus systems of the Chadic languages. The backbone of the standard focus theories consists of two assumptions, both called into question by the languages under consideration. Firstly, it is standardly assumed that focus is generally marked by stress. The Chadic languages, however, exhibit a variety of different devices for focus marking. Secondly, it is assumed that focus is always marked. In Tangale, at least, focus is not marked consistently on all types of constituents. The paper offers two possible solutions to this dilemma.

The influence of tense in adverbial quantification (2004)

Endriss, Cornelia ; Hinterwimmer, Stefan

We argue that there is a crucial difference between determiner and adverbial quantification. Following Herburger [2000] and von Fintel [1994], we assume that determiner quantifiers quantify over individuals and adverbial quantifiers over eventualities. While it is usually assumed that the semantics of sentences with determiner quantifiers and those with adverbial quantifiers basically come out the same, we will show by way of new data that quantification over events is more restricted than quantification over individuals. This is because eventualities in contrast to individuals have to be located in time which is done using contextual information according to a pragmatic resolution strategy. If the contextual information and the tense information given in the respective sentence contradict each other, the sentence is uninterpretable. We conclude that this is the reason why in these cases adverbial quantification, i.e. quantification over eventualities, is impossible whereas quantification over individuals is fine.

Heterogeneity in focus : creating and using linguistic databases (2005)

The papers in this volume were presented at the workshop Heterogeneity in Linguistic Databases', which took place on July 9, 2004 at the University of Potsdam. The workshop was organized by project D1: Linguistic Database for Information Structure: Annotation and Retrieval', a member project of the SFB 632, a collaborative research center entitled Information Structure: the Linguistic Means for Structuring Utterances, Sentences and Texts'. The workshop brought together both developers and users of linguistic databases from a number of research projects which work on an empirical basis, all of which have to cope with different sorts of heterogeneity: primary linguistic data and annotated information may be heterogeneous, as well as the data structures representing them. The first four papers (by Wagner, Schmidt, Lüdeling, and Witt) address aspects of heterogeneous data from the point of view of database developers; the remaining three papers (by Meyer, Smith, and Teich/Fankhauser) focus on data exploitation by the users.

Extern

Refine

Has Fulltext

Author

Year of publication

Document Type

Language

Is part of the Bibliography

Keywords

Institute

1482 search hits