ANNIS
(2004)
In this paper, we discuss the design and implementation of our first version of the database "ANNIS" ("ANNotation of Information Structure"). For research based on empirical data, ANNIS provides a uniform environment for storing this data together with its linguistic annotations. A central database promotes standardized annotation, which facilitates interpretation and comparison of the data. ANNIS is used through a standard web browser and offers tier-based visualization of data and annotations, as well as search facilities that allow for cross-level and cross-sentential queries. The paper motivates the design of the system, characterizes its user interface, and provides an initial technical evaluation of ANNIS with respect to data size and query processing.
The annotation guidelines introduced in this chapter present an attempt to create a unique infrastructure for the encoding of data from very different languages. The ultimate target of these annotations is to allow for data retrieval for the study of information structure, and since information structure interacts with all levels of grammar, the present guidelines cover all levels of grammar too. After introducing the guidelines, the current chapter also presents an evaluation by means of measurements of the inter-annotator agreement.
Information structure
(2007)
We present a general framework for integrating annotations from different tools and tag sets. When annotating corpora at multiple linguistic levels, annotators may use different expert tools for different phenomena or types of annotation. These tools employ different data models and accompanying approaches to visualization, and they produce different output formats. For the purposes of uniformly processing these outputs, we developed a pivot format called PAULA, along with converters to and from tool formats. Different annotations are not only integrated at the level of data format, but are also joined on the level of conceptual representation. For this purpose, we introduce OLiA, an ontology of linguistic annotations that mediates between alternative tag sets that cover the same class of linguistic phenomena. All components are integrated in the linguistic information system ANNIS : Annotation tool output is converted to the pivot format PAULA and read into a database where the data can be visualized, queried, and evaluated across multiple layers. For cross-tag set querying and statistical evaluation, ANNIS uses the ontology of linguistic annotations. Finally, ANNIS is also tied to a machine learning component for semiautomatic annotation.
To address one of the central questions of plate tectonics-How do large transform systems work and what are their typical features?-seismic investigations across the Dead Sea Transform (DST), the boundary between the African and Arabian plates in the Middle East, were conducted for the first time. A major component of these investigations was a combined reflection/ refraction survey across the territories of Palestine, Israel and Jordan. The main results of this study are: (1) The seismic basement is offset by 3-5 km under the DST, (2) The DST cuts through the entire crust, broadening in the lower crust, (3) Strong lower crustal reflectors are imaged only on one side of the DST, (4) The seismic velocity sections show a steady increase in the depth of the crust-mantle transition (Moho) from 26 km at the Mediterranean to 39 km under the Jordan highlands, with only a small but visible, asymmetric topography of the Moho under the DST. These observations can be linked to the left-lateral movement of 105 km of the two plates in the last 17 Myr, accompanied by strong deformation within a narrow zone cutting through the entire crust. Comparing the DST and the San Andreas Fault (SAF) system, a strong asymmetry in subhorizontal lower crustal reflectors and a deep reaching deformation zone both occur around the DST and the SAF. The fact that such lower crustal reflectors and deep deformation zones are observed in such different transform systems suggests that these structures are possibly fundamental features of large transform plate boundaries