Vorwort (Preface)
(2012)
Given the contemporary trend toward modular NLP architectures and multiple annotation frameworks, the existence of concurrent tokenizations of the same text is a pervasive problem in everyday NLP practice. It also poses a non-trivial theoretical problem for the integration of linguistic annotations and for their interpretability in general. This paper describes a solution for integrating different tokenizations using a standoff XML format and discusses the consequences from a corpus-linguistic perspective.