• Deutsch

University Logo

  • Home
  • Search
  • Browse
  • Submit
  • Sitemap
Schließen

Refine

Has Fulltext

  • no (1)

Author

  • Chiarcos, Christian (1)
  • Ritz, Julia (1)
  • Stede, Manfred (1)

Year of publication

  • 2012 (1)

Document Type

  • Article (1)

Language

  • English (1)

Is part of the Bibliography

  • yes (1)

Keywords

  • Linguistic annotation (1) (remove)

Institute

  • Department Linguistik (1) (remove)

1 search hit

  • 1 to 1
  • BibTeX
  • CSV
  • RIS
  • XML
  • 10
  • 20
  • 50
  • 100
By all these lovely tokens... Merging conflicting tokenizations (2012)
Chiarcos, Christian ; Ritz, Julia ; Stede, Manfred
Given the contemporary trend to modular NLP architectures and multiple annotation frameworks, the existence of concurrent tokenizations of the same text represents a pervasive problem in everyday's NLP practice and poses a non-trivial theoretical problem to the integration of linguistic annotations and their interpretability in general. This paper describes a solution for integrating different tokenizations using a standoff XML format, and discusses the consequences from a corpus-linguistic perspective.
  • 1 to 1

OPUS4 Logo  KOBV Logo  OAI Logo  DINI Zertifikat 2007  OA Netzwerk Logo

    • Publication server
    • University Bibliography
    • University Library
    • Policy
    • Contact
    • Imprint
    • Privacy Policy
    • Accessibility

    Login