• search hit 30 of 137
Back to Result List

Developing a finite-state morphological analyzer for Urdu and Hindi

  • We introduce and discuss a number of issues that arise in the process of building a finite-state morphological analyzer for Urdu, in particular issues with potential ambiguity and non-concatenative morphology. Our approach allows for an underlyingly similar treatment of both Urdu and Hindi via a cascade of finite-state transducers that transliterates the very different scripts into a common ASCII transcription system. As this transliteration system is based on the XFST tools that the Urdu/Hindi common morphological analyzer is also implemented in, no compatibility problems arise.

Download full text files

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Tina Bögel, Miriam Butt, Annette Hautli, Sebastian Sulger
URN:urn:nbn:de:kobv:517-opus-27155
Publication type:Conference Proceeding
Language:English
Publication year:2008
Publishing institution:Universität Potsdam
Release date:2008/12/11
Organizational units:Extern / Extern
DDC classification:4 Sprache / 40 Sprache / 400 Sprache
Collection(s):Universität Potsdam / Tagungsbände/Proceedings (nicht fortlaufend) / Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 / II Regular Papers
License (German):License LogoKeine öffentliche Lizenz: Unter Urheberrechtsschutz
External remark:
The complete edition of the proceedings "Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 ; Revised Papers" is available:
URN urn:nbn:de:kobv:517-opus-23812
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.