Building Morphological Analyzer for Nepali
Authors
Bhat Shahid Mushtaq, Rai Rupesh
Source
Journal of Modern Languages, 2012, Vol. 22, pp. 45-58
Abstract
A morphological analyzer is a fundamental tool in natural language processing (NLP) that generates the morphological analyses of a given word form. It can be used to improve the accuracy of POS tagging, chunking, syntactic parsing, word sense disambiguation (WSD), information retrieval (IR), and machine translation (MT) systems. This paper describes an ongoing effort to develop a Nepali morphological analyzer using the open-source Apertium platform (lttoolbox). Since this is the initial stage of the project, we have confined our work to inflectional morphology. So far, we have covered all the possible categories as per the LDC-IL POS tag set for Nepali. Currently, the Nepali morphological analyzer covers 20,000 words, classified into 219 paradigms.
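As a rough illustration of the word-and-paradigm idea the abstract refers to, the following minimal Python sketch looks up a surface form by splitting it into a stem and a suffix and checking both against a lexicon and a paradigm table. It is not the authors' implementation and does not use Apertium or lttoolbox; the paradigm name `noun_regular`, the two-entry lexicon, the romanized forms, and the Apertium-style tags are all assumptions chosen for illustration.

```python
from typing import Dict, List

# Hypothetical toy paradigm table: paradigm name -> {suffix: tags}.
# The tags imitate Apertium-style notation but are illustrative only.
PARADIGMS: Dict[str, Dict[str, str]] = {
    "noun_regular": {
        "": "<n><sg>",      # bare stem, singular
        "haru": "<n><pl>",  # romanized plural marker
    },
}

# Hypothetical toy lexicon: stem -> paradigm name (romanized stems).
LEXICON: Dict[str, str] = {
    "ghar": "noun_regular",   # "house"
    "kitab": "noun_regular",  # "book"
}


def analyze(word: str) -> List[str]:
    """Return every stem+tags analysis licensed by the lexicon and paradigms."""
    analyses: List[str] = []
    # Try every split of the word into a non-empty stem and a suffix.
    for i in range(1, len(word) + 1):
        stem, suffix = word[:i], word[i:]
        paradigm = LEXICON.get(stem)
        if paradigm is None:
            continue
        tags = PARADIGMS[paradigm].get(suffix)
        if tags is not None:
            analyses.append(stem + tags)
    return analyses


if __name__ == "__main__":
    for form in ("ghar", "gharharu", "xyz"):
        print(form, analyze(form))
```

Because each stem only points to a paradigm name, adding a new word to such a table costs one lexicon entry, which is the practical appeal of grouping many words under a small number of paradigms.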
Keywords
Morphological analyzer, Word and paradigm model, Apertium, LT-Tool Box, Paradigm, Concatenative Morphology, Machine Translation, Devnagri, Transliteration
Address
Central Institute of Indian Languages, Linguistic Data Consortium for Indian Languages, India (both authors)