earticle

논문검색

Recognition of Person Name in Uyghur Text Corpus using Naïve Bayes

초록

영어

This paper presents a novel approach to recognize person name in Uyghur corpus. The Recognition of a person name for Uyghur using Naive Bayes Classifier is a challenging task in intelligent computing. Uyghur person name recognition (UPNR) aims at classifying each word in a document into predefined target label (person name or others) in a linear and non-linear fashion. Some language specific rules are added to recognize person names. Moreover, some gazetteers and context patterns are added to increase its performance as it is observed that identification of rules and context patterns requires language-based knowledge to make the work better. We have used required lexical databases to prepare rules and identify the context patterns for Uyghur. Experimental results show that our approach achieves higher accuracy than previous approaches.

목차

Abstract
 1. Introduction
 2. The Uyghur Text Corpus
 3. Naïve Bayes Classifier
 4. Training Data
  4.1. Features
  4.2. Suffix and Prefix
  4.3. Stem of Word
  4.4. The Algorithm
 5. Experimental Results and Evaluations
 6. Conclusions
 References

저자정보

  • Abdurahim Mahmoud Institute of Information Science and Engineering, Xinjiang University, China
  • Tashpolat Nizamidin Institute of Information Science and Engineering, Xinjiang University, China
  • Peride Tursun School of Graduate, Xinjiang University, Urumqi, China
  • Askar Hamdulla School of Graduate, Xinjiang University, Urumqi, China

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.