Finding and Typing New Named Entities in Tibetan from Chinese-Tibetan Parallel Corpora

Abstract

English

There is currently much interest in the automatic acquisition of entities, with the goal of Named Entity Recognition (NER). However, previous work has focused primarily on major languages, which have large, structured, and semantically rich knowledge bases and large corpora annotated with NER tags. In this paper, we describe a method for Chinese-Tibetan bilingual named entity recognition that uses an easily obtainable bilingual dictionary and parallel political corpora. The method has two distinct steps: the first identifies entity candidates in Tibetan, and the second types each entity into a semantic class. We then test the approach on the dataset and analyze the NE typing errors.

Table of Contents

Abstract
 1. Introduction
 2. Motivation
 3. The Proposed Bilingual NER Method
  3.1. Large-Scale Harvesting of Entities
  3.2. Joint Disambiguation
 4. Evaluation
 5. Related Work
 6. Conclusion and Future Work
 Acknowledgements
 References

Author Information

  • Lirong Qiu, School of Information Engineering, Minzu University of China, Beijing, China
