A Word Similarity Algorithm with Sememe Probability Density Ratio Based on HowNet

Rui Zheng; Huan Zhao; Xixiang Zhang

Security Engineering Research Support Center (IJHIT), International Journal of Hybrid Information Technology, Vol. 8, No. 10, October 2015, pp. 417-426

Abstract

Word similarity computation plays an important role in natural language processing (NLP). Recently, algorithms based on HowNet have been widely used and have proved to work well for Chinese word similarity computation. However, these algorithms do not consider the relationship between the number of brother nodes and the fineness of the hierarchy. This paper investigates the ratio between two words' numbers of brother nodes, called sememe probability density, and proposes an improved algorithm based on HowNet. The results indicate that the correlation measure of the proposed algorithm is 75.4%, which is considerably better than that of the major state-of-the-art method (68.1%).

Abstract
1. Introduction
2. Related Work
3. Algorithm
  3.1 HowNet
  3.2 Similarity between Sememes
  3.3 Similarity between Sets
  3.4 Similarity between Concepts
  3.5 Similarity between Words
4. Evaluation
  4.1 Data Set and Setting
  4.2 Experimental Results
5. Conclusions
ACKNOWLEDGEMENTS
References

Author Information

Rui Zheng, School of Information Science and Engineering, Hunan University, Changsha, 410082, China
Huan Zhao, School of Information Science and Engineering, Hunan University, Changsha, 410082, China
Xixiang Zhang, School of Information Science and Engineering, Hunan University, Changsha, 410082, China
