Text Clustering using Semantic Terms

Sun Park; Seong Ro Lee

Text Clustering using Semantic Terms

원문정보

Sun Park, Seong Ro Lee

보안공학연구지원센터(IJHIT) International Journal of Hybrid Information Technology Vol.5 No.2 2012.04 pp.135-140

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

In traditional text clustering, documents appear terms frequency without considering the semantic information of each document (i.e., vector model). The property of vector model may be incorrectly classified documents into different clusters when documents of same cluster lack the shared terms. Recently, to overcome this problem uses knowledge based approaches. However, these approaches have an influence of structure of document set and a cost problem of constructing ontology. In this paper, we propose a text clustering method using semantic terms for clustering label and term weights. The semantic terms of clustering label can well express the internal structure of document clusters using non-negative matrix factorization (NMF). It can also improve the quality of text clustering which uses the term weights by WordNet. The experimental results demonstrate that the proposed method achieves better performance than other text clustering methods.

키워드

저자정보

Sun Park Institute Research of Information Science and Engineering, Mokpo National University
Seong Ro Lee Department of Information and Electronic, Mokpo Naitional University

참고문헌

자료제공 : 네이버학술정보

함께 이용한 논문

※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

0개의 논문이 장바구니에 담겼습니다.

earticle