Segmentation of Korean Compound Nouns Using Semantic Category Analysis of Unregistered Nouns

Yu-Hwan Kang; Young-Hoon Seo

Article information

Segmentation of Korean Compound Nouns Using Semantic Category Analysis of Unregistered Nouns

한국정보기술응용학회, JITAM Vol. 11, No. 4, December 2004, pp. 95-102

Abstract

This paper proposes a method for segmenting compound nouns that contain unregistered nouns into the correct combination of unit nouns, using characteristics of person names, loanwords, and location names. A Korean person name generally consists of three syllables; only a relatively small number of syllables are used as family names, and the combination of the second and third syllables is somewhat restricted. In addition, many person names appear alongside clue words in compound nouns. Most loanwords contain one or more syllables that cannot appear in Korean words, or have syllable sequences that differ from those of usual Korean words. Location names generally appear in compound nouns with clue words that designate districts. Using these characteristics to analyze compound nouns not only makes segmentation more accurate but also helps natural language systems use the semantic categories of those unregistered nouns. Experimental results show that the precision of our method is approximately 98% on average. The precision of person-name and loanword recognition is about 94% and about 92%, respectively.
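
The cues named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the surname set, clue-word lists, loanword syllables, function names, and the simple matching order are all hypothetical and chosen only to show how a three-syllable name check, clue words, and unusual syllables might be combined. The restriction on second and third name syllables is not modeled here.

```python
# Illustrative only: the paper's actual lexicons and rules are not given
# in the abstract, so every resource below is a hypothetical stand-in.
SURNAMES = {"김", "이", "박", "최", "정", "강", "서"}   # assumed family-name syllables
PERSON_CLUES = ("교수", "박사", "대통령", "선수")        # assumed titles that follow a name
LOCATION_CLUES = ("시", "군", "구", "동", "도")          # assumed district markers
LOANWORD_SYLLABLES = {"퓨", "텔", "튜", "쿼", "팩"}      # assumed to be rare in Korean words


def is_person_name(word: str) -> bool:
    """Three syllables whose first syllable is a known family name."""
    return len(word) == 3 and word[0] in SURNAMES


def is_loanword(word: str) -> bool:
    """True if the word contains a syllable from the assumed loanword set."""
    return any(ch in LOANWORD_SYLLABLES for ch in word)


def segment(compound: str):
    """Return (unit_nouns, category_of_first_unit); category is None if no cue fires."""
    # Person name followed by a clue word, e.g. name + title.
    for clue in PERSON_CLUES:
        if compound.endswith(clue):
            head = compound[: -len(clue)]
            if is_person_name(head):
                return [head, clue], "person"
    # Location name ending in a district marker, followed by at least one more syllable.
    for i in range(2, len(compound) - 1):
        if compound[i] in LOCATION_CLUES:
            return [compound[: i + 1], compound[i + 1 :]], "location"
    return [compound], None


print(segment("김철수교수"))
print(segment("청주시교육청"))
print(is_loanword("컴퓨터"))
```

In this sketch the category returned with the segmentation is what a downstream system could use as the semantic category of the unregistered noun. A realistic system would need a much larger lexicon and a way to resolve cases where several cues compete.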

Author information

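Yu-Hwan Kang (강유환). Doctoral student, Department of Computer Engineering, Chungbuk National University
Young-Hoon Seo (서영훈). Professor, School of Electrical, Electronic and Computer Engineering, Chungbuk National University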
