earticle

논문검색

Knowledge Discovery using Text Mining: A Programmable Implementation on Information Extraction and Categorization

초록

영어

Textual data in electronic documents today around the world have no doubt brought forward all the information one could need and as data banks build up worldwide, and access gets easier through technology, it has become easier to overlook vital facts and figures that could bring about groundbreaking discoveries. This research paper discusses in detail an implementation of Information Extraction and Categorization in the text mining application that we have implemented. To extract terms from the document we have used modified version of Porter’s Algorithm for inflectional stemming. For calculating term frequencies for categorization, we have used a domain dictionary for ‘Computer Science’ domain.

목차

Abstract
 1. Introduction
 2. Information Extraction
  2.1 Stemming
  2.2 Domain dictionary
  2.3 Exclusion List
 3. Categorization
 4. Limitations
 5. Future Work
 6. Conclusion
 References

저자정보

  • Atika Mustafa National University of Computer and Emerging Sciences-FAST, Karachi Pakistan
  • Ali Akbar National University of Computer and Emerging Sciences-FAST, Karachi Pakistan
  • Ahmer Sultan National University of Computer and Emerging Sciences-FAST, Karachi Pakistan

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.