earticle

논문검색

KNN based Machine Learning Approach for Text and Document Mining

초록

영어

Text Categorization (TC), also known as Text Classification, is the task of automatically classifying a set of text documents into different categories from a predefined set. If a document belongs to exactly one of the categories, it is a single-label classification task; otherwise, it is a multi-label classification task. TC uses several tools from Information Retrieval (IR) and Machine Learning (ML) and has received much attention in the last years from both researchers in the academia and industry developers. In this paper, we first categorize the documents using KNN based machine learning approach and then return the most relevant documents.

목차

Abstract
 1. Introduction
 2. Classification Methods
  2.1. Naive Bayes
  2.2. Term Graph Model
  2.3. k-Nearest Neighbors
 3. Experimental Results
  3.1. Dataset
  3.2. Implementation
  3.3. Results
 4. Conclusion
 References

저자정보

  • Vishwanath Bijalwan Asst. Prof., Institute of technology Gopeshwar, Chamoli, Uttarakhand, India
  • Vinay Kumar Asst. Prof., Institute of technology Gopeshwar, Chamoli, Uttarakhand, India
  • Pinki Kumari Bansathali University, Rajasthan, India
  • Jordan Pascual Department of Computer Science, University of Oviedo, Spain

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.