Unstructured Document Categorization: A Study

Abstract

English

The main purpose of communication is to transfer information from one part of the world to another. This information is typically stored as documents or files created as requirements arise. Because such documents are created and stored in an ad hoc manner, they are unstructured in nature. As a consequence, data retrieval and modification become difficult. Data that is required frequently should follow a consistent pattern; otherwise, problems such as retrieval of erroneous data, modification anomalies, and longer retrieval times may increase. Unstructured document categorization addresses this problem: the collected unstructured documents are categorized according to given constraints. This paper reviews techniques from the literature for achieving the desired unstructured document categorization, including text and data mining, genetic algorithms, lexical chaining, and binarization.

Table of Contents

Abstract
 1 Introduction
 2 Overview
  A. Past
  B. Present
  C. Future
 3 Approach
  A. Text and Data Mining Technique
  B. Lexical Chain Technique
  C. Genetic Algorithm (GA) Technique
  D. Artificial Neural Network Technique
  E. Binarization
 4 Application
 5 Evaluation
  A. TEXT AND DATA MINING TECHNIQUE
  B. LEXICAL CHAIN MECHANISM
  C. GENETIC ALGORITHMIC APPROACH
  D. ARTIFICIAL NEURAL NETWORK
  E. BINARIZATION
 6 Discussion & Conclusion
 References

Author Information

  • Debnath Bhattacharyya, Computer Science and Engineering Department, Heritage Institute of Technology
  • Poulami Das, Computer Science and Engineering Department, Heritage Institute of Technology
  • Debashis Ganguly, Computer Science and Engineering Department, Heritage Institute of Technology
  • Kheyali Mitra, Computer Science and Engineering Department, Heritage Institute of Technology
  • Purnendu Das, Computer Science and Engineering Department, Heritage Institute of Technology
  • Samir Kumar Bandyopadhyay, Department of Computer Science and Engineering, University of Calcutta
  • Tai-hoon Kim, Hannam University
