원문정보
초록
영어
The main purpose of communication is to transfer information from one corner to another of the world. The information is basically stored in forms of documents or files created on the basis of requirements. So, the randomness of creation and storage makes them unstructured in nature. As a consequence, data retrieval and modification become hard nut to crack. The data, that is required frequently, should maintain certain pattern. Otherwise, problems like retrieving erroneous data or anomalies in modification or time consumption in retrieving process may hike. As every problem has its own solution, these unstructured documents have also given the solution named unstructured document categorization. That means, the collected unstructured documents will be categorized based on some given constraints. This paper is a review which deals with different techniques like text and data mining, genetic algorithm, lexical chaining, binarization method to reach the fulfillment of desired unstructured document categorization appeared in the literature.
목차
1 Introduction
2 Overview
A. Past
B. Present
C. Future
3 Approach
A. Text and Data mining Technique
B. Lexical Chain Technique
C. Genetic Algorithm (GA) Technique
D. Artificial Neural Network technique
E. Binarization
4 Application
5 Evaluation
A. TEXT AND DATA MINING TECHNIQUE:
B. LEXICAL CHAIN MECHANISM:
C. GENETIC ALGORITMIC APPROACH:
D. ARTIFICIAL NEURAL NETWORK:
E. BINARIZATION:
6 Discussion & Conclusion
References