원문정보
초록
영어
Data mining technology is a powerful tool to solve the problem. It is widely used to identify potentially useful information. Fuzzy principle based fuzzy C- means is a modification of commonly used C- means clustering technique. In the paper, a modified fuzzy C-means algorithm is used in the gene sequence. The modification is through taking a pseudo F statistics into the method. In the simulation, we use nodes instead of gene to verify the validity. According to the simulation, we get the optimal cluster number, the structure of the classification of the nodes. In order to test the performance of the algorithm, it has been used to process large amount of data, and results show that it has higher processing speed and stable performance. The algorithm can be used in the gene description.
목차
1. Introduction
2. Data Mining
2.1 Main Methods
2.2 The Basic Process of Data Mining
3. Clustering Algorithm
4. Modified Clustering Algorithm
4.1 Discriminant Function
4.2 Clustering Algorithm
5. Analysis of Simulation Results
6. Conclusions
References