원문정보
초록
영어
Increasing access to large-scale, high-dimensional and non-stationary streams in many real applications has made it necessary to design new dynamic classification algorithms. Most existing approaches for the textual stream classification are able to train the model relying on labeled data. However, only a limited number of instances can be labeled in a real streaming environment since large-scale data appear at a high speed. Therefore, it is useful to make unlabeled instances available for training and updating the ensemble models. In this paper, we present a new ensemble framework with partial labeled instances for learning from the textual stream. A new semi-supervised cluster-based classifier is proposed as the sub-classifier in our approach. In order to integrate these sub-classifiers, we propose an adaptive selection method. Empirical evaluation of textual streams reveals that our approach outperforms state-of-the-art stream classification algorithms.
목차
1. Introduction
2. Related Work
3. Cluster Classifier Ensemble Model with Partial Labeled Instances (CCEM-PL)
4. Cluster-based Classifiers (CCs) Using Partially Labeled Instances
5. Selection method and Voting Method
5.1. Selection Method and Accuracy Weight
5.2. Voting Strategy
6. Experiments
6.1. Datasets
6.2. Compared Models
6.3. Results
Acknowledgements
References