Article Information
Abstract
English
Script identification is one of the challenging steps in an optical character recognition (OCR) system for multi-script documents. Some results have been reported for both Indian and non-Indian scripts, but research in this field is still emerging. This paper presents work on identifying Gurmukhi and English scripts at the word level, and on distinguishing English numerals from Gurmukhi text. Gabor feature extraction is one of the most popular methods for script recognition, and this paper presents a zone-based Gabor feature extraction technique. After normalization, the word image is divided into zones of different sizes, and features are extracted from each zone in several directions using Gabor filters. The script is then determined with an SVM classifier. Experiments on Gurmukhi and English script recognition show that the proposed technique improves on traditional Gabor feature extraction without zoning. In future work, the approach could be extended to other scripts.
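The pipeline described in the abstract (zoning, then directional Gabor filtering per zone) can be illustrated with a minimal sketch. It is not the paper's implementation: the zone grids, kernel size, wavelength, sigma, number of orientations, and the choice of mean absolute response as the feature are all assumptions made for illustration, and every function name is hypothetical. Only the Python standard library is used.

```python
import math

def gabor_kernel(size, theta, wavelength, sigma, gamma=0.5):
    """Real part of a 2-D Gabor kernel as a nested list (size must be odd)."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the sinusoid varies along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
            row.append(envelope * math.cos(2 * math.pi * xr / wavelength))
        kernel.append(row)
    return kernel

def split_zones(image, rows, cols):
    """Cut the image into a rows x cols grid of rectangular zones."""
    h, w = len(image), len(image[0])
    zones = []
    for r in range(rows):
        for c in range(cols):
            y0, y1 = r * h // rows, (r + 1) * h // rows
            x0, x1 = c * w // cols, (c + 1) * w // cols
            zones.append([line[x0:x1] for line in image[y0:y1]])
    return zones

def mean_abs_response(zone, kernel):
    """Mean absolute filter response over a zone (correlation, zero padding)."""
    h, w = len(zone), len(zone[0])
    k = len(kernel)
    half = k // 2
    total = 0.0
    for i in range(h):
        for j in range(w):
            acc = 0.0
            for u in range(k):
                for v in range(k):
                    y, x = i + u - half, j + v - half
                    if 0 <= y < h and 0 <= x < w:
                        acc += zone[y][x] * kernel[u][v]
            total += abs(acc)
    return total / (h * w)

def zone_gabor_features(image, grids=((1, 1), (2, 2)), n_orient=4,
                        ksize=5, wavelength=4.0, sigma=2.0):
    """One feature per (zone, orientation); grids and parameters are assumed."""
    kernels = [gabor_kernel(ksize, k * math.pi / n_orient, wavelength, sigma)
               for k in range(n_orient)]
    features = []
    for rows, cols in grids:
        for zone in split_zones(image, rows, cols):
            for kernel in kernels:
                features.append(mean_abs_response(zone, kernel))
    return features

# Toy 8x8 "word image" with vertical stripes, standing in for a normalized word.
image = [[1, 1, 0, 0, 1, 1, 0, 0] for _ in range(8)]
features = zone_gabor_features(image)
print(len(features))  # (1 + 4) zones x 4 orientations = 20 features
```

In a full system, a vector like `features` would be computed for every word image and passed to an SVM classifier (for example scikit-learn's `SVC`), which this sketch leaves out.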
Table of Contents
1. Introduction
1.1. Gurumukhi Script
1.2. English Script
1.3. English Numerals
2. Related Works
3. Feature Extraction
3.1. 2 Dimensional Gabor Function
3.2. Zone-Based Gabor Feature Extraction
4. Classification
4.1. SVM Classifier
5. Experimental Results and Discussion
5.1. Dataset Preparation
5.2. Experiment Results with Different Kernel Functions
5.3. Comparison with Existing Methods
6. Conclusion
References