earticle

논문검색

A Comparative Study on OCR using Super-Resolution for Small Fonts

초록

영어

Recently, there have been many issues related to text recognition using Tesseract. One of these issues is that the text recognition accuracy is significantly lower for smaller fonts. Tesseract extracts text by creating an outline with direction in the image. By searching the Tesseract database, template matching with characters with similar feature points is used to select the character with the lowest error. Because of the poor text extraction, the recognition accuracy is lowerd. In this paper, we compared text recognition accuracy after applying various super-resolution methods to smaller text images and experimented with how the recognition accuracy varies for various image size. In order to recognize small Korean text images, we have used super-resolution algorithms based on deep learning models such as SRCNN, ESRCNN, DSRCNN, and DCSCN. The dataset for training and testing consisted of Korean-based scanned images. The images was resized from 0.5 times to 0.8 times with 12pt font size. The experiment was performed on x0.5 resized images, and the experimental result showed that DCSCN super-resolution is the most efficient method to reduce precision error rate by 7.8%, and reduce the recall error rate by 8.4%. The experimental results have demonstrated that the accuracy of text recognition for smaller Korean fonts can be improved by adding super-resolution methods to the OCR preprocessing module.

목차

Abstract
1. INTRODUCTION
2. TESSERACT-OCR
3. SUPER-RESOLUTION
3.1 SRCNN (SUPER-RESOLUTION CNN)
3.2 ESRCNN (EXPANDED SUPER RESOLUTION CNN)
3.3 DSRCNN (DENOISING SUPER RESOLUTION CNN)
3.4 DCSCN (DEEP CNN WITH SKIP CONNECTION AND NETWORK IN NETWORK)
4. EXPERIMENTS AND RESULTS
4.1 DATASET
4.2 EXPERIMENT METHOD
4.3 RESULTS
5. CONCLUSION
REFERENCES

저자정보

  • Wooyeong Cho Department of Electronics Engineering, Kwangwoon University, Korea
  • Juwon Kwon Department of Electronics Engineering, Kwangwoon University, Korea
  • Soonchu Kwon Graduate School of Smart Convergence, Kwangwoon University, Korea
  • Jisang Yoo Department of Electronics Engineering, Kwangwoon University, Korea

참고문헌

자료제공 : 네이버학술정보

    함께 이용한 논문

      ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

      0개의 논문이 장바구니에 담겼습니다.