감시 응용 프로그램에서 텍스트 인식을 위한 시각적 콘텐츠의 강력한 처리

라힘 나시르; 아마드 자밀; 칸 무함마드; 이미영; 백성욱

원문정보

Robust Processing of Visual Contents for Text Recognition in Surveillance Applications

라힘 나시르, 아마드 자밀, 칸 무함마드, 이미영, 백성욱

한국차세대컴퓨팅학회 한국차세대컴퓨팅학회 논문지 Vol.14 No.1 2018.02 pp.65-77 KCI 등재

피인용수 : 0건 (자료제공 : 네이버학술정보)

초록

영어

Textual information in natural scene images is an essential clue for several image-based applications. However, recognizing accurate text from natural scene images is a challenging task due to variation in illumination, text skewness, and variation in color, size, font, and camera alignment. In this paper, an optimal pre-processing pipeline is proposed for efficient and effective text recognition in visual surveillance applications. The pre-processing steps involve image de-sampling, de-noising, color to gray conversion, and adaptive thresholding. Furthermore, maximally stable extremal region (MSER) text detection algorithm has been used to detect textual regions which are then forwarded to the text recognition module. The recognized text by OCR is matched with a lookup table and the most likely value is extracted. Through experiments conducted for both Android and PC platforms using various datasets, it was concluded that the proposed method is more suitable for text recognition in surveillance applications.

한국어

자연 장면 이미지의 텍스트 정보는 몇 가지 이미지 기반 응용 프로그램에서 필수적 단서입니다. 그러나 자연 장면 이미지에서 정확한 텍스트를 인식하는 것은 조명, 텍스트 비대칭성 및 색상, 크기 글꼴 및 카메라 정렬 변화로 인해 어려운 작업입니다. 이 논문에서는 시각적 감시 응용 프로그램에서 효율적이고 효과적인 텍스트 인식을 위한 최적의 전처리 파이프 라인을 제안합니다. 전처리 단계에는 이미지 디 샘플링, 노이즈 제거, 색상에서 회색 변환 및 적응형 임계 값이 포함됩니다. 또한 최대 안정적 극한 영역(MSER)텍스트 검출 알고리즘을 사용하여 텍스트 인식 모듈로 전달되는 텍스트 영역을 검출했습니다. OCR에 의해 인식된 텍스트는 검색 테이블과 일치되어 가장 가능성 있는 값 이 추출됩니다. 다양한 데이터 세트를 사용하여 Android및 PC플랫폼 모두에 대해 수행된 실험을 통해 제안된 방법 이 감시 응용 프로그램에서 텍스트 인식에 더 적합하다고 결론을 내렸습니다.

요약
Abstract
1. Introduction
2. Related Works
3. Proposed Framework
  3.1 Image Pre-Processing Pipeline
  3.2 Text Detection
  3.3 Text Recognition
  3.4 Lookup Table Approach
  3.5 Edit Distance
4. Experimental Results
  4.1 ICDAR 2003
  4.2 ICDAR 2013
  4.3 IIIT 5K-Words
5. Conclusion and Future Work
Acknowledgement
참고문헌

키워드

저자정보

라힘 나시르 Nasir Rahim. 세종대학교 디지털콘텐츠연구소, 소프트웨어융합대학
아마드 자밀 Jamil Ahmad. 세종대학교 디지털콘텐츠연구소, 소프트웨어융합대학
칸 무함마드 Khan Muhammad. 세종대학교 디지털콘텐츠연구소, 소프트웨어융합대학
이미영 Mi Young Lee. 세종대학교 디지털콘텐츠연구소, 소프트웨어융합대학
백성욱 Sung Wook Baik. 세종대학교 디지털콘텐츠연구소, 소프트웨어융합대학

참고문헌

자료제공 : 네이버학술정보

earticle