Practical Segmentation Methods for Logical and Geometric Layout Analysis to Improve Scanned PDF Accessibility to Vision Impaired

Abstract

The use of electronic documents has increased rapidly in recent decades, and PDF is one of the most commonly used electronic document formats. A scanned PDF is an image and does not actually contain any text. For a vision-impaired user who depends on a screen reader to access this information, this format is not useful. Addressing PDF accessibility through assistive technology has therefore become an important concern. PDF layout analysis provides valuable formatting information that supports PDF component classification. This classification facilitates tag generation, and accurate tagging produces a searchable and navigable scanned PDF document. This paper describes several practical segmentation methods that are easy to implement and efficient for PDF layout analysis, so that a scanned PDF document can be navigated or searched using assistive technologies.

Table of Contents

Abstract
 1. Introduction
 3. Pre Processing
  3.1. Format Conversion
  3.2. Binarization
  3.3. Scaling Image
  3.4. Margin Removal
  3.5. Skew Detection and Correction
 4. Block Segmentation
 5. Text-Image Segmentation
 6. Line Segmentation
 7. Word Segmentation
 8. Vertical-Horizontal-Recursive-Segmentation
 9. Conclusion
 References

Author Information

  • Azadeh Nazemi, Electrical and Computer Engineering Spatial Sciences, Curtin University, Perth, WA, Australia
  • Iain Murray, Electrical and Computer Engineering Spatial Sciences, Curtin University, Perth, WA, Australia
  • David A. Mc Meekin, Electrical and Computer Engineering Spatial Sciences, Curtin University, Perth, WA, Australia
