Recognizing text in raster maps

Yao Yi Chiang, Craig A. Knoblock

Research output: Contribution to journal › Article › peer-review

37 Scopus citations

Abstract

Text labels in maps provide valuable geographic information by associating place names with locations. This information from historical maps is especially important since historical maps are very often the only source of past information about the earth. Recognizing the text labels is challenging because heterogeneous raster maps have varying image quality and complex map contents. In addition, the labels within a map do not follow a fixed orientation and can have various font types and sizes. Previous approaches typically handle a specific type of map or require intensive manual work. This paper presents a general approach that requires a small amount of user effort to semi-automatically recognize text labels in heterogeneous raster maps. Our approach exploits a few examples of text areas to extract text pixels and employs cartographic labeling principles to locate individual text labels. Each text label is then rotated automatically to horizontal and processed by conventional OCR software for character recognition. We compared our approach to a state-of-the-art commercial OCR product using 15 raster maps from 10 sources. Our evaluation shows that our approach enabled the commercial OCR product to handle raster maps and together produced significantly higher text recognition accuracy than using the commercial OCR alone.

Original language: English (US)
Pages (from-to): 1-27
Number of pages: 27
Journal: GeoInformatica
Volume: 19
Issue number: 1
State: Published - Jan 2014
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2014, Springer Science+Business Media New York.

Keywords

  • GIS
  • Map processing
  • OCR
  • Raster maps
  • Text recognition
