Abstract
Detecting overlapping text from map images is a challenging problem. Previous algorithms generally assume specific cartographic styles (e.g., road shapes and text format) and are difficult to adjust for handling different map types. In this paper, we build on our previous text recognition work, Strabo, to develop an algorithm for detecting overlapping characters from non-text symbols. We call this algorithm Overlapping Text Detection (OTD). OTD uses the recognition results and locations of detected text labels (from Strabo) to detect potential areas that contain overlapping text. Next, OTD classifies these areas as either text or non-text regions based on their shape descriptions (including the ratio of number of foreground pixels to area size, number of connected components, and number of holes). The average precision and recall of OTD in classifying text and non-text regions were 77% and 86%, respectively. We show that OTD improved the precision and recall of text detection in Strabo by 19% and 41%, respectively, and produced higher accuracy compared to a state-of-the-art text/graphic separation algorithm.
Original language | English (US) |
---|---|
Journal | IS and T International Symposium on Electronic Imaging Science and Technology |
DOIs | |
State | Published - 2016 |
Externally published | Yes |
Event | 23rd Document Recognition and Retrieval 2016, DRR 2016 - San Francisco, United States Duration: Feb 14 2016 → Feb 18 2016 |
Bibliographical note
Publisher Copyright:© 2016 Society for Imaging Science and Technology.
Keywords
- Digital map processing
- Geographic information system
- Optical character recognition
- Spatial databases