A locally-constrained YOLO framework for detecting small and densely-distributed building footprints

Yiqun Xie, Jiannan Cai, Rahul Bhojwani, Shashi Shekhar, Joseph Knight

Research output: Contribution to journal › Article › peer-review

37 Scopus citations


Building footprints are among the most predominant features in urban areas and provide valuable information for applications such as urban planning and solar energy suitability analysis. We aim to automatically and rapidly identify building footprints by leveraging deep learning techniques and the increased availability of remote sensing datasets at high spatial resolution. The task is computationally challenging due to the use of large training datasets and the large number of parameters. In related work, You-Only-Look-Once (YOLO) is a state-of-the-art deep learning framework for object detection. However, YOLO is limited in its capacity to identify small objects that appear in groups, which is the case for building footprints. We propose a LOcally-COnstrained (LOCO) You-Only-Look-Once framework to detect small and densely-distributed building footprints. LOCO is a variant of YOLO. Its layer architecture is determined by the spatial characteristics of building footprints, and it uses constrained regression modeling to improve the robustness of building size predictions. We also present an invariant-augmentation-based voting scheme to further improve precision in the prediction phase. Experiments show that LOCO can greatly improve the solution quality of building detection compared to related work.
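The abstract does not say which augmentations or which voting rule the scheme uses, so the following is only a minimal sketch of the general idea of augmentation-based voting at prediction time. It assumes horizontal and vertical flips as the label-preserving augmentations (plausible for overhead imagery, but an assumption), and it assumes a simple rule: keep a box only if enough augmented views agree on it. The names `vote_detections`, `detect`, `min_votes`, and `iou_thresh` are hypothetical and do not come from the paper.

```python
from typing import Callable, List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), max exclusive
Image = List[List[float]]                # rows of pixel values


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ix = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def hflip(img: Image) -> Image:
    return [row[::-1] for row in img]


def vflip(img: Image) -> Image:
    return img[::-1]


def unflip_h(box: Box, width: int) -> Box:
    """Map a box found in the horizontally flipped image back to the original."""
    x1, y1, x2, y2 = box
    return (width - x2, y1, width - x1, y2)


def unflip_v(box: Box, height: int) -> Box:
    """Map a box found in the vertically flipped image back to the original."""
    x1, y1, x2, y2 = box
    return (x1, height - y2, x2, height - y1)


def vote_detections(
    image: Image,
    detect: Callable[[Image], List[Box]],  # hypothetical detector interface
    min_votes: int = 2,                    # assumed voting threshold
    iou_thresh: float = 0.5,               # assumed agreement threshold
) -> List[Box]:
    """Keep boxes that at least `min_votes` augmented views agree on."""
    height, width = len(image), len(image[0])
    views = [
        (image, lambda b: b),
        (hflip(image), lambda b: unflip_h(b, width)),
        (vflip(image), lambda b: unflip_v(b, height)),
    ]
    # Run the detector on every view and map boxes back to original coordinates.
    per_view = [[restore(b) for b in detect(view)] for view, restore in views]

    kept: List[Box] = []
    for boxes in per_view:
        for box in boxes:
            if any(iou(box, k) >= iou_thresh for k in kept):
                continue  # an equivalent box has already been accepted
            # A view votes for the box if it contains any sufficiently overlapping box.
            votes = sum(
                any(iou(box, other) >= iou_thresh for other in view_boxes)
                for view_boxes in per_view
            )
            if votes >= min_votes:
                kept.append(box)
    return kept
```

Under these assumptions, a detection that appears in only one view is discarded, which illustrates how voting can trade some recall for precision.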

Original language: English (US)
Pages (from-to): 777-801
Number of pages: 25
Journal: International Journal of Geographical Information Science
Issue number: 4
State: Published - Apr 2 2020

Bibliographical note

Publisher Copyright:
© 2019 Informa UK Limited, trading as Taylor & Francis Group.


  • Building detection
  • YOLO
  • deep learning
  • locally constrained
  • remote sensing

