TY - GEN
T1 - A geographical approach for metadata quality improvement in biological observation databases
AU - Cugler, Daniel Cintra
AU - Medeiros, Claudia Bauzer
AU - Shekhar, Shashi
AU - Toledo, Luís Felipe
N1 - Copyright:
Copyright 2014 Elsevier B.V., All rights reserved.
PY - 2013
Y1 - 2013
N2 - This paper addresses the problem of improving the quality of metadata in biological observation databases, in particular those associated with observations of living beings, and which are often used as a starting point for biodiversity analyses. Poor quality metadata lead to incorrect scientific conclusions, and can mislead experts. Thus, it is important to design and develop methods to detect and correct metadata quality problems. This is a challenging problem because of the variety of issues concerning such metadata, e.g., misnaming of species, location uncertainty and imprecision concerning where observations were recorded. Related work is limited because it does not adequately model such issues. We propose a geographic approach based on expertled classification of place and/or range mismatch anomalies detected by our algorithms. Our approach enables detection of anomalies in both species' reported geographic distributions and in species' identification. Our main contribution is our geographic algorithm that deals with uncertain/imprecise locations. Our work is tested using a case study with the Fonoteca Neotropical Jacques Vielliard, one of the 10 largest animal sound collections in the world.
AB - This paper addresses the problem of improving the quality of metadata in biological observation databases, in particular those associated with observations of living beings, and which are often used as a starting point for biodiversity analyses. Poor quality metadata lead to incorrect scientific conclusions, and can mislead experts. Thus, it is important to design and develop methods to detect and correct metadata quality problems. This is a challenging problem because of the variety of issues concerning such metadata, e.g., misnaming of species, location uncertainty and imprecision concerning where observations were recorded. Related work is limited because it does not adequately model such issues. We propose a geographic approach based on expertled classification of place and/or range mismatch anomalies detected by our algorithms. Our approach enables detection of anomalies in both species' reported geographic distributions and in species' identification. Our main contribution is our geographic algorithm that deals with uncertain/imprecise locations. Our work is tested using a case study with the Fonoteca Neotropical Jacques Vielliard, one of the 10 largest animal sound collections in the world.
UR - http://www.scopus.com/inward/record.url?scp=84893435570&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893435570&partnerID=8YFLogxK
U2 - 10.1109/eScience.2013.14
DO - 10.1109/eScience.2013.14
M3 - Conference contribution
AN - SCOPUS:84893435570
SN - 9780768550831
T3 - Proceedings - IEEE 9th International Conference on e-Science, e-Science 2013
SP - 212
EP - 220
BT - Proceedings - IEEE 9th International Conference on e-Science, e-Science 2013
PB - IEEE Computer Society
T2 - 9th IEEE International Conference on e-Science, e-Science 2013
Y2 - 22 October 2013 through 25 October 2013
ER -