Interest group about data quality, cleaning and fitness-for-use. TDWG data quality interest group.

Share |

Pages home > References



Last updated 901 days ago by Lee Belbin

Please add to this list any references to useful articles, papers, presentations etc relating to 'data quality'

Ariño, A.H., Chavan, V., Faith, D.P. 2013. Assessment of user needs of primary biodiversity data: Analysis, concerns, and challenges. Biodiversity Informatics 8(2) 59-93.

Chapman, A. D. 2005. Principles of Data Quality, version 1.0. Report for the Global Biodiversity Information Facility, Copenhagen.

Chapman, A. D. 2005. Principles and Methods of Data Cleaning – Primary Species and SpeciesOccurrence Data, version 1.0. Report for the Global Biodiversity Information Facility, Copenhagen.

Hill, A.W., Guralnick, R., Flemons, P., Beaman, R., Wieczorek, J., Ranipeta, A., Chavan, V. and Remsen, D. 2009. Location, location, location: utilizing pipelines and services to more effectively georeference the world's biodiversity data. BMC Bioinformatics 2009, 10(Suppl 14):S3 doi:10.1186/1471-2105-10-S14-S3

Mathew, C., Güntsch, A., Obst, M., Vicario, S., Haines, R., Williams, A., de Jong, Y., Goble, C. 2014. A semiautomated workflow for biodiversity data retrieval, cleaning, and quality control. Biodiversity Data Journal 2:


e4221. doi: 10.3897/BDJ.2.e4221

Otegui, J., Ariño, A.H., Encinas, M.A., Pando, F. 2013. Assessing the Primary Data Hosted by the Spanish Node of the Global Biodiversity Information Facility (GBIF). PLoS ONE 8(1): e55144. doi:10.1371/journal.pone.0055144

Pipino, L.L., Lee, Y.W. and Wang, R.Y. 2002. Data quality assessment. Commun. ACM 45, 4 (April 2002), 211-218. DOI=10.1145/505248.506010 http://doi.acm.org/10.1145/505248.506010

Wang, R.Y. 2005. Raising the Bar for Data Quality in the New Millennium. Powerpoint Presentation.

Lee Belbin 901 days ago


Barnett, V. and Lewis, T. 1994. Outliers in Statistical Data. Chichester, UK: Wiley and Sons.

Chapman, A.D. 1999. Quality Control and Validation of Point-Sourced Environmental Resource Data. pp. 409-418 in Lowell, K. and Jaton, A. eds. Spatial accuracy assessment: Land information uncertainty in natural resources. Chelsea, MI: Ann Arbor Press.

 Chapman, A.D., Hijmans, R., Marino, A, De Giovanni, R. and de Souza, S. (2006). Using the concept of “Outlierness” to identify suspect records in Primary Species Occurrence Data p. 39 in The Road to Productive Partnerships. The 21st Annual Meeting of the Society for the Preservation of Natural History Collections and the Natural Science Collections Alliance 2006 Annual Meeting. Program & Abstracts. Albuquerque, New Mexico 23-27.  May 2006.

Chapman, A.D. and Wieczorek, J. (eds). (2006). Guide to Best Practices for Georeferencing. BioGeomancer Consortium. Copenhagen: Global Biodiversity Information Facility. 90pp. ISBN: 87-92020-00-3. http://www.gbif.org/orc/?doc_id=1288




CSPR Assessment Panel. 2004. Scientific Data and Information. A Report of the CSPR Assessment Panel. 42pp. ICSI: Paris, France.

Guralnick, R.P., Wieczorek, J., Beaman, R., Hijmans, R.J., and the BioGeomancer Working Group. 2006. BioGeomancer: Automated georeferencing to map the world’s biodiversity data. PLoS Biol 4(11): e381. DOI: 10.1371/journal.pbio.0040381. http://biology.plosjournals.org/perlserv/?request=get-document&doi=10.1371%2Fjournal.pbio.0040381  

Maletic, J.I. and Marcus, A. 2000. Data Cleansing: Beyond Integrity Analysis. pp. 200-209 in Proceedings of the Conference on Information Quality (IQ2000). Boston: Massachusetts Institute of Technology.



Marino, A., Pavarin, F., de Souza, S. and Chapman, A.D. (2004).  geoLoc and spOutlier: on-line tools for geocoding and validating biological data. In Proceedings of Inter-American Workshop on Environmental Data.  http://www.cria.org.br/eventos/iaed/amarino_pre.html Powerpoint Presentation. http://tinyurl.com/mbv93dl


Peterson, A.T. et al. 2004. Detecting Errors in Biological Data based on collectors' itineraries. Bull. British Ornithologists Club 124(2): 143-151. http://tinyurl.com/khzarrn

Arthur Chapman 901 days ago


Allan Koch Veiga 900 days ago