dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    HAS
total number of records online 37772 
- without coordinates 34134 
- georeferenced 3638 
- access to georeferenced data denied
- in the sea 64 
- blank catalognumber
  [ susp ]
repeated records
catalog number 22 
duplicate records
collector's name and number 755 
last update  -  error logs
of the collection:  31-05-2019 of dataCleaning:  01-06-2019
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family 48 suspect records
genus 228 suspect records
species 579 suspect records
subspecies not found
author 14176 suspect records
duplicate 1170 suspect records
other inconsistencies not found
locality data
inventory country - state - municipality
name of the country/state not found
outlier 62 suspect records
long/lat outside the world limit not found
equal long/lat not found
long or lat equal to zero 60 suspect records
long/lat in the sea (Brazil) 62 suspect records
municipality name (Brazil) 385 suspect records
coordinate unit analysis (Brazil) not found
other inconsistencies 83 suspect records

date collected
collected before 1930 88 suspect records
last update previous to date collected not found
identification year previous to date collected 43 suspect records
suggestions for blank fields
long/lat (Brazil) 29956 suggestions  
country/state name 1 suggestions
municipality name (Brazil) 11 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA