dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    NMNH-Botany_BR
total number of records on-line 37662 
- without coordinates 35373 
- georeferenced 2289 
- access to georeferenced data denied
- in the sea 45 
- blank catalognumber
  smaller: 2028958   larger: 2854088 [ gap ]   [ susp ]
repeated records
catalog number
duplicate records
collector's name and number
last update
of the collection:  13-02-2008 of dataCleaning:  05-07-2010
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - types
family not found
genus not found
species 192 suspect records
subspecies not found
author 6944 suspect records
duplicate not found
other inconsistencies 152 suspect records
locality data
inventory country - state - municipality
name of the country/state 18 suspect records
outlier 4 suspect records
long/lat outside the world limit not found
equal long/lat not found
long or lat equal to zero 5 suspect records
long/lat in the sea (Brazil) 31 suspect records
municipality name (Brazil) 429 suspect records
coordinate unit analysis (Brazil) 4 suspect records
other inconsistencies 3 suspect records

date collected
collected before 1900 5291 suspect records
last update previous to date collected not found
identification year previous to date collected 21 suspect records
suggestions for blank fields
long/lat (Brazil) 4532 suggestions  
country/state name 18 suggestions
municipality name (Brazil) 1316 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA