dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    HRCB
total number of records online 30793 
- without coordinates 21032 
- georeferenced 9761 
- access to georeferenced data denied
- in the sea 890 
- blank catalognumber 18 
  smaller: 1   larger: 547834 [ gap ]  
repeated records
catalog number 278 
duplicate records 36 
collector's name and number 564 
last update  -  error logs
of the collection:  14-12-2020 of dataCleaning:  15-12-2020
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family 564 suspect records
genus 186 suspect records
species 398 suspect records
subspecies 36 suspect records
author 6320 suspect records
duplicate 2643 suspect records
other inconsistencies 38 suspect records
annotations 52 annotations
locality data
inventory country - state - municipality
name of the country/state 284 suspect records
outlier 441 suspect records
long/lat outside the world limit not found
equal long/lat not found
long or lat equal to zero 753 suspect records
long/lat in the sea (Brazil) 450 suspect records
municipality name (Brazil) 1731 suspect records
coordinate unit analysis (Brazil) 25 suspect records
other inconsistencies 65 suspect records

date collected
collected before 1888 not found
identification year previous to date collected 46 suspect records
suggestions for blank fields
long/lat (Brazil) 18360 suggestions  
country/state name 15 suggestions
municipality name (Brazil) 232 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA