dataCleaning
This tool helps curators identify possible errors and standardize data. Records are not modified: the system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so any suggestion is more than welcome.

collection:    HUNEB
total number of records online 14597 
- without coordinates 5848 
- georeferenced 8749 
- access to georeferenced data denied
- in the sea 710 
- blank catalognumber 127 
 
repeated records
catalog number 127 
duplicate records 107 
collector's name and number 1471 
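The repeated-record counts above could be produced by grouping records on a key and keeping only the groups with more than one member. The following is a minimal sketch of that idea, not the tool's actual implementation: the function `find_repeated`, the field names `collector` and `collectornumber`, the case-insensitive comparison, and the decision to skip blank keys are all assumptions made for illustration. Consistent with the note that records are not modified, it only reports indexes of suspect records.

```python
from collections import defaultdict


def find_repeated(records, key_fields):
    """Return {key: [record indexes]} for keys shared by more than one record.

    Hypothetical helper: comparison is case-insensitive and ignores
    surrounding whitespace; records with any blank key field are skipped.
    """
    groups = defaultdict(list)
    for index, record in enumerate(records):
        values = tuple((record.get(f) or "").strip().lower() for f in key_fields)
        if all(values):  # assumption: blank keys are reported separately
            groups[values].append(index)
    # Keep only the keys that occur more than once; the input is untouched.
    return {key: idx for key, idx in groups.items() if len(idx) > 1}


# Invented sample data, for illustration only.
records = [
    {"catalognumber": "100", "collector": "Silva, J.", "collectornumber": "12"},
    {"catalognumber": "100", "collector": "Souza, M.", "collectornumber": "7"},
    {"catalognumber": "101", "collector": "silva, j.", "collectornumber": "12"},
    {"catalognumber": "", "collector": "Lima, A.", "collectornumber": ""},
]

repeated_catalog = find_repeated(records, ["catalognumber"])
repeated_collector = find_repeated(records, ["collector", "collectornumber"])
```

Grouping on a tuple of fields lets the same helper cover both the "catalog number" check and the "collector's name and number" check.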
last update  -  error logs
of the collection: 05-03-2020
of dataCleaning: 06-03-2020

taxonomic data
inventory: scientific name - collector - identifier - types
kingdom not found
family 83 suspect records
genus 137 suspect records
species 325 suspect records
subspecies not found
author 5177 suspect records
duplicate 3344 suspect records
other inconsistencies 24 suspect records
annotations 125 annotations
locality data
inventory: country - state - municipality
name of the country/state 47 suspect records
outlier 30 suspect records
long/lat outside the world limit not found
equal long/lat not found
long or lat equal to zero 517 suspect records
long/lat in the sea (Brazil) 497 suspect records
municipality name (Brazil) 1929 suspect records
coordinate unit analysis (Brazil) 3 suspect records
other inconsistencies 127 suspect records
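Three of the coordinate checks listed above (outside the world limit, equal long/lat, long or lat equal to zero) need no reference data and can be sketched in a few lines. This is an illustrative sketch under assumptions, not the tool's code: the function name `coordinate_flags` is hypothetical, coordinates are assumed to be decimal degrees, and the world limits are taken as longitude in [-180, 180] and latitude in [-90, 90]. Checks such as "in the sea" or "municipality name" would need a gazetteer or map layer and are not covered.

```python
def coordinate_flags(longitude, latitude):
    """Return a list of reasons why a coordinate pair looks suspect.

    Hypothetical helper; values are assumed to be decimal degrees.
    The record itself is never changed, only flagged.
    """
    if longitude is None or latitude is None:
        return ["without coordinates"]

    flags = []
    if not (-180 <= longitude <= 180 and -90 <= latitude <= 90):
        flags.append("long/lat outside the world limit")
    if longitude == latitude:
        flags.append("equal long/lat")
    if longitude == 0 or latitude == 0:
        flags.append("long or lat equal to zero")
    return flags


# Invented coordinate pairs, for illustration only.
samples = [(-39.5, -12.9), (0, -12.9), (-12.9, -12.9), (200, 10), (None, 5)]
report = [coordinate_flags(lon, lat) for lon, lat in samples]
```

A single record can carry several flags at once; for example, the pair (0, 0) is both "equal long/lat" and "long or lat equal to zero".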

date collected
collected before 1930 not found
identification year earlier than date collected 1 suspect record
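The two date checks above can be sketched as simple year comparisons. This is only a sketch under assumptions: the function name `date_flags` is hypothetical, the inputs are assumed to be integer years (or `None` when blank), and the 1930 cutoff is taken from the label of the check rather than from any documented rule.

```python
def date_flags(year_collected, year_identified, cutoff_year=1930):
    """Return a list of reasons why the dates of a record look suspect.

    Hypothetical helper working on years only; blank values (None)
    are ignored rather than flagged.
    """
    flags = []
    if year_collected is not None and year_collected < cutoff_year:
        flags.append(f"collected before {cutoff_year}")
    if (
        year_collected is not None
        and year_identified is not None
        and year_identified < year_collected
    ):
        # A specimen cannot be identified before it was collected.
        flags.append("identified before collected")
    return flags


# Invented years, for illustration only.
suspect = date_flags(2005, 1998)
clean = date_flags(2005, 2010)
```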
suggestions for blank fields
long/lat (Brazil) 5507 suggestions  
country/state name 1 suggestion
municipality name (Brazil) 36 suggestions

Centro de Referência em Informação Ambiental, CRIA