dataCleaning
This tool aims to help curators identify possible errors and standardize data. Records are not modified: the system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so any suggestion is more than welcome.

Select a collection 
collection:    ICN
total number of records online 178848 
- without coordinates 145853 
- georeferenced 32995 
- access to georeferenced data denied
- in the sea 380 
- blank catalognumber 1 
  smallest: 2   largest: 995623
repeated records
catalog number 108 
duplicate records 23 
collector's name and number 4909 
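
The page does not describe how repeated records are detected. As a minimal sketch, assuming a repeat simply means the same key (a catalog number, or a collector's name and number pair) occurring in more than one record, the counting could look like this. The function name `repeated_keys` and the sample records are hypothetical and only serve as illustration.

```python
from collections import Counter
from typing import Hashable, Iterable, List


def repeated_keys(keys: Iterable[Hashable]) -> List[Hashable]:
    """Return the keys that occur more than once, in first-seen order."""
    counts = Counter(keys)
    return [key for key, n in counts.items() if n > 1]


# Hypothetical records: (catalog number, collector name, collector number)
records = [
    ("1001", "Collector A", "12"),
    ("1002", "Collector A", "12"),
    ("1001", "Collector B", "7"),
]

# Assumption: a repeated catalog number is one shared by two or more records.
repeated_catalog = repeated_keys(r[0] for r in records)

# Assumption: collector name and number are compared together as a pair.
repeated_collector = repeated_keys((r[1], r[2]) for r in records)
```

Both results are lists of suspect keys to review; nothing in the records themselves is changed.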
last update
of the collection: 03-04-2024
of dataCleaning: 04-04-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family not found
genus 142 suspect records
species 788 suspect records
subspecies 5 suspect records
author 745 suspect records
duplicate 8560 suspect records
other inconsistencies 1 suspect record
annotations 156 annotations
locality data
inventory country - state - municipality
name of the country/state 442 suspect records
outlier 12 suspect records
long/lat outside world limits 3 suspect records
equal long/lat not found
long or lat equal to zero 10 suspect records
long/lat in the sea (Brazil) 136 suspect records
municipality name (Brazil) 3949 suspect records
coordinate unit analysis (Brazil) 724 suspect records
other inconsistencies 92 suspect records
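
The simplest coordinate checks listed above (values outside world limits, a zero coordinate, equal longitude and latitude) can be sketched as follows. This is an illustration only: the actual rules used by dataCleaning are not given on this page, and the function name `flag_coordinates` and the flag labels are hypothetical. The checks that depend on reference data (sea, municipality, outliers) are not covered.

```python
from typing import List, Optional


def flag_coordinates(lat: Optional[float], lon: Optional[float]) -> List[str]:
    """Return labels for the simple coordinate checks a record fails."""
    if lat is None or lon is None:
        # Nothing to test: the record is not georeferenced.
        return ["without_coordinates"]
    flags = []
    # Latitude must lie in [-90, 90] and longitude in [-180, 180].
    if not (-90.0 <= lat <= 90.0) or not (-180.0 <= lon <= 180.0):
        flags.append("outside_world_limits")
    # A zero often signals a missing value stored as 0 (assumption).
    if lat == 0.0 or lon == 0.0:
        flags.append("zero_coordinate")
    # Identical values may indicate a copy error (assumption).
    if lat == lon:
        flags.append("equal_lat_lon")
    return flags
```

A record can carry several flags at once; for example, `flag_coordinates(0.0, 0.0)` fails both the zero check and the equality check. As with the tool itself, a flag only marks the record as suspect and leaves the decision to the curator.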

date collected
collected before 1800 4 suspect records
identification year earlier than date collected 79 suspect records
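
The two date checks can be sketched in the same spirit, assuming each record exposes a collection year and an identification year as integers (or nothing when the field is blank). The function name `flag_dates` and the flag labels are hypothetical, and the real tool may compare full dates rather than years.

```python
from typing import List, Optional


def flag_dates(year_collected: Optional[int],
               year_identified: Optional[int]) -> List[str]:
    """Return labels for the date checks a record fails."""
    flags = []
    # Specimens dated before 1800 are flagged for review.
    if year_collected is not None and year_collected < 1800:
        flags.append("collected_before_1800")
    # A specimen cannot be identified before it was collected.
    if (year_collected is not None and year_identified is not None
            and year_identified < year_collected):
        flags.append("identified_before_collected")
    return flags
```

Blank fields are skipped rather than flagged, so a record with no identification year passes the second check.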
suggestions for blank fields
long/lat (Brazil) 131168 suggestions  
country/state name 15 suggestions
municipality name (Brazil) 88 suggestions

Centro de Referência em Informação Ambiental, CRIA