dataCleaning
This tool aims to help curators identify possible errors and standardize data. Records are not modified. The system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so suggestions are more than welcome.

Select a collection 
collection:    ESA
total number of records online 153897 
- without coordinates 141946 
- georeferenced 11951 
- access to georeferenced data denied
- in the sea 3555 
- blank catalognumber
repeated records
catalog number 176 
duplicate records 6 
collector's name and number 5965 
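As an illustration only, the repeated-record counts above could come from grouping records on a key and keeping the groups with more than one member. The sketch below is not the tool's actual implementation; the function name `repeated_groups` and the field names `catalognumber`, `collector`, and `collectornumber` are assumptions made for the example.

```python
from collections import defaultdict

def repeated_groups(records, key_fields):
    """Group record indexes by a normalized key and keep groups with repeats.

    `records` is a list of dicts; `key_fields` names the fields that form
    the key (hypothetical field names, chosen for this sketch).
    """
    groups = defaultdict(list)
    for idx, rec in enumerate(records):
        # Normalize each key field: trim whitespace and ignore case.
        key = tuple(str(rec.get(f) or "").strip().lower() for f in key_fields)
        # Skip records where any key field is blank.
        if all(key):
            groups[key].append(idx)
    return {k: v for k, v in groups.items() if len(v) > 1}

records = [
    {"catalognumber": "ESA001", "collector": "Silva, J.", "collectornumber": "12"},
    {"catalognumber": "esa001 ", "collector": "Souza, M.", "collectornumber": "7"},
    {"catalognumber": "ESA002", "collector": "silva, j.", "collectornumber": "12"},
    {"catalognumber": "", "collector": "", "collectornumber": ""},
]

by_catalog = repeated_groups(records, ["catalognumber"])
by_collector = repeated_groups(records, ["collector", "collectornumber"])
```

Here `by_catalog` groups records 0 and 1, and `by_collector` groups records 0 and 2. A real check would likely need a more careful normalization of collector names.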
last update  -  error logs
of the collection: 22-03-2024
of dataCleaning: 25-03-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family 156 suspect records
genus 1311 suspect records
species 8033 suspect records
subspecies not found
author 78987 suspect records
duplicate 19192 suspect records
other inconsistencies 8 suspect records
annotations 1157 annotations
locality data
inventory country - state - municipality
name of the country/state 392 suspect records
outlier 958 suspect records
long/lat outside the world limit 7 suspect records
equal long/lat 3 suspect records
long or lat equal to zero 1573 suspect records
long/lat in the sea (Brazil) 3265 suspect records
municipality name (Brazil) 2160 suspect records
coordinate unit analysis (Brazil) 1368 suspect records
other inconsistencies 554 suspect records
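Three of the coordinate checks listed above (outside the world limit, equal long/lat, and long or lat equal to zero) are simple enough to sketch. The following is a minimal sketch under assumed rules, not the tool's actual code; the function name `coordinate_flags` and the flag strings are hypothetical, and coordinates are assumed to be decimal degrees.

```python
from typing import List, Optional

def coordinate_flags(lon: Optional[float], lat: Optional[float]) -> List[str]:
    """Return suspicion flags for one record's coordinates (decimal degrees)."""
    if lon is None or lat is None:
        return ["without coordinates"]
    flags = []
    # Valid longitude is [-180, 180]; valid latitude is [-90, 90].
    if not (-180.0 <= lon <= 180.0) or not (-90.0 <= lat <= 90.0):
        flags.append("outside world limit")
    # Identical values often indicate a copy/paste or data-entry error.
    if lon == lat:
        flags.append("equal long/lat")
    # A zero frequently stands in for a missing value.
    if lon == 0.0 or lat == 0.0:
        flags.append("long or lat equal to zero")
    return flags
```

A record can receive more than one flag. Checks such as "in the sea" or "outlier" need reference geometry or the distribution of other records and are not covered by this sketch.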

date collected
collected before 1871 1 suspect record
identification year previous to date collected 14543 suspect records
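The two date checks above can be sketched as simple year comparisons. This is an illustrative sketch only; the function name `date_flags`, the flag strings, and the assumption that years are available as integers are not taken from the tool. The 1871 threshold is the one shown in the list.

```python
from typing import List, Optional

EARLIEST_YEAR = 1871  # threshold shown in the list above

def date_flags(year_collected: Optional[int],
               year_identified: Optional[int]) -> List[str]:
    """Return suspicion flags for one record's collection and identification years."""
    flags = []
    if year_collected is not None and year_collected < EARLIEST_YEAR:
        flags.append("collected before 1871")
    # A specimen cannot be identified before it was collected.
    if (year_collected is not None and year_identified is not None
            and year_identified < year_collected):
        flags.append("identified before collected")
    return flags
```

Records with a missing year receive no flag for the check that needs it.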
suggestions for blank fields
long/lat (Brazil) 123419 suggestions  
country/state name not found
municipality name (Brazil) 25 suggestions

Centro de Referência em Informação Ambiental, CRIA