dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    US
total number of records online 1049991 
- without coordinates 750266 
- georeferenced 299725 
- access to georeferenced data denied
- in the sea 7592 
- blank catalognumber 61407 
  [ susp ]
repeated records
catalog number 79837 
duplicate records 2316 
collector's name and number 134242 
last update  -  error logs
of the collection:  15-01-2024 of dataCleaning:  11-02-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family not found
genus 1059 suspect records
species 3117 suspect records
subspecies 61 suspect records
author 26654 suspect records
duplicate 97359 suspect records
other inconsistencies 46 suspect records
annotations 643 annotations
locality data
inventory country - state - municipality
name of the country/state 47809 suspect records
outlier 9 suspect records
long/lat outside the world limit not found
equal long/lat 61 suspect records
long or lat equal to zero 178 suspect records
long/lat in the sea (Brazil) 1747 suspect records
municipality name (Brazil) 15457 suspect records
coordinate unit analysis (Brazil) 84 suspect records
other inconsistencies 79 suspect records

date collected
collected before 1760 3 suspect records
identification year previous to date collected not found
suggestions for blank fields
long/lat (Brazil) 47962 suggestions  
country/state name 30036 suggestions
municipality name (Brazil) 22084 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA