dataCleaning
This tool helps curators identify possible errors and standardize data. Records are not modified. The system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so any suggestion is more than welcome.

collection:    UB
total number of records online 276621 
- without coordinates 75966 
- georeferenced 200655 
- access to georeferenced data denied
- in the sea 6712 
- blank catalognumber 69860 
  smallest: 1   largest: 241138 [ gap ]
repeated records
catalog number 70702 
duplicate records 4046 
collector's name and number
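
The catalog number counts above (blank values, repeated values, smallest and largest number, gaps) can be illustrated with a minimal sketch. It is not the tool's actual implementation: the function name `catalog_number_report`, the input format (a list of catalog number strings) and the reading of "gap" as a run of missing integers between the smallest and largest numeric catalog number are all assumptions.

```python
from collections import Counter
from typing import Dict, Iterable, Optional


def catalog_number_report(catalog_numbers: Iterable[Optional[str]]) -> Dict[str, object]:
    """Summarize blank, repeated and missing catalog numbers (hypothetical helper)."""
    # Treat None and whitespace-only values as blank.
    values = [(c or "").strip() for c in catalog_numbers]
    blank = sum(1 for v in values if not v)

    # Count how often each non-blank catalog number occurs.
    counts = Counter(v for v in values if v)
    repeated = sorted(v for v, n in counts.items() if n > 1)

    # Only purely numeric catalog numbers take part in the range and gap analysis.
    numeric = sorted({int(v) for v in counts if v.isdecimal()})
    gaps = [(a + 1, b - 1) for a, b in zip(numeric, numeric[1:]) if b - a > 1]

    return {
        "blank": blank,
        "repeated": repeated,
        "smallest": numeric[0] if numeric else None,
        "largest": numeric[-1] if numeric else None,
        "gaps": gaps,  # each gap is an inclusive (first_missing, last_missing) pair
    }


report = catalog_number_report(["1", "2", "2", "", None, "5", "UB-7"])
```

As in the tool itself, nothing is modified here: the report only points at values a curator may want to check.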
last update of the collection: 01-02-2024
last update of dataCleaning: 18-02-2024
error logs
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom 12867 suspect records
family not found
genus 2 suspect records
species 444 suspect records
subspecies 102 suspect records
author 2583 suspect records
duplicate 22698 suspect records
other inconsistencies 7 suspect records
annotations 411 annotations
locality data
inventory country - state - municipality
name of the country/state 11212 suspect records
outlier not found
long/lat outside the world limit 2 suspect records
equal long/lat 8 suspect records
long or lat equal to zero 12 suspect records
long/lat in the sea (Brazil) 1028 suspect records
municipality name (Brazil) 30790 suspect records
coordinate unit analysis (Brazil) 1200 suspect records
other inconsistencies 13059 suspect records
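
Three of the coordinate checks listed above (outside the world limit, equal long/lat, long or lat equal to zero) are simple enough to sketch. The following is an illustration under assumptions, not the tool's code: the function name `coordinate_flags` is hypothetical, coordinates are assumed to be decimal degrees, and the world limit is assumed to be longitude in [-180, 180] and latitude in [-90, 90].

```python
from typing import List, Optional


def coordinate_flags(lon: Optional[float], lat: Optional[float]) -> List[str]:
    """Return the names of the simple coordinate checks that a record fails."""
    flags: List[str] = []
    # Records without coordinates are counted separately, so nothing is flagged.
    if lon is None or lat is None:
        return flags
    # Assumed world limit in decimal degrees.
    if not (-180.0 <= lon <= 180.0) or not (-90.0 <= lat <= 90.0):
        flags.append("outside world limit")
    # Identical values often indicate a copy or typing error.
    if lon == lat:
        flags.append("equal long/lat")
    # A zero is frequently a placeholder for a missing value.
    if lon == 0.0 or lat == 0.0:
        flags.append("long or lat equal to zero")
    return flags
```

A record can fail more than one check, for example a point at 0, 0. The checks marked "(Brazil)", such as coordinates in the sea or municipality names, need reference data (coastlines, gazetteers) and are not covered by this sketch.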

date collected
collected before 1800 not found
identification year earlier than date collected 171 suspect records
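
The two date checks above can be sketched in the same spirit. This is an assumption-laden illustration rather than the tool's implementation: the function name `date_flags` is hypothetical, and both dates are assumed to be available as plain integer years.

```python
from typing import List, Optional


def date_flags(year_collected: Optional[int], year_identified: Optional[int]) -> List[str]:
    """Return the names of the date checks that a record fails."""
    flags: List[str] = []
    # Very old collection years are unusual enough to deserve a second look.
    if year_collected is not None and year_collected < 1800:
        flags.append("collected before 1800")
    # A specimen cannot normally be identified before it was collected.
    if (
        year_collected is not None
        and year_identified is not None
        and year_identified < year_collected
    ):
        flags.append("identified before collected")
    return flags
```

Records with a missing year are left unflagged, since there is nothing to compare.
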
suggestions for blank fields
long/lat (Brazil) 27348 suggestions  
country/state name 3133 suggestions
municipality name (Brazil) not found

Centro de Referência em Informação Ambiental, CRIA