dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    HRCB
total number of records online 42125 
- without coordinates 25055 
- georeferenced 17070 
- access to georeferenced data denied
- in the sea 666 
- blank catalognumber
  smaller: 1   larger: 547834 [ gap ]  
repeated records
catalog number 514 
duplicate records 6 
collector's name and number 1178 
last update  -  error logs
of the collection:  09-02-2024 of dataCleaning:  12-02-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family 6 suspect records
genus 3 suspect records
species 1179 suspect records
subspecies 53 suspect records
author 9511 suspect records
duplicate 6301 suspect records
other inconsistencies 9 suspect records
annotations 79 annotations
locality data
inventory country - state - municipality
name of the country/state 556 suspect records
outlier 15 suspect records
long/lat outside the world limit 1 suspect records
equal long/lat not found
long or lat equal to zero 534 suspect records
long/lat in the sea (Brazil) 319 suspect records
municipality name (Brazil) 2264 suspect records
coordinate unit analysis (Brazil) 28 suspect records
other inconsistencies 90 suspect records

date collected
collected before 1888 not found
identification year previous to date collected 84 suspect records
suggestions for blank fields
long/lat (Brazil) 21896 suggestions  
country/state name 26 suggestions
municipality name (Brazil) 323 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA