dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    HTSA
total number of records online 9010 
- without coordinates 3388 
- georeferenced 5622 
- access to georeferenced data denied
- in the sea 98 
- blank catalognumber 10 
  smaller: 1   larger: 9001 [ gap ]  
repeated records
catalog number 10 
duplicate records 3 
collector's name and number 228 
last update  -  error logs
of the collection:  07-11-2025 of dataCleaning:  08-11-2025
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family not found
genus 4 suspect records
species not found
subspecies not found
author 206 suspect records
duplicate 1535 suspect records
other inconsistencies not found
annotations 23 annotations
locality data
inventory country - state - municipality
name of the country/state 1117 suspect records
outlier 4 suspect records
long/lat outside the world limit not found
equal long/lat not found
long or lat equal to zero not found
long/lat in the sea (Brazil) 1 suspect records
municipality name (Brazil) 2636 suspect records
coordinate unit analysis (Brazil) 15 suspect records
other inconsistencies 3 suspect records

date collected
collected before 1930 not found
identification year previous to date collected 4 suspect records
suggestions for blank fields
long/lat (Brazil) 3069 suggestions  
country/state name 2 suggestions
municipality name (Brazil) 66 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA