dataCleaning
This tool helps curators identify possible errors and standardize data. Records are not modified: the system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so suggestions are more than welcome.

Select a collection 
collection:    HUEFS
total number of records online 292208 
- without coordinates 83569 
- georeferenced 208639 
- access to georeferenced data denied
- in the sea 7164 
- blank catalognumber
  catalog number range: smallest 0, largest 296549
repeated records
catalog number
duplicate records
collector's name and number 10767 
last update of the collection: 03-06-2026; of dataCleaning: 07-06-2026
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom 45264 suspect records
family not found
genus 73 suspect records
species 968 suspect records
subspecies not found
author 62861 suspect records
duplicate 37476 suspect records
other inconsistencies 264 suspect records
annotations 2392 annotations
locality data
inventory country - state - municipality
name of the country/state 4565 suspect records
outlier 11 suspect records
long/lat outside the world limit not found
equal long/lat 6 suspect records
long or lat equal to zero 2403 suspect records
long/lat in the sea (Brazil) 2910 suspect records
municipality name (Brazil) 34802 suspect records
coordinate unit analysis (Brazil) 564 suspect records
other inconsistencies 529 suspect records
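
The three purely numeric coordinate checks listed above (outside the world limit, equal long/lat, long or lat equal to zero) can be illustrated with a minimal sketch. This is not the tool's actual implementation: the function name `coordinate_flags` and the use of decimal degrees are assumptions made for the example, and checks that need reference data (sea, municipality, outlier) are left out.

```python
from typing import List


def coordinate_flags(lon: float, lat: float) -> List[str]:
    """Return the names of the simple coordinate checks a record fails.

    Hypothetical helper for illustration only; assumes decimal degrees.
    """
    flags: List[str] = []
    # Valid longitudes lie in [-180, 180] and latitudes in [-90, 90].
    if not (-180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0):
        flags.append("long/lat outside the world limit")
    # Identical values often indicate one field copied into the other.
    if lon == lat:
        flags.append("equal long/lat")
    # A zero often stands for a missing value rather than a real position.
    if lon == 0.0 or lat == 0.0:
        flags.append("long or lat equal to zero")
    return flags
```

A flagged record is only a suspect: a specimen collected exactly on the equator would also be reported, which is why the records are presented for checking rather than modified.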

date collected
collected before 1807 4 suspect records
identification year previous to date collected 395 suspect records
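
As a rough sketch of the two date checks above, assuming each record carries an optional collection year and identification year (the function and parameter names are hypothetical, not taken from the tool):

```python
from typing import List, Optional

# Threshold year taken from the check label "collected before 1807".
EARLIEST_YEAR = 1807


def date_flags(year_collected: Optional[int],
               year_identified: Optional[int]) -> List[str]:
    """Return the names of the date checks a record fails.

    Hypothetical helper for illustration only; blank years are skipped.
    """
    flags: List[str] = []
    if year_collected is not None and year_collected < EARLIEST_YEAR:
        flags.append("collected before 1807")
    # A specimen cannot be identified before it was collected.
    if (year_collected is not None and year_identified is not None
            and year_identified < year_collected):
        flags.append("identification year previous to date collected")
    return flags
```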
suggestions for blank fields
long/lat (Brazil) 69346 suggestions  
country/state name 1316 suggestions
municipality name (Brazil) 2359 suggestions

Centro de Referência em Informação Ambiental, CRIA