dataCleaning
This tool helps curators identify possible errors and standardize data. Records are not modified: the system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so any suggestion is more than welcome.

collection:    HEPH
total number of records online 37566 
- without coordinates 5450 
- georeferenced 32116 
- access to georeferenced data denied
- in the sea 104 
- blank catalognumber 25 
  smallest: 1   largest: 38530 [ gap ]
repeated records
catalog number 1181 
duplicate records 884 
collector's name and number 467 
last update  -  error logs
of the collection: 16-10-2023
of dataCleaning: 17-10-2023
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family not found
genus not found
species not found
subspecies not found
author 2 suspect records
duplicate 2758 suspect records
other inconsistencies not found
annotations 37 annotations
locality data
inventory country - state - municipality
name of the country/state 1682 suspect records
outlier 181 suspect records
long/lat outside the world limit not found
equal long/lat 16 suspect records
long or lat equal to zero 9 suspect records
long/lat in the sea (Brazil) 32 suspect records
municipality name (Brazil) 5345 suspect records
coordinate unit analysis (Brazil) 182 suspect records
other inconsistencies not found
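
The simplest coordinate checks listed above (outside the world limit, equal long/lat, long or lat equal to zero) could be sketched as follows. This is only an illustration of what the check names suggest, not the tool's actual rules, which this page does not describe; the function name `coordinate_flags` and the use of decimal degrees are assumptions.

```python
def coordinate_flags(lon, lat):
    """Return the names of the simple coordinate checks a record fails.

    Hypothetical helper: assumes lon/lat are decimal degrees.
    """
    flags = []
    # Valid longitudes lie in [-180, 180] and valid latitudes in [-90, 90].
    if not (-180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0):
        flags.append("long/lat outside the world limit")
    # Identical values often indicate a copy/paste or data-entry error.
    if lon == lat:
        flags.append("equal long/lat")
    # A zero is frequently a placeholder for a missing value.
    if lon == 0 or lat == 0:
        flags.append("long or lat equal to zero")
    return flags
```

A record flagged this way is only "suspect": a specimen can legitimately sit on the equator or the prime meridian, so each hit still needs a curator's review.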

date collected
collected before 1921 not found
identification year previous to date collected 8 suspect records
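
The two date checks above can be illustrated with a minimal sketch. The function name `date_flags`, its parameters, and the comparison by calendar year are assumptions made for illustration; the page does not say how the tool implements these checks.

```python
from datetime import date


def date_flags(collected, year_identified=None, earliest_year=1921):
    """Return the names of the date checks a record fails.

    Hypothetical helper: `collected` is a datetime.date,
    `year_identified` is an int or None when unknown.
    """
    flags = []
    # Collection dates earlier than the cutoff year are treated as suspect.
    if collected.year < earliest_year:
        flags.append(f"collected before {earliest_year}")
    # A specimen cannot be identified before it was collected.
    if year_identified is not None and year_identified < collected.year:
        flags.append("identification year previous to date collected")
    return flags
```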
suggestions for blank fields
long/lat (Brazil) 4370 suggestions  
country/state name 15 suggestions
municipality name (Brazil) 241 suggestions

Centro de Referência em Informação Ambiental, CRIA