dataCleaning
This tool helps curators identify possible errors and standardize data. Records are not modified: the system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so any suggestion is more than welcome.

collection:    CPAP
total number of records online 24954 
- without coordinates 5067 
- georeferenced 19887 
- access to georeferenced data denied
- in the sea 134 
- blank catalognumber 24 
  smallest: 1   largest: 25714 [ gap ]
repeated records
catalog number 36 
duplicate records
collector's name and number 159 
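
The catalog number checks listed above (blank values, smallest and largest number, gaps, repeated numbers) can be illustrated with a minimal sketch. This is not the tool's own code: the function name `catalog_number_report` is hypothetical, and it assumes that a "gap" means an integer missing between the smallest and largest catalog number, which this page does not define.

```python
from collections import Counter


def catalog_number_report(catalog_numbers):
    """Summarize blank, repeated, and missing catalog numbers.

    Hypothetical helper; only purely numeric catalog numbers are
    considered for the range, repeat, and gap checks.
    """
    cleaned = ["" if c is None else str(c).strip() for c in catalog_numbers]
    blank = sum(1 for c in cleaned if c == "")
    numeric = [int(c) for c in cleaned if c.isdigit()]

    counts = Counter(numeric)
    repeated = sorted(n for n, k in counts.items() if k > 1)

    if numeric:
        smallest, largest = min(numeric), max(numeric)
        present = set(numeric)
        # Assumption: a gap is any integer in the range with no record.
        gaps = [n for n in range(smallest, largest + 1) if n not in present]
    else:
        smallest = largest = None
        gaps = []

    return {
        "blank": blank,
        "smallest": smallest,
        "largest": largest,
        "repeated": repeated,
        "gaps": gaps,
    }


report = catalog_number_report(["1", "2", "2", "", "5", None])
print(report)
```

As in the tool itself, the sketch only reports suspect values and leaves the records unchanged.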
last update
- of the collection: 04-04-2025
- of dataCleaning: 05-04-2025
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family 104 suspect records
genus 128 suspect records
species 255 suspect records
subspecies not found
author 3768 suspect records
duplicate 6 suspect records
other inconsistencies 2 suspect records
annotations 26 annotations
locality data
inventory country - state - municipality
name of the country/state 116 suspect records
outlier 153 suspect records
long/lat outside the world limit 2 suspect records
equal long/lat not found
long or lat equal to zero 165 suspect records
long/lat in the sea (Brazil) 127 suspect records
municipality name (Brazil) 699 suspect records
coordinate unit analysis (Brazil) 9 suspect records
other inconsistencies 285 suspect records
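
Three of the coordinate checks above (outside the world limit, equal long/lat, long or lat equal to zero) are simple enough to sketch. The function `coordinate_flags` below is a hypothetical illustration, not the tool's implementation. It assumes decimal degrees, with longitude in [-180, 180] and latitude in [-90, 90]. Checks that need reference data, such as points in the sea or municipality names, are left out.

```python
def coordinate_flags(lon, lat):
    """Return a list of reasons a coordinate pair looks suspect.

    Hypothetical helper; assumes decimal degrees. An empty list means
    none of these simple checks fired, not that the point is correct.
    """
    if lon is None or lat is None:
        return ["without coordinates"]

    flags = []
    if not (-180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0):
        flags.append("outside the world limit")
    if lon == 0 or lat == 0:
        flags.append("long or lat equal to zero")
    if lon == lat:
        flags.append("equal long/lat")
    return flags


# Illustrative values only, not records from the collection.
for pair in [(-47.9, -15.8), (200.0, 10.0), (0.0, -15.8), (-20.0, -20.0)]:
    print(pair, coordinate_flags(*pair))
```

A single record can raise more than one flag: the point (0, 0) is both "equal long/lat" and "long or lat equal to zero".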

date collected
collected before 1930 3 suspect records
identification year earlier than date collected 33 suspect records
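
The two date checks can be sketched the same way. The function `date_flags` is hypothetical and compares years only. The 1930 cutoff is taken from the list above, and the parameter name `earliest_year` is an assumption made for illustration.

```python
def date_flags(year_collected, year_identified, earliest_year=1930):
    """Return a list of reasons a record's dates look suspect.

    Hypothetical helper; missing years (None) are skipped, not flagged.
    """
    flags = []
    if year_collected is not None and year_collected < earliest_year:
        flags.append(f"collected before {earliest_year}")
    if (
        year_collected is not None
        and year_identified is not None
        and year_identified < year_collected
    ):
        flags.append("identified before collected")
    return flags


# Illustrative values only, not records from the collection.
print(date_flags(1925, 1990))
print(date_flags(2001, 1999))
print(date_flags(2001, 2005))
```

A flagged record is not necessarily wrong, since a collection may hold genuinely old specimens, which is why the tool only recommends that a curator check it.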
suggestions for blank fields
long/lat (Brazil) 4479 suggestions  
country/state name 31 suggestions
municipality name (Brazil) not found

Centro de Referência em Informação Ambiental, CRIA