dataCleaning
This tool aims to help curators identify possible errors and standardize data. Records are not modified: the system only presents "suspect" records and recommends that each one be checked by its author or curator. The tool is under constant development, so suggestions are more than welcome.

collection:    RPSP
total number of records online 113948 
- without coordinates 42030 
- georeferenced 71918 
- access to georeferenced data denied
- in the sea 560 
- blank catalognumber 6479 
repeated records
catalog number 44142 
duplicate records 41611 
collector's name and number
last update
of the collection: 04-05-2026
of dataCleaning: 06-05-2026
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family not found
genus not found
species not found
subspecies 339 suspect records
author 1551 suspect records
duplicate not found
other inconsistencies 1 suspect record
annotations 4 annotations
locality data
inventory country - state - municipality
name of the country/state 579 suspect records
outlier 1 suspect record
long/lat outside the world limit 100 suspect records
equal long/lat not found
long or lat equal to zero 176 suspect records
long/lat in the sea (Brazil) 319 suspect records
municipality name (Brazil) 13107 suspect records
coordinate unit analysis (Brazil) 33 suspect records
other inconsistencies 664 suspect records
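
As an illustration of what the simpler coordinate checks in this list might test, here is a minimal Python sketch. It is not the tool's actual implementation: the function name `coordinate_flags` and the exact rules are assumptions inferred only from the check labels above, with coordinates taken to be decimal degrees. The Brazil-specific checks (sea, municipality, coordinate unit) need reference data and are not covered.

```python
def coordinate_flags(lon, lat):
    """Return the labels of the checks that a (lon, lat) pair would trip.

    Hypothetical helper: the rules are guesses based on the check names,
    assuming longitude and latitude are given in decimal degrees.
    """
    flags = []
    # Valid longitudes lie in [-180, 180], valid latitudes in [-90, 90].
    if not (-180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0):
        flags.append("long/lat outside the world limit")
    # Identical values often indicate a copy/paste or data-entry slip.
    if lon == lat:
        flags.append("equal long/lat")
    # A zero often stands in for a missing value.
    if lon == 0 or lat == 0:
        flags.append("long or lat equal to zero")
    return flags


print(coordinate_flags(-47.81, -21.17))  # a plausible point in Brazil
print(coordinate_flags(200.0, 10.0))     # longitude out of range
print(coordinate_flags(0.0, 0.0))        # trips two checks at once
```

A flagged record is only "suspect": a coordinate of exactly zero can be legitimate, so the result is meant to prompt a manual check, not an automatic correction.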

date collected
collected before 1930 145 suspect records
identification year earlier than date collected 372 suspect records
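
The two date checks can be sketched in the same spirit. Again, this is only an assumption about how such rules could work, not the tool's code: `date_flags`, its parameters, and the flag names are hypothetical, and only years are compared.

```python
def date_flags(year_collected, year_identified=None, cutoff_year=1930):
    """Return flag names for suspicious collection or identification years.

    Hypothetical helper: the 1930 cutoff comes from the check label above;
    everything else is an assumption made for illustration.
    """
    flags = []
    # Very old collection dates are listed for review, not rejected.
    if year_collected < cutoff_year:
        flags.append("collected_before_cutoff")
    # A specimen cannot be identified before it was collected.
    if year_identified is not None and year_identified < year_collected:
        flags.append("identified_before_collected")
    return flags


print(date_flags(1925))        # old record, no identification year
print(date_flags(1998, 1995))  # identified three years before collection
print(date_flags(1998, 2001))  # nothing suspicious
```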
suggestions for blank fields
long/lat (Brazil) 21731 suggestions  
country/state name 370 suggestions
municipality name (Brazil) 27 suggestions

Centro de Referência em Informação Ambiental, CRIA