dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    CGMS
total number of records online 90841 
- without coordinates 24323 
- georeferenced 66518 
- access to georeferenced data denied
- in the sea 1781 
- blank catalognumber
  smaller: 1   larger: 90658 [ gap ]   [ susp ]
repeated records
catalog number 552 
duplicate records 18 
collector's name and number
last update  -  error logs
of the collection:  01-03-2024 of dataCleaning:  04-03-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom 2451 suspect records
family not found
genus 57 suspect records
species 309 suspect records
subspecies not found
author 561 suspect records
duplicate 4321 suspect records
other inconsistencies 44 suspect records
annotations 45 annotations
locality data
inventory country - state - municipality
name of the country/state 2392 suspect records
outlier 98 suspect records
long/lat outside the world limit not found
equal long/lat 1 suspect records
long or lat equal to zero 21 suspect records
long/lat in the sea (Brazil) 59 suspect records
municipality name (Brazil) 5440 suspect records
coordinate unit analysis (Brazil) 234 suspect records
other inconsistencies 2259 suspect records

date collected
collected before 1912 not found
identification year previous to date collected 4 suspect records
suggestions for blank fields
long/lat (Brazil) 21253 suggestions  
country/state name 83 suggestions
municipality name (Brazil) not found

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA