dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    NPM
total number of records online 7450 
- without coordinates 1560 
- georeferenced 5890 
- access to georeferenced data denied
- in the sea 4576 
- blank catalognumber
  smaller: 1   larger: 7450  
repeated records
catalog number
duplicate records
collector's name and number 6 
last update  -  error logs
of the collection:  24-04-2024 of dataCleaning:  25-04-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family not found
genus not found
species not found
subspecies not found
author not found
duplicate not found
other inconsistencies not found
annotations 0 annotations
locality data
inventory country - state - municipality
name of the country/state 6 suspect records
outlier 65 suspect records
long/lat outside the world limit 1 suspect records
equal long/lat 2 suspect records
long or lat equal to zero 2 suspect records
long/lat in the sea (Brazil) 90 suspect records
municipality name (Brazil) 88 suspect records
coordinate unit analysis (Brazil) 27 suspect records
other inconsistencies 34 suspect records

date collected
collected before 1930 5 suspect records
identification year previous to date collected not found
suggestions for blank fields
long/lat (Brazil) 1043 suggestions  
country/state name 25 suggestions
municipality name (Brazil) 20 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA