dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    RECOLNAT_MNHN_P
total number of records online 261318 
- without coordinates 236785 
- georeferenced 24533 
- access to georeferenced data denied
- in the sea 1308 
- blank catalognumber
  [ susp ]
repeated records
catalog number
duplicate records
collector's name and number
last update  -  error logs
of the collection:  25-04-2024 of dataCleaning:  03-05-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom 261318 suspect records
family 407 suspect records
genus 318 suspect records
species 1039 suspect records
subspecies not found
author 6891 suspect records
duplicate not found
other inconsistencies 127 suspect records
annotations 371 annotations
locality data
inventory country - state - municipality
name of the country/state 1953 suspect records
outlier 11 suspect records
long/lat outside the world limit 11 suspect records
equal long/lat 1 suspect records
long or lat equal to zero 532 suspect records
long/lat in the sea (Brazil) 152 suspect records
municipality name (Brazil) 2503 suspect records
coordinate unit analysis (Brazil) 17 suspect records
other inconsistencies 1069 suspect records

date collected
collected before 1767 not found
identification year previous to date collected 124 suspect records
suggestions for blank fields
long/lat (Brazil) 21927 suggestions  
country/state name 11643 suggestions
municipality name (Brazil) 1714 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA