dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    INPA-Herbario
total number of records online 269445 
- without coordinates 41252 
- georeferenced 228193 
- access to georeferenced data denied
- in the sea 1088 
- blank catalognumber 1305 
  smaller: 1   larger: 2748602 [ gap ]  
repeated records
catalog number 9564 
duplicate records 2050 
collector's name and number
last update  -  error logs
of the collection:  07-02-2024 of dataCleaning:  08-02-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family not found
genus 179 suspect records
species 380 suspect records
subspecies not found
author 4900 suspect records
duplicate not found
other inconsistencies not found
annotations 788 annotations
locality data
inventory country - state - municipality
name of the country/state 11256 suspect records
outlier 310 suspect records
long/lat outside the world limit not found
equal long/lat 1 suspect records
long or lat equal to zero 844 suspect records
long/lat in the sea (Brazil) 574 suspect records
municipality name (Brazil) 56296 suspect records
coordinate unit analysis (Brazil) 1701 suspect records
other inconsistencies 838 suspect records

date collected
collected before 1840 not found
identification year previous to date collected 880 suspect records
suggestions for blank fields
long/lat (Brazil) 20821 suggestions  
country/state name 4394 suggestions
municipality name (Brazil) 28554 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA