dataCleaning
português
This tool aims at helping curators in identifying possible errors and to standardize data. Records are not modified. The system just presents "suspect" records, recommending that they be checked by each author or curator. The tool is under constant development, so any suggestion is more then welcome.

Select a collection 
collection:    MBML-Herbario
total number of records online 56614 
- without coordinates 7560 
- georeferenced 49054 
- access to georeferenced data denied
- in the sea 185 
- blank catalognumber
  smaller: 1   larger: 56614  
repeated records
catalog number
duplicate records
collector's name and number 2028 
last update  -  error logs
of the collection:  07-03-2024 of dataCleaning:  08-03-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom 1 suspect records
family not found
genus 131 suspect records
species 119 suspect records
subspecies 21 suspect records
author 19711 suspect records
duplicate 6294 suspect records
other inconsistencies not found
annotations 522 annotations
locality data
inventory country - state - municipality
name of the country/state 447 suspect records
outlier 18 suspect records
long/lat outside the world limit not found
equal long/lat not found
long or lat equal to zero 3 suspect records
long/lat in the sea (Brazil) 14 suspect records
municipality name (Brazil) 1564 suspect records
coordinate unit analysis (Brazil) 131 suspect records
other inconsistencies 2 suspect records

date collected
collected before 1905 not found
identification year previous to date collected 42 suspect records
suggestions for blank fields
long/lat (Brazil) 7104 suggestions  
country/state name 4 suggestions
municipality name (Brazil) 2 suggestions

search
dataCleaning
email
Centro de Referência em Informação Ambiental, CRIA