dataCleaning
This tool helps curators identify possible errors and standardize data. Records are not modified: the system only presents "suspect" records and recommends that each author or curator check them. The tool is under constant development, so any suggestion is more than welcome.

collection:    MAR
total number of records online 14000 
- without coordinates 487 
- georeferenced 13513 
- access to georeferenced data denied
- in the sea 516 
- blank catalognumber
  smallest: 1   largest: 15004
repeated records
catalog number
duplicate records
collector's name and number 809 
last update
of the collection: 08-04-2024
of dataCleaning: 09-04-2024
geographic distribution of the specimens

collection profile
dataCleaning statistics
geographic coordinates analysis

taxonomic data
inventory scientific name - collector - identifier - types
kingdom not found
family 25 suspect records
genus 2 suspect records
species 14 suspect records
subspecies not found
author 1138 suspect records
duplicate 554 suspect records
other inconsistencies 1 suspect record
annotations 1 annotation
locality data
inventory country - state - municipality
name of the country/state 80 suspect records
outlier 235 suspect records
long/lat outside the world limit not found
equal long/lat 1 suspect record
long or lat equal to zero 367 suspect records
long/lat in the sea (Brazil) 127 suspect records
municipality name (Brazil) 376 suspect records
coordinate unit analysis (Brazil) not found
other inconsistencies 9 suspect records
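
The simpler coordinate checks listed above can be sketched as follows. This is a minimal illustration only: the page does not describe how dataCleaning implements its checks, so the function name `coordinate_flags`, the world limits used, and the exact comparisons are all assumptions. Checks that need reference data (in the sea, outlier, municipality name) are left out.

```python
from typing import List, Optional


def coordinate_flags(lon: Optional[float], lat: Optional[float]) -> List[str]:
    """Return the names of the simple coordinate checks a record fails.

    Hypothetical helper; not the tool's actual code.
    """
    # A record with a missing value cannot be tested any further.
    if lon is None or lat is None:
        return ["without coordinates"]

    flags: List[str] = []
    # Assumed world limits: longitude in [-180, 180], latitude in [-90, 90].
    if not (-180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0):
        flags.append("long/lat outside the world limit")
    # A zero in either field often means an empty value stored as 0.
    if lon == 0.0 or lat == 0.0:
        flags.append("long or lat equal to zero")
    # Identical values suggest one field was copied into the other.
    if lon == lat:
        flags.append("equal long/lat")
    return flags
```

A record can fail more than one check at once, so the function returns a list of flags rather than a single label.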

date collected
collected before 1930 1 suspect record
identification year earlier than date collected 5 suspect records
suggestions for blank fields
long/lat (Brazil) 326 suggestions  
country/state name 9 suggestions
municipality name (Brazil) 315 suggestions

Centro de Referência em Informação Ambiental, CRIA